Question: how to correct gene expression based on tumor purity
gravatar for liu4gre
2.5 years ago by
United States
liu4gre200 wrote:

Hi all, I am looking at TCGA gene expression data. Also I am interested in tumor purity, which may be inferred by a few tools such as ABSOLUTE and ESTIMATE. Question is how to correct gene expression levels based on these inferred values?

ADD COMMENTlink modified 12 months ago by Kevin Blighe49k • written 2.5 years ago by liu4gre200

Hi have you found an answer to this? For differential expression analysis using RNAseq data, it seems the package DESeq2 has a function that allows you to correct for purity estimates. See this paper:

ADD REPLYlink modified 21 months ago • written 21 months ago by Alejandro Jimenez Sanchez120

Hi, I am wondering if you figured out the answer to your question? I have the same question with regards to how to apply the tumor purity value to the gene expression levels?? I was able to calculate the tumor purity for each TCGA case for my cancer of interest. But now I'm unsure as to how to apply it. Please do let me know if you were able to get a better understanding of how to work with tumor purity.

Thank you

ADD REPLYlink written 19 months ago by rummy.chowdhury0

I have same question

ADD REPLYlink modified 14 months ago • written 14 months ago by Shixiang40
gravatar for Kevin Blighe
12 months ago by
Kevin Blighe49k
Kevin Blighe49k wrote:

You just need to include it as a covariate in your design formula. While not directly modifying your data to adjust for the purity estimate, doing this will adjust the statistical inferences made from that data.



Edit: November 12, 2018:

Some evidence to back this:

"In conclusion, we have shown that the influence of tumour purity on the results of genomic analyses is much stronger than previously appreciated, and ought to be included as a covariate in any future analysis."


It is refuted here, though, and stated that purities estimate should be multiplicative:

"There are some practices to account for purity in differential expression (DE) analysis [46] by adding purities as a covariate in the linear model. As we will show, the purity should have a multiplicative effect instead of an additive effect."


ADD COMMENTlink modified 11 months ago • written 12 months ago by Kevin Blighe49k

But if the the purity value is numeric, how to add it into design formula? To classify samples as low, median and high?

Is there a method to adjust the gene expression by tumor purity?

ADD REPLYlink written 11 months ago by Chun-Jie Liu260

A covariate can be categorical or numeric. I am not aware of a program that directly adjusts for tumor purity (but one likely exists... somewhere).

ADD REPLYlink written 11 months ago by Kevin Blighe49k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1357 users visited in the last hour