Question: Interpretation problem at PCA plot for detecting batch effect
gravatar for modarzi
9 months ago by
modarzi130 wrote:

Hi, I have 74 cancer cases who their RNA-seq gene expression belongs to 14 batches. My exprData was normalized by FPKM method. After running prcomp() on my expression data, I plotted PC1 Vs PC2. my plot is available in this link. Now, I need to know more about the interpretation of that plot. does my exprData has batch effects? does it need to batch effect correction?

I appreciate it if anybody shares his/her comment with me.

batch effect rna-seq pca • 267 views
ADD COMMENTlink modified 9 months ago • written 9 months ago by modarzi130
gravatar for Martombo
9 months ago by
Seville, ES
Martombo2.6k wrote:

There are better ways to assess the presence of batch effects. First thing, FPKM is not a robust normalization method to compare different samples. Use the normalization methods of R packages like DESeq2 or limma-voom, it would then be more appropriate to look for a batch effect. You could also remove lowly expressed genes or select most variables ones for an unsupervised analysis like PCA, so as to remove some of the variability of the dataset. Finally, some ad-hoc methods can establish the presence of significant co-variates present in your data. See the sva package, for example.

ADD COMMENTlink written 9 months ago by Martombo2.6k

Thanks. about the first thing I have to say my data belongs to TCGA and I can also download HTseq-Count data. So should I have batch effect analysis on HTseq-Count data or normalize it by other methods in DESeq2 package? If yes, which normalization method is better?

For the second thing, you wrote it is better I remove lowly expressed genes. can I compute gene variance between the samples and remove genes by zero variance? do you recommend better ways?

Finally, I can't understand your mean about ad-hoc methods. could you give an example?

ADD REPLYlink modified 9 months ago • written 9 months ago by modarzi130
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 990 users visited in the last hour