Is this PCA plot correct??
2
0
Entering edit mode
17 hours ago
m00nphany • 0

enter image description here

I am facing issues with the control and samples clustering even after batch correction - the samples contain multiple tissue types

PCA • 1.1k views
ADD COMMENT
0
Entering edit mode

It would also help to add information about what kind of data this is and what you did to the data to get to this point.

ADD REPLY
1
Entering edit mode
11 hours ago
Mensur Dlakic ★ 30k

I can tell you what you already know: this result is unexpected. Nobody can tell you whether the plot is "correct" or not - it depends on whether you processed the data properly. If yes, then the result is what it is. Either your two groups are not different in ways you expected, or something went wrong during the experimental stage.

ADD COMMENT
1
Entering edit mode
6 hours ago
ATpoint 89k

Assuming that the PCA was "technically" done correctly (bulletproof way would be to use plotPCA from DESeq2 as a wrapper) -- then the separation is simply not due to treatment. You say multiple tissues. PCA uses feature selection of genes (usually by variance) and it is perfectly expected that differences across tissues can be (much) stronger than differences by treatment. Hence two things for starters:

  • color it by tissue
  • regress tissue from the rlog counts (limma removeBatchEffect() using tissue as batch) and repeat PCA
ADD COMMENT
0
Entering edit mode

In addition to these comments, I would suggest checking other PCs, the first two PCs only explain a combined 39% of the variance. Since you have multiple tissues, it could be the the clustering you're looking for is smaller than expected and explained in one of the other PC combinations (e.g., PC1 + PC3).

ADD REPLY

Login before adding your answer.

Traffic: 4823 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6