Question: PCA plot interpretation
gravatar for sally009
2.1 years ago by
sally0090 wrote:

Hi Everybody, I have analyzed some RNA-Seq datasets using Deseq2. I have five samples of the Normal embryo and three samples of Trisomy 21. I got the PCA plot and I am not able to interpret what the plot means. I have done online research and tried to find it out but I am not able to understand as I don't have a statistician background and I am not able to find out what it means. I know from the sample that one group is an outlier. Can anybody please help me with what the data means? I have attached the plot to this forum.

rna-seq pca • 1.6k views
ADD COMMENTlink modified 2.1 years ago by Jean-Karim Heriche21k • written 2.1 years ago by sally0090

ADD REPLYlink written 2.1 years ago by sally0090
gravatar for Kritika
2.1 years ago by
Kritika260 wrote:

Hi @Phur The PCA plot is used for classification As i can see there is 25% AND 32% Variation among all samples that means samples(Normal + trisomy) is very close enough This is 2Dimensional plot so you are getting 1 variation as 25% ands another as 32% Hope this is clear!!

ADD COMMENTlink written 2.1 years ago by Kritika260
gravatar for Jean-Karim Heriche
2.1 years ago by
EMBL Heidelberg, Germany
Jean-Karim Heriche21k wrote:

PCA finds axes along which the data points are maximally spread out. These axes are called components and are defined as a linear combination of the variables present in the data. They are ordered by the fraction of total variance that they capture. Plotting the data points in the space of these new axes is often used to visualize whether different categories of data points (e.g. control and treatment) are well separated or to find if there is some structure in the data (i.e. if one can see some pattern in how the points are distributed). Using only a limited number of components is also a dimensionality reduction technique widely used before the application of machine learning algorithms. It's important to understand PCA to also understand its uses and limitations. For example, it assumes that dimensions with maximum variance are most relevant and that orthogonal axes are informative. There are plenty of tutorials and explanations on the web. You may be interested in this post on Cross Validated.

ADD COMMENTlink written 2.1 years ago by Jean-Karim Heriche21k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2224 users visited in the last hour