Selecting Samples based on Clustering
0
0
Entering edit mode
3.6 years ago
Hyper_Odin ▴ 320

If I only select samples based on their clustering in heatmaps, am I not creating a bias ? For example, I had 35 samples (RNA Seq) nitially, but, in the end, we only selected 12 samples based on their clustering. When I checked for some of the genes in both cases, they were hugely varying in terms of P-adjusted value.

I mean, I know this is a bias but how do I explain this to my boss? Please Helpppp!!!

RNA-Seq sequencing gene • 506 views
ADD COMMENT
0
Entering edit mode

Can you give us some information on what the samples are, your method for clustering, and how clusters were selected?

ADD REPLY
0
Entering edit mode

So, there were 3 types of samples.

  1. Treated Nodules - 16 samples
  2. Control Nodules - 3 samples
  3. Untreated Nodules - 16 samples

Performed Correlation clustering after DESEQ2 using Pheatmap

Heatmap-Most-Var2

Now, let's say, only violet samples with large cluster was selected, and the cyan cluster was completely removed from the final analysis.

ADD REPLY

Login before adding your answer.

Traffic: 1371 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6