Question: How to directly compare normalized read counts across subgroups within a metatranscriptome?
gravatar for NRC
2.1 years ago by
NRC0 wrote:

I would very much like feedback on this metatranscriptomic/DESEq2 issue from a biostat perspective.

I have 19 individual RNAseq samples corresponding to different locations in the environment. They were sequenced separately using Illumina Hiseq (barcoded), and then all reads were used to generate an assembled metatranscriptome.

Reads from each of the 19 samples were mapped to this metatranscriptome. Based on our annotations, we have many different microbial groups present.

What I would like to do is compare transcript abundance across locations for each major taxonomic group. I have subsetted out contigs and their raw counts corresponding to the groups A, B and C. I have normalized counts for each group in DESeq2 and exported the pseudo-counts.

I would like to generate transcriptome profiles (heatmaps) for each of these groups and look for changes in normalized transcript abundance across sites and among groups. The question is, is it incorrect to display these pseudo-counts side-by-side each other in the same heat map? (I think yes.) Is there a better way to do this if you wish to directly compare normalized transcript abundance across independently-normalized taxonomic groups?

I don't think the answer is to normalize groups A,B,C all together and then later subset them out, because there may be important differences in expression tendencies between groups.

I am not actually interested in a pairwise analysis here, mainly just the changes in transcript abundance with location and group.

Apologies if there is an obvious answer here. Thank you so much for your time and thoughts.

ADD COMMENTlink modified 22 months ago by pedrorodrigues10 • written 2.1 years ago by NRC0
gravatar for theobroma22
2.1 years ago by
theobroma221.1k wrote:

You can make a heatmap or plot however you want, it just has to be visually acceptable and fit your interpretation.

Also, if you want to compare changes in transcript abundance with location and group, you could use ANOVA, no?

ADD COMMENTlink modified 2.1 years ago • written 2.1 years ago by theobroma221.1k

I don't think it's valid to compare normalized counts across data sets. Think about this: you have 1 data set consisting of 2 treatments and 3 reps each (EXPT1) and another data set with 2 different treatments and 3 reps (EXPT2). You normalize both independently. It wouldn't make any sense to then compare all pseudo counts between EXPT1 and EXPT2.

So how would I get around this problem in my metatranscriptome where I normalized each subset (taxa) separately?

ADD REPLYlink written 2.1 years ago by NRC0

Oh right, I overlooked you're using count data while writing my reply, so disregard my last. Well, in your case and your current data setup, you can use Venn Diagrams, bar plots and heatmaps or other graphical displays like radar plots. In other words, there is no more statistical analysis to do, only plot the data you have and write your results.

ADD REPLYlink written 2.1 years ago by theobroma221.1k
gravatar for pedrorodrigues
22 months ago by
pedrorodrigues10 wrote:

In regard to group-level normalization rather than global, I think the method proposed by Klingenberg and Meinicke 2017 might be helpful. I have personally not been able to use their method yet, but it looks promising.

ADD COMMENTlink written 22 months ago by pedrorodrigues10
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 770 users visited in the last hour