Question: Normalisation of RNA-seq for different RNA biotypes
gravatar for Nathaniel
2.8 years ago by
Nathaniel70 wrote:

I have RNA-seq count data and I want to perform traditional analysis like PCA/heatmaps on each RNA biotype independently.

My question is: when performing the RNA normalisation step (using for instance DESeq) should I do it on the entire gene expression matrix with all biotypes concatenated, or is better if I do it separately for each gene biotype?

What worries me is that if we do the processing in the concatenated expresion matrix, the mRNA will completely mask the miRNAs, since they are much more expressed. Also, if i then filter let's say the 10% less expressed genes I will probably end up filtering more miRNAs than mRNAs for the same reason.


rna-seq deseq • 949 views
ADD COMMENTlink modified 2.8 years ago by Devon Ryan91k • written 2.8 years ago by Nathaniel70
gravatar for Devon Ryan
2.8 years ago by
Devon Ryan91k
Freiburg, Germany
Devon Ryan91k wrote:

Estimate the size factors and dispersion using the whole dataset and then subset for further comparisons. Your miRNAs won't be masked then or filtered out. Having said that, if you did RNAseq then you're going to get largely meaningless miRNA counts. I would strongly encourage you to simply ignore them. If you want to measure miRNA differences, sequence small RNAs. Otherwise you're just measuring differences in size selection during library prep between the groups...

ADD COMMENTlink written 2.8 years ago by Devon Ryan91k

Great, thank you! does the same apply for long RNAs such as lincRNAs? Those should be safe to analyse, right?

ADD REPLYlink written 2.8 years ago by Nathaniel70

lincRNAs are fine if you did ribo depletion rather than polyA selection.

ADD REPLYlink written 2.8 years ago by Devon Ryan91k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1024 users visited in the last hour