Question

Total RNA-Seq vs mRNA-Seq

3

5.5 years ago

Pin.Bioinf ▴ 340

Hello, I have read that 30-50M mapped reads per sample is generally considered the optimal depth for a differential expression (DE) analysis with mRNA-Seq. What would be the minimum for TOTAL RNA-Seq? Is it a lot more?

Thank you

RNA-Seq • 4.9k views

ADD COMMENT • link updated 5.5 years ago by Kristoffer Vitting-Seerup ★ 4.0k • written 5.5 years ago by Pin.Bioinf ▴ 340

0

I do not understand a lot about biology as I am a computer scientist. My colleague asked me if she could add more samples to the run (which would reduce the number of reads per sample), so I am asking for the minimum number of reads (in millions) needed in both cases: total RNA-seq and mRNA-seq. So what I understood is: if we do ribosomal depletion for total RNA-seq, then it will be very similar to doing mRNA-seq, right? Then the minimum number of reads needed will be similar. But if we don't do rRNA depletion with total RNA-seq, then we will need roughly 20 times the reads needed for mRNA-seq, right?

ADD REPLY • link 5.5 years ago by Pin.Bioinf ▴ 340

0

Please use ADD COMMENT/ADD REPLY when responding to existing posts to keep threads logically organized.

ADD REPLY • link 5.5 years ago by GenoMax 141k

score 2 · Answer 1 · 2018-11-05

2

5.5 years ago

WouterDeCoster 47k

As a rough estimate, about 95% of RNA is rRNA. If you are not depleting it in your TOTAL RNA-seq and want to get the same number of reads on your 'interesting' mRNA, then you can multiply that number of reads by 20.
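The factor of 20 follows from simple arithmetic: if roughly 95% of reads come from rRNA, only about 5% land on everything else, so reaching a given number of non-rRNA reads takes 1 / 0.05 = 20 times the depth. A minimal Python sketch of that calculation, assuming reads are sampled in proportion to RNA abundance (the function name and the 30M target are illustrative, not taken from any library):

```python
def total_reads_needed(target_informative_reads: float, rrna_fraction: float) -> float:
    """Total reads to sequence so the expected number of non-rRNA reads hits the target.

    Assumes reads are drawn in proportion to RNA abundance (a simplification).
    """
    if not 0.0 <= rrna_fraction < 1.0:
        raise ValueError("rrna_fraction must be in [0, 1)")
    return target_informative_reads / (1.0 - rrna_fraction)


# Illustrative target: 30M reads on non-rRNA, with ~95% rRNA and no depletion.
needed = total_reads_needed(30e6, 0.95)
print(f"{needed / 1e6:.0f}M total reads")
```

With efficient depletion the rRNA fraction drops sharply, so the same function returns a number much closer to the target itself.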

ADD COMMENT • link 5.5 years ago by WouterDeCoster 47k

0

Thank you WouterDeCoster. So you mean that if I do not deplete rRNA I would need 600-1000M reads per sample? And what if I do ribosomal depletion first?

ADD REPLY • link 5.5 years ago by Pin.Bioinf ▴ 340

1

To be fair, I don't think people actually do "TOTAL" RNA-seq. If you do ribosomal depletion (efficiently), then you are doing roughly the same as polyA enrichment, except that you'll also get a few lowly expressed non-polyA-tailed lncRNAs, which don't change the end result much.

ADD REPLY • link 5.5 years ago by WouterDeCoster 47k

0

What is the final goal? Do you want to find differentially expressed, but non-polyadenylated genes/transcripts? Is there anything wrong with polyA-enrichment?

ADD REPLY • link 5.5 years ago by ATpoint 82k

score 1 · Answer 2 · 2018-11-06

This is a two-part answer:

On the topic of total RNA

As mentioned by WouterDeCoster, about 95% of cellular RNA is rRNA. RNA library preparation therefore always includes either:

rRNA depletion
poly-A selection

to enrich for the RNA of interest.

This is built into the library preparation protocols, so you don't need to think about it. You just tell the sequencing center which one you want (if you don't say anything, I would guess 95% would do poly-A selection).

Please note that rRNA depletion is often referred to as total-RNA.

On the topic of number of reads

I have 3 comments:

To do a gene-level differential expression analysis you need 5-10e6 reads per sample.
The number of independent biological replicates is the major determining factor in the power you have - you need at least 3 replicates in each condition!
If you want to do a transcript-level analysis, which enables analysis of, among other things, isoform switches, you need to sequence deeper - 30-50e6 paired-end reads.
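Since the original question was whether more samples can share one run, these thresholds can be turned into a quick check. The sketch below assumes reads are split evenly across samples and uses a made-up run output of 400M reads; the function name and that figure are hypothetical, and real runs will not split perfectly evenly.

```python
# Lower bounds (reads per sample) taken from the comments in this answer.
GENE_LEVEL_MIN = 5e6
TRANSCRIPT_LEVEL_MIN = 30e6


def max_samples(run_reads: float, min_reads_per_sample: float) -> int:
    """Largest number of samples that keeps each at or above the minimum depth,
    assuming the run's reads are divided evenly among samples."""
    return int(run_reads // min_reads_per_sample)


run_reads = 400e6  # hypothetical total output of one sequencing run
print("gene-level:", max_samples(run_reads, GENE_LEVEL_MIN))
print("transcript-level:", max_samples(run_reads, TRANSCRIPT_LEVEL_MIN))
```

This only covers depth; it says nothing about replicate count, which the second comment flags as the main driver of statistical power.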

You can find more information about good practices in RNA-seq analysis here. Analysis of isoform switches, such as the analysis presented here, can be done with my R package IsoformSwitchAnalyzeR.