Question: RSeQC Output from infer_experiment.py - what does it mean?
4
gravatar for JJ
2.1 years ago by
JJ490
JJ490 wrote:

Hi all,

Can someone help me understand the RSeQC Output from infer_experiment.py?

So this is the output:

This is PairEnd Data
Fraction of reads failed to determine: 0.0560
Fraction of reads explained by "1++,1--,2+-,2-+": 0.0192
Fraction of reads explained by "1+-,1-+,2++,2--": 0.9247

So it's stranded but is it fr-firststrand or fr-secondstrand? I do not understand the help given here:

For pair-end RNA-seq, there are two different ways to strand reads (such as Illumina ScriptSeq protocol):

1++,1–,2+-,2-+

read1 mapped to ‘+’ strand indicates parental gene on ‘+’ strand

read1 mapped to ‘-‘ strand indicates parental gene on ‘-‘ strand

read2 mapped to ‘+’ strand indicates parental gene on ‘-‘ strand

read2 mapped to ‘-‘ strand indicates parental gene on ‘+’ strand

1+-,1-+,2++,2–

read1 mapped to ‘+’ strand indicates parental gene on ‘-‘ strand

read1 mapped to ‘-‘ strand indicates parental gene on ‘+’ strand

read2 mapped to ‘+’ strand indicates parental gene on ‘+’ strand

read2 mapped to ‘-‘ strand indicates parental gene on ‘-‘ strand

Thanks for your help!

rna-seq • 2.4k views
ADD COMMENTlink modified 2.1 years ago by Devon Ryan94k • written 2.1 years ago by JJ490
8
gravatar for Devon Ryan
2.1 years ago by
Devon Ryan94k
Freiburg, Germany
Devon Ryan94k wrote:

It means you have a standard (dUTP-based) strand-specific library. If you want to use featureCounts, you'll want the -s 2 setting. For HTSeq-count it's --strand reverse.

ADD COMMENTlink modified 2.1 years ago • written 2.1 years ago by Devon Ryan94k

Thanks! I would like to use HTSeq-count but also Stringtie, which takes either

--rf    Assumes a stranded library fr-firststrand.
--fr    Assumes a stranded library fr-secondstrand.

So it's fr-firststrand, correct?

Could you quickly explain what "1+-,1-+,2++,2--" means?

Thanks!

ADD REPLYlink written 2.1 years ago by JJ490
1

Yes, you want --rf, which matches dUTP based methods (the documentation for those options is horrendous).

1+- means that read 1 mapped to the + strand when the gene itself was on the - strand. For dUTP-based methods, read to aligns in the same direction as the transcript from which the sequenced fragment arose ("read 2 sets the strand"), so you expect "2++" and "2--" to get more signal than "2+-" and "2-+".

ADD REPLYlink written 2.1 years ago by Devon Ryan94k

Hi Devon,

So if 1++,1--,2+-,2-+" get more signal (Value) then its fr-secondstrand

and if 1+-,1-+,2++,2-- get more signal (Value) then its fr-firststrand

Am I understand right?

ADD REPLYlink written 2.1 years ago by Bioinfonext200

Yup. You don't see much "fr-secondstrand" data these days.

ADD REPLYlink written 2.1 years ago by Devon Ryan94k

Thanks - now I get it! One more question concerning HISAT2 and Stringtie. Do I have to set this tag when using HISAT2 in order for Stringtie (and other tools for that matter!) to work properly with strand-specific data?

--rna-strandness <string>

Specify strand-specific information: the default is unstranded.
For single-end reads, use F or R. 'F' means a read corresponds to a transcript. 'R' means a read corresponds to the reverse complemented counterpart of a transcript. For paired-end reads, use either FR or RF.
With this option being used, every read alignment will have an XS attribute tag: '+' means a read belongs to a transcript on '+' strand of genome. '-' means a read belongs to a transcript on '-' strand of genome.

The mapping should be the same apart from the XS tag, shouldn't it? Is this tag important for all the downstream tools? Thank you very much for your help!

ADD REPLYlink written 2.1 years ago by JJ490
1

It ends up not mattering much for alignments unless you're supplying a GTF file. For stringTie, it's quite useful, since then the resulting transcripts can have a meaningful strand assigned to them (makes doing things like finding ORFs and performing annotations a bit simpler).

ADD REPLYlink written 2.1 years ago by Devon Ryan94k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2011 users visited in the last hour