Question: Fr-unstranded vs Fr-first strand
gravatar for clizama
2.2 years ago by
clizama0 wrote:

Just curiosity:

I know some Library preps kits are Strand specific and others no.

In the case of strand specific, for example FR-First strand

What happen if I do a analysis using FR-unstraded and FR-first strand for the same data using top-hat. I should have different numbers of count with HT-seq and in terms of differential expression, for example comparing control vs treatments, the numbers of genes that are significant will change randomly in case of the FR-unstranded.


rna-seq • 1.5k views
ADD COMMENTlink modified 2.2 years ago by Kevin Blighe51k • written 2.2 years ago by clizama0
gravatar for Kevin Blighe
2.2 years ago by
Kevin Blighe51k
Kevin Blighe51k wrote:

From my experience, it does not make much difference (if any at all) provided you are just looking to determine raw counts over known transcripts. For HT-seq, which uses a GTF/GFF reference over which abundances are counted from an aligned BAM, you may observe differences at genes where transcription occurs on both strands, like antisense transcripts (e.g. XIST and TSIX). For other programs, like Kallisto, which counts abundances over a FASTA reference transcriptome, I believe that no or absolute minimal differences in count abundances will be observed (and I have tested this).

A good aligner will be able to align all reads, irrespective, and record strand-specific information in the BAM file. This information can then be used by programs like Cufflinks in order to determine strand-specific count abundances, or indeed HT-seq to corroborate strand-specific information in the aligned BAM file with that in the reference GTF/GFF.

I did this recently for a bacterial RNA-seq project where transcription occurred virtually across the entire circular genome on both strands. Had I selected unstranded, in this case, I would observe roughly double count values and half the identified transcripts (and all would be identified on a single strand). The bacterial genome is obviously different from mammalian, though.

I would encourage you to read this great thread: How To Determine If Paired–End Illumina Rnaseq Reads Are Strand–Specific

The issue of strandedness in RNA-seq analysis is one that causes a lot of headaches, so, I'm sure that others have more to add.

ADD COMMENTlink modified 2.2 years ago • written 2.2 years ago by Kevin Blighe51k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1910 users visited in the last hour