StringTie strandness of library in dUTP protocol
5.0 years ago
compuTE ▴ 140

Can somebody explain to me why StringTie uses the name fr-firststrand for a library prepared with the dUTP protocol? The dUTP protocol keeps the reverse strand of the amplified cDNA, right? So if it is the reverse strand, wouldn't it make more sense to call it secondstrand?

Maybe the answer is "because that's the way they wanted to name it", but maybe I am missing something, and if so, I would like to know.

I am a computer scientist and I am trying to understand the biology behind this. Sorry in advance if this is basic wet-lab terminology I'm not aware of.

stringtie libraryprep rnaseq dutp • 2.6k views
5.0 years ago
Juke34 8.5k

It's just a matter of reference frame: whether you take the mRNA strand or the first synthesized cDNA strand as the reference. To understand what is called the first strand, have a look at the figure here or here for an up-to-date version of it.
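To make the two reference frames concrete, here is a toy sketch in Python. The sequence and the helper name are made up for illustration and do not come from any tool. It only assumes that the first cDNA strand is the reverse complement of the mRNA, and that in a dUTP library the second strand is degraded, so read 1 is read off the first strand.

```python
# Toy sketch with a hypothetical sequence; the mRNA is written in the DNA
# alphabet (T instead of U) to keep the example simple.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


mrna = "ATGGCCTTAG"  # transcript (sense) sequence, 5'->3'

# First strand: made by reverse transcription, so it is antisense to the mRNA.
first_strand = reverse_complement(mrna)

# Second strand: copied from the first strand, so it matches the mRNA again.
second_strand = reverse_complement(first_strand)

# dUTP protocol: the second strand is degraded, so read 1 starts at the
# 5' end of the first strand (the right-most end in transcript coordinates).
read1_dutp = first_strand[:5]

print(first_strand)           # CTAAGGCCAT
print(second_strand == mrna)  # True
print(read1_dutp)             # CTAAG
```

So "firststrand" names the strand that is actually sequenced, even though that strand is antisense to the transcript.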


Your figures are very helpful, thank you.

5.0 years ago
h.mon 35k

StringTie uses this nomenclature for historical reasons: it inherited the convention from TopHat. TopHat had a rationale, which is explained in its manual:

  • fr-unstranded (Standard Illumina): Reads from the left-most end of the fragment (in transcript coordinates) map to the transcript strand, and the right-most end maps to the opposite strand.

  • fr-firststrand (dUTP, NSR, NNSR): Same as above except we enforce the rule that the right-most end of the fragment (in transcript coordinates) is the first sequenced (or only sequenced for single-end reads). Equivalently, it is assumed that only the strand generated during first strand synthesis is sequenced.

  • fr-secondstrand (Ligation, Standard SOLiD): Same as above except we enforce the rule that the left-most end of the fragment (in transcript coordinates) is the first sequenced (or only sequenced for single-end reads). Equivalently, it is assumed that only the strand generated during second strand synthesis is sequenced.

P.S.: as both tools are from the same group and share a number of authors, one could argue that the reasoning for this nomenclature is the same for both tools.
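The TopHat rules quoted above can be sketched as a small Python function. This is only an illustration of the convention, not code from TopHat or StringTie: the function name `transcript_strand` and its arguments are hypothetical. The `--rf` and `--fr` entries are the options the StringTie manual lists for fr-firststrand and fr-secondstrand libraries, respectively.

```python
def transcript_strand(library_type: str, read_strand: str, mate: int = 1) -> str:
    """Infer the transcript strand from the strand a read aligned to.

    library_type: "fr-unstranded", "fr-firststrand" or "fr-secondstrand"
    read_strand:  "+" or "-", the genomic strand of the alignment
    mate:         1 for read 1 (or a single-end read), 2 for read 2
    """
    if read_strand not in ("+", "-"):
        raise ValueError("read_strand must be '+' or '-'")
    if mate not in (1, 2):
        raise ValueError("mate must be 1 or 2")
    if library_type == "fr-unstranded":
        return "."  # the alignment strand says nothing about the transcript
    if library_type not in ("fr-firststrand", "fr-secondstrand"):
        raise ValueError(f"unknown library type: {library_type}")

    # fr-firststrand (dUTP): read 1 is antisense, read 2 matches the transcript.
    # fr-secondstrand:       read 1 matches the transcript, read 2 is antisense.
    sense_mate = 2 if library_type == "fr-firststrand" else 1
    if mate == sense_mate:
        return read_strand
    return "-" if read_strand == "+" else "+"


# StringTie options corresponding to each stranded library type.
STRINGTIE_FLAG = {"fr-firststrand": "--rf", "fr-secondstrand": "--fr"}

# A dUTP read 1 aligned to the minus strand implies a plus-strand transcript.
print(transcript_strand("fr-firststrand", "-", mate=1))  # +
print(STRINGTIE_FLAG["fr-firststrand"])                  # --rf
```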
