Proper Bam / Tophat Output For Strand Specific Paired-End Data?
1
1
Entering edit mode
11.5 years ago
user ▴ 940

If I have a strand-specific paired-end library and align with Tophat with an option like fr-firstrand, what is the proper to encode the strand of the pairs in the BAM, and how do mappers like Tophat handle it? With fr-firstrand we know that the first strand is the strand of the read, but by definition the second mate will be on opposite orientation. If the first read in the pair has strand + in its alignment in the BAM file, then will the second read in the pair also have strand + or can we count on it being -? Thanks

tophat RNA-seq bam • 3.4k views
ADD COMMENT
0
Entering edit mode
11.5 years ago
Ido Tamir 5.2k

Tophat1 should give you:

  +/- or -/+ if you specify fr-X: --> <-- 
  +/+ or -/- if you specify ff-X: --> -->

Tophat1 also gives you singletons as output (i.e. only the first or second read map). Tophat2 can also give your discordant reads i.e. +/+ although fr (--> <--) was specified.

Important is the XS flag which tells you which strand the read comes from depending on X-firststrand or X-secondstrand.

ADD COMMENT

Login before adding your answer.

Traffic: 2966 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6