HTSeq count (reverse count with paired-end data)
1
1
Entering edit mode
9.9 years ago
biotech ▴ 570

To which strand should a pair be counted to? I'm confused counting reads in paire-end RNA-seq data.

https://drive.google.com/file/d/0B8-ZAuZe8jldb3FJVXlZQkNxbjA/edit?usp=sharing

HTSeq RNA-Seq • 6.0k views
ADD COMMENT
1
Entering edit mode

Probably the strand of read1, but it depends on how you made your library.

ADD REPLY
0
Entering edit mode

If you are using HTSeq count, you should also be providing a GTF/GFF file as a reference against which the reads are counted.

According to the manual:

For stranded=no, a read is considered overlapping with a feature regardless of whether it is mapped to the same or the opposite strand as the feature. For stranded=yes and single-end reads, the read has to be mapped to the same strand as the feature. For paired-end reads, the first read has to be on the same strand and the second read on the opposite strand. For stranded=reverse, these rules are reversed

ADD REPLY
0
Entering edit mode

Of course I'm using it.

ADD REPLY
1
Entering edit mode
9.8 years ago
madk00k ▴ 360

The read counting depends on the library protocol. For example, in case the applied protocol is non-strand specific or strand-specific forward, then the read pair is counted, since the first read in the pair has alignment strand which matches the gene strand. In case the protocol is strand-specific reverse the read is not counted. There is a good explanation about strand-specific RNA-seq here.

There are tools which allow to estimate the strand-specificity based on the alignment data. For example, this one.

Hope this helps.

ADD COMMENT

Login before adding your answer.

Traffic: 2966 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6