How many of the reads end up being counted by HTSeq count
1
0
Entering edit mode
4.1 years ago
Apex92 ▴ 290

I am new to these types of quality control checks. I have not faced any error in my analyses but want to perform quality control. I have aligned my sncRNA reads with bowtie (-m 1 option) and after making the bam output from the alignment I performed HTSeq count. Now I want to see if most of the reads actually end up being represented in the HTSeq file as gene counts. How can I do that?

Should I just simply sum up the total of the counts reported by HTSeq and divide it by the total number of unique reads?

Any help will be appreciated, thanks

rna-seq Assembly sequence • 1.1k views
ADD COMMENT
1
Entering edit mode
4.1 years ago
Ido Tamir 5.2k

https://htseq.readthedocs.io/en/master/count.html#usage at the bottom of the file you have some lines with some special counters you have to exclude for calculating the ratio. All of it has to sum up to the number of input reads. http://bioinf.wehi.edu.au/featureCounts/ is faster and gives you the same output, but the special counters are in a separate file.

ADD COMMENT
0
Entering edit mode

Thank you, Tamir for the answer. So after removing that extra lines, should I just simply sum up the total of the counts reported by HTSeq and divide it by the total number of unique reads?

ADD REPLY
0
Entering edit mode

better to sum up without the extra lines, sum up with the extra lines and then divide.

ADD REPLY

Login before adding your answer.

Traffic: 2847 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6