strange htseq-count result
1
0
Entering edit mode
4.5 years ago
lilingjoyo ▴ 40

hello all,

I'm using STAR to 2-pass alignment and htseq-count to output gene count table. But my htseq-count result table is a bit strange, like follow:

__no_feature    5802774

__ambiguous 4365152

__too_low_aQual 0

__not_aligned   0

__alignment_not_unique  0


all alignments are in no feature or ambiguous couters. Why does this happen. My code is:

htseq-count -s no Aligned.out.sorted.sam gencode.v28.annotation.gtf > samp.htseq-count.tab


Any one has got same problem?

RNA-Seq htsesq-count STAR • 1.6k views
1
Entering edit mode

I added (code) markup to your post for increased readability. You can do this by selecting the text and clicking the 101010 button. When you compose or edit a post that button is in your toolbar, see image below:

0
Entering edit mode

Does the gtf used for htseq-count contain the same chromosome identifiers as your alignment in sam format?

0
Entering edit mode

I thinks so. In my chrName.txt one of my star index files, the chromosome id is in the form of chr#. In my gencode gtf file, it's same.

0
Entering edit mode

To save your time and shorten the pipeline, you can use STAR's --quantMode option (have a look in manual). It will generate exactly the same read-counts as htseq-count (note that probably this would not solve the issue you posted here).

0
Entering edit mode
4.5 years ago
Macspider ★ 3.6k

-s no

This is (imho) the problem. Perhaps your library is stranded. I'm saying this because I see ~ half of the scores in __no_feature and another half in __ambiguous. Try -s yes!

0
Entering edit mode

In fact, I tried both. They put almost same result. All reads are in "no feature" or "ambiguous" group.