Hi all,
I am trying to perform an RNAseq analysis. I have created an index using HISAT2, aligned my paired-end reads to the index, and am now trying to count reads using HTseq.
Unfortunately, when counting, HTseq yields the following result.
__no_feature 6280013
__ambiguous 0
__too_low_aQual 4434361
__not_aligned 4758201
__alignment_not_unique 3366408
What I have done to troubleshoot this.
- Create new index using different file (transcriptome instead of genome)
- Use .gff or .gtf file for counting.
- Sort .sam file prior to counting.
- Ensure
--stranded=no
and--nonunique=all
These changes have not yielded correctly counting output.
At this point I am not sure what to do. I imagine there is something wrong with the way I set my index up and potentially not matching the .gtf file, however, I have downloaded these from the same source so I am not sure.
Thanks
Have you examined the alignments with IGV or a similar genome browser? Are the reads aligning in correct place (e.g. under exons). Also try using
featureCounts
to see if the result is any different.Yes, I have looked at them in a genome browser and they look ok, so I am confused why HTSeq is unable to count them. I tried using featureCounts per your suggestion but this has resulted in 0 reads being counted.