Question: HTSeq-count in RNA-Seq
0
gravatar for mlemusfuentes
14 months ago by
mlemusfuentes10 wrote:

strong texthello everybody,

I'm doing for the first time an analysis of differential gene expression. already perform the alignment with tophat, then use samtools sort for the reads and now I want to count the readings with the HTSeq-count.

htseq-count -f bam -r name filesorted.bam hg19_genes.gtf > directoryoutput/file.HTSeq-count.txt

I get a message that says the following:

**0 GFF lines processed**

**Warning: no features of type 'exon' found**

and the output file that generates have not name of the genes.

I would appreciate any help or guidance.

rna-seq htseq-count • 1.1k views
ADD COMMENTlink modified 14 months ago • written 14 months ago by mlemusfuentes10
1

already perform the alignment with tophat

You should know that the old 'Tuxedo' pipeline of Tophat(2) and Cufflinks is no longer the "advisable" tool for RNA-seq analysis. The software is deprecated/ in low maintenance and should be replaced by HISAT2, StringTie and ballgown. See this paper: Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown. There are also other alternatives, including alignment with STAR and bbmap, or pseudo-alignment using salmon followed by DESEq2 or edgeR.

ADD REPLYlink written 14 months ago by WouterDeCoster41k
1

I like you are not getting tired of posting this. Maybe a bot posting this to everything new post containing 'tophat' would be helpful :-D

ADD REPLYlink written 14 months ago by ATpoint23k
2

That is @WouterTopHatBot :-)

ADD REPLYlink written 14 months ago by genomax71k

Yeah I've thought of that

ADD REPLYlink written 14 months ago by WouterDeCoster41k

htseq-count -h

-t FEATURETYPE, --type=FEATURETYPE
                    feature type (3rd column in GFF file) to be used, all
                    features of other type are ignored (default, suitable
                    for Ensembl GTF files: exon)

The warning message suggests your hg19_genes.gtf doesn't contain exon records.

ADD REPLYlink modified 14 months ago • written 14 months ago by Eric Lim1.4k

hi Eric Lim, thanks for you answer, but I'm confused, how to use -t?

htseq-count -f bam -r name -t?...

Watch for your comments

thanks.

ADD REPLYlink written 14 months ago by mlemusfuentes10

Hi, you should add a reply or comment instead of an answer.

I don't know where you're getting the gtf from and the contents of the file. I'd recommend using the same gtf/gff you used for tophat. You can also download gtf for hg19 at https://www.gencodegenes.org/releases/28lift37.html .

ADD REPLYlink written 14 months ago by Eric Lim1.4k

mlemusfuentes : Show us 10 lines from your GTF file by given us output of head -10 your.gtf.

ADD REPLYlink written 14 months ago by genomax71k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1175 users visited in the last hour