Question: How to run HISAT2 / Stringtie / Ballgown analysis starting with gff file
9 months ago by
European Union
al-ash0 wrote:

Hi! I'm trying to extract exon and splice-site information from a gff file (downloaded from NCBI) using the Python scripts provided in the HISAT2 package, for downstream analysis starting with HISAT2.

The scripts work fine on the example gtf file, which I downloaded as a supplementary file from the Nature Protocols article. When I use them on the gff file, however, they run without returning any error but the output files are empty.

I guess this is because of the gtf format, although the information that should be extracted from the gtf file is present in the same columns of my gff file, so I thought the scripts might work on the gff file as well. I tried simply renaming the *.gff to *.gtf, but the Python scripts again produced empty files.

I'd be thankful for any suggestions on how to extract the exon and splice-site information from the gff file!

hisat2 stringtie gff gtf • 1.1k views
9 months ago by
Jeffin Rockey390
Jeffin Rockey390 wrote:

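Please do not simply rename a gff file to gtf or vice versa. The two formats differ substantially, especially in the 9th (attributes) column: GTF writes attributes as key "value"; pairs, whereas GFF3 writes them as key=value pairs separated by semicolons.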
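
To illustrate why renaming is not enough, here is a minimal Python sketch that reads column 9 the way a GTF-oriented tool typically would. The two sample lines and the helper name parse_gtf_attributes are made up for illustration. That the HISAT2 scripts skip exon lines lacking gene_id and transcript_id is an assumption about their behaviour, not something confirmed in this thread.

```python
def parse_gtf_attributes(column9):
    """Parse a GTF-style attribute column of the form: key "value"; key "value";"""
    attributes = {}
    for field in column9.split(";"):
        field = field.strip()
        if not field:
            continue
        # GTF separates the key from its quoted value with a space.
        key, _, value = field.partition(" ")
        attributes[key] = value.strip('"')
    return attributes


# Hypothetical tab-separated example lines; not taken from a real annotation.
gtf_line = 'chr1\tsrc\texon\t100\t200\t.\t+\t.\tgene_id "g1"; transcript_id "t1";'
gff3_line = "chr1\tsrc\texon\t100\t200\t.\t+\t.\tID=exon1;Parent=t1"

gtf_attrs = parse_gtf_attributes(gtf_line.split("\t")[8])
gff3_attrs = parse_gtf_attributes(gff3_line.split("\t")[8])

print(gtf_attrs)   # keys gene_id and transcript_id are found
print(gff3_attrs)  # each whole "key=value" string ends up as a key
print("transcript_id" in gtf_attrs, "transcript_id" in gff3_attrs)
```

With the GFF3 line, no transcript_id key is found even though columns 1 to 8 look the same, so a tool that requires it would silently skip every line and write an empty file.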

If you find that a tool works with gtf but not with gff3, one way to resolve this is to convert the gff3 to gtf and then try the tool again.

Please see Biostars discussions like this to find a suitable way to convert the gff to gtf.

Another way to convert is with the UCSC tools:

Step 1: gff3ToGenePred yourfile.gff yourgenemodel.genePred

Step 2: genePredToGtf file yourgenemodel.genePred yourgenemodel.gtf

Also, there is a conversion program in GenomeTools.

If the conversion succeeds for your file, the extraction scripts should also work as expected with the converted gtf.


Thanks! I used gffread to convert my gff to gtf. The scripts processed the new gtf without any apparent problem, and the output files containing the extracted exon and splice-site information look reasonable.
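
For reference, a conversion like this could be scripted roughly as follows. This is a sketch and not the exact commands used above: the function names and file names are placeholders, gffread and the HISAT2 scripts are assumed to be installed and on the PATH, and the extraction scripts are assumed to write their results to standard output.

```python
import subprocess


def gffread_command(gff_path, gtf_path):
    """Build the gffread call that writes GTF (-T) to the given output file (-o)."""
    return ["gffread", gff_path, "-T", "-o", gtf_path]


def convert_and_extract(gff_path, gtf_path, ss_path, exon_path):
    """Convert GFF to GTF, then extract splice sites and exons from the GTF."""
    subprocess.run(gffread_command(gff_path, gtf_path), check=True)
    for script, out_path in (
        ("hisat2_extract_splice_sites.py", ss_path),
        ("hisat2_extract_exons.py", exon_path),
    ):
        # Redirect each script's standard output into its own result file.
        with open(out_path, "w") as out:
            subprocess.run([script, gtf_path], stdout=out, check=True)


# Example call with placeholder file names (left commented out so nothing runs on import):
# convert_and_extract("annotation.gff", "annotation.gtf", "annotation.ss", "annotation.exon")
```

Using check=True makes a failed conversion raise an error instead of passing silently, which is worth having here because the original problem was empty output without any error message.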

written 9 months ago by al-ash0

Good to know that it worked for you.


written 9 months ago by Jeffin Rockey390