Question: How to run HISAT2 / Stringtie / Ballgown analysis starting with gff file
gravatar for al-ash
3.0 years ago by
al-ash110 wrote:

Hi! I'm trying to extract exon and splice site information from the gff file (which I downloaded from NCBI: using the python scripts provided in HISAT2 package and for downstream analysis starting with HISAT2.

The scripts work fine for the example gtf file which I downloaded as supplementary file from the Nature Protocols article ( ) but when I use these on the gff file the scripts run without returning any error but the output files are empty.

I guess it is because of the gtf fromat (although I see that the information which should be extracted from the gtf file is present in the same columns also in my gff file so I thought the scripts might work as well on the gff file). I simply tried to rename the *.gff to *. gtf, but the python scripts outputed again empty files.

I'll be thankfull for any suggestions on how to extract the exon and splice site information from the gff file!

hisat2 stringtie gff gtf • 4.1k views
ADD COMMENTlink modified 3.0 years ago • written 3.0 years ago by al-ash110
gravatar for Jeffin Rockey
3.0 years ago by
Jeffin Rockey1.1k
Jeffin Rockey1.1k wrote:

Please do not simply rename a gff file to gtf or vice versa. There are quite serious differences especially in 9th column.

If you find that a tool works with gtf and not gff3 , an approach to resolution is to convert the gff3 to gtf and then try the tool again.

Please see biostars dicussions like this to get a suitable way to convert the gff to gtf.

Another method to convert is from ucsc

Step1: gff3ToGenePred yourfile.gff yourgenemodel.genePred

Step2: genePredToGtf file yourgenemodel.genePred yourgenemodel.gtf

Also , there is a program in genometools

I hope if the conversion happens successfully for your file, the extraction program should also work as expected with the converted gtf.

ADD COMMENTlink written 3.0 years ago by Jeffin Rockey1.1k

Thanks! I used gffread to convert my gff to gtf and the new gtf was processed by the scripts without any apparent problem and the output files containing extracted exon and splice site information look reasonable.

ADD REPLYlink written 3.0 years ago by al-ash110

Good to know that it worked for you.


ADD REPLYlink written 3.0 years ago by Jeffin Rockey1.1k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1192 users visited in the last hour