Need help : How can I get the Emboss sixpack output in GFF file
0
0
Entering edit mode
4.5 years ago
shiv ▴ 10

Hi,

I have a genome sequence in Fasta format and I want to perform the Emboss Sixpack tool to get all ORFs (open reading frames) by command-line option. I need the output in GFF/GFF3 format, but every time I am getting the output only in Fasta format. Can anyone help me to sixpack's output in a feature table(GFF/GFF3)?

Thanks in advance

genome • 1.2k views
ADD COMMENT
1
Entering edit mode

I think if you specify -osformat gff it will output in the format you request. See the manual:

http://bio.biomedicine.gu.se/cgi-bin/emboss/help/sixpack under Output file format

ADD REPLY
1
Entering edit mode

Hi,

Thank you for the quick response. I was specifying -offormat gff and getting the output in Fasta format only. Your suggestion worked for me. :)

ADD REPLY
0
Entering edit mode

hi, How can I get the strand (+/-) information where ORFs originated by using EMboss sixpack? I want to get full GFF or GTF like feature tables for each ORF. Just to clarify, My aim to predict all ORFs in that genome Fasta and then remove the ORFs sequences which are overlapped with the annotated region on the same strand (I have an annotated feature file in GFF format) and keep only those ORFs which are not overlapping with the annotated region(co-ordinates). It would be great if anyone can help me with this.

ADD REPLY
0
Entering edit mode

The gff should contain this information, the start position will be smaller than the end position. eg:

V00294_3 [465 - 49] (REVERSE SENSE) E. coli laci gene (codes for the lac repressor).

It's all in the manual.

ADD REPLY
0
Entering edit mode

This type of output can get from the Emboss getORF package.

ADD REPLY

Login before adding your answer.

Traffic: 2668 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6