Adding CDS features to Gff3 file
3
0
Entering edit mode
6.8 years ago
abhijit.synl ▴ 60

Hello,

I would like to add the CDS feature to my Gff3 file, that has exons in it. I saw that genometools can do this for you

gt cds -startcodon -finalstopcodon -seqfile foo.fasta -o foo.gff3 [GFF3-FILE]

I am looking for the CDS to be the longest ORF with an ATG start codon per mRNA. Will the above command give me that? Secondly I did not see a flag for specifying the genetic code. Is this by default "standard vertebrate"?

Thanks for the help
Abhijit

Genome-Annotation • 4.5k views
ADD COMMENT
0
Entering edit mode

Though not directly but can be acheived using a combination of Genometools and PASA

ADD REPLY
0
Entering edit mode

This is resolved. please close thread

ADD REPLY
0
Entering edit mode

hi,i had the same problem recently, can you tell me the solution detail

ADD REPLY
0
Entering edit mode

thanks for your prompt reply, these help me a lot. but these still had a problem, after i processed the data accroding your method. the problem is : "gt cds: error file is not sorted." i think that may cause by absent some scaffold in my gff3 file, gt cds need all scaffold according to the order. gt cds will thought the file is not sorted, if some scaffold miss, even gff3 file had been sorted before. what's your opinion? thanks again!

ADD REPLY
0
Entering edit mode

Please attach a sample of your Gff3 file. Maybe the first 250 lines.

ADD REPLY
0
Entering edit mode

i'm sorry i don't known how attach a file in this site, can you give me your email?

ADD REPLY
0
Entering edit mode

See this post on how to use GitHub Gist to share plain text files: A: How to Use Biostars Part-3: Formatting Text and Using GitHub Gists

ADD REPLY
0
Entering edit mode

abhijit.synl & pigeon0411 :

Please do not add answers unless you're answering the top level questions. You should have been using Add Comment and Add Reply appropriately.

I'm moving the extraneous "answers" to comments, but please be more careful in the future.

ADD REPLY
0
Entering edit mode
5.2 years ago
abhijit.synl ▴ 60

You will need to install the following and make sure they are in your PATH variable

1) Gffread utility 2) Genometools 3) GAAS Toolkit

a) Tidy your GFF3

gffread -C -F -T --no-pseudo -o FOO1".gtf" FOO.gff3
gt gtf_to_gff3 -tidy -force -o FOO".gff3" FOO1".gtf" (gt = GenomeTools executable)
gt gff3 -sort -tidy -retainids -addintrons -setsource "Genebank" -force -o FOO"_modi.gff3" FOO".gff3"

b) Keep Longest Isoform

gff3_sp_keep_longest_isoform.pl --gff FOO"_modi.gff3" --output FOO"_modi_longest.gff3" gff3_sp_keep_longest_isoform.pl = From GAAS Toolkit)

Hope this helps. This was a simpler solution than using PASA as mentioned in my previous post Abhijit

ADD COMMENT
0
Entering edit mode
5.2 years ago
pigeon0411 ▴ 10

hi,i find GAG-Genome Annotation Generator can add startcodon&stopcodon to gff file, thanks for everyone!

ADD COMMENT
0
Entering edit mode
5.2 years ago
Juke34 8.8k

An easy way to do so would be to use TransDecoder

ADD COMMENT

Login before adding your answer.

Traffic: 2106 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6