I have an annotation file in GFF3 format, but I no longer have the amino acid and CDS sequences. Is there a tool that can regenerate those sequence files from a genome in FASTA format and the GFF3 file?
Thank you in advance
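For what it's worth, existing tools are probably the more robust route here, e.g. gffread (`gffread -g genome.fa -x cds.fa -y protein.fa annotation.gff3`). To make clear what such a tool has to do, here is a minimal Python sketch under some assumptions: CDS features are grouped by their `Parent` attribute (falling back to `ID`), all segments of one transcript lie on the same strand, and the standard genetic code applies. The function names (`read_fasta`, `cds_sequences`, `translate`) are my own and do not come from any library.

```python
from collections import defaultdict

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def read_fasta(lines):
    """Return {sequence_id: sequence} from an iterable of FASTA lines."""
    seqs, name, chunks = {}, None, []
    for line in lines:
        line = line.strip()
        if line.startswith(">"):
            if name is not None:
                seqs[name] = "".join(chunks)
            name, chunks = line[1:].split()[0], []
        elif line and name is not None:
            chunks.append(line)
    if name is not None:
        seqs[name] = "".join(chunks)
    return seqs


def cds_sequences(gff_lines, genome):
    """Return {parent_id: spliced CDS} built from the CDS rows of a GFF3 file."""
    segments = defaultdict(list)
    for line in gff_lines:
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.rstrip("\n").split("\t")
        if len(cols) != 9 or cols[2] != "CDS":
            continue
        attrs = dict(kv.split("=", 1) for kv in cols[8].split(";") if "=" in kv)
        # Assumption: Parent groups the segments; ID is used if Parent is absent.
        parents = attrs.get("Parent") or attrs.get("ID")
        if parents is None:
            continue
        for parent in parents.split(","):
            segments[parent].append(
                (cols[0], int(cols[3]), int(cols[4]), cols[6], cols[7])
            )
    result = {}
    for parent, segs in segments.items():
        strand = segs[0][3]
        # Order segments in transcription direction.
        segs.sort(key=lambda s: s[1], reverse=(strand == "-"))
        parts = []
        for seqid, start, end, _, _ in segs:
            piece = genome[seqid][start - 1:end]  # GFF3 is 1-based, inclusive
            if strand == "-":
                piece = piece.translate(COMPLEMENT)[::-1]
            parts.append(piece)
        # The phase of the first segment says how many bases to skip.
        phase = segs[0][4]
        offset = int(phase) if phase in ("1", "2") else 0
        result[parent] = "".join(parts)[offset:]
    return result


def translate(cds):
    """Translate a CDS; unknown codons become X, a trailing partial codon is dropped."""
    cds = cds.upper()
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds) - 2, 3)
    )
```

To use it, read the genome with `read_fasta(open("genome.fa"))`, pass it together with `open("annotation.gff3")` to `cds_sequences`, and call `translate` on each returned sequence (the file names are placeholders). The sketch ignores alternative genetic codes and other edge cases that dedicated tools handle.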
While using AGAT to extract the protein sequences, I got the following warnings:
13 warning messages: Peculiar rare case, we found 8 three_prime_utr while 12 expected.
Either some are supernumerary or some have been merged they overlap or are adjacent while they are not suppose to.
In case you were using gtf file as input (no parent/id attributes), check you provide the attribute (i.e comon_tag) used to group features together (e.g. locus_tag, gene_id, etc.).
(In case your file contains only CDS features, and your organism is prokaryote (e.g rast file), using ID as comon_tag might be the solution.)
Is there a way to find out more information about it or to fix it?
Thank you in advance,
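One way to look into a warning like the one above is to list, per parent transcript, the `three_prime_UTR` features that overlap or sit directly next to each other, since the message suggests such features may have been merged. The sketch below is only a rough diagnostic under my own assumptions: it groups features by the `Parent` attribute, compares feature types case-insensitively, and does not reproduce AGAT's internal check. The function name `touching_features` is hypothetical.

```python
from collections import defaultdict


def touching_features(gff_lines, feature_type="three_prime_UTR"):
    """Return {parent_id: [(span_a, span_b), ...]} for overlapping or adjacent spans."""
    by_parent = defaultdict(list)
    for line in gff_lines:
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.rstrip("\n").split("\t")
        if len(cols) != 9 or cols[2].lower() != feature_type.lower():
            continue
        attrs = dict(kv.split("=", 1) for kv in cols[8].split(";") if "=" in kv)
        for parent in attrs.get("Parent", "").split(","):
            if parent:
                by_parent[parent].append((int(cols[3]), int(cols[4])))
    report = {}
    for parent, spans in by_parent.items():
        spans.sort()
        # Only neighbours in sorted order are compared; that is enough to
        # flag a transcript, though not to list every overlapping pair.
        pairs = [(a, b) for a, b in zip(spans, spans[1:]) if b[0] <= a[1] + 1]
        if pairs:
            report[parent] = pairs
    return report
```

Calling `touching_features(open("annotation.gff3"))` (placeholder file name) returns the transcripts worth inspecting by eye; an empty result means no overlapping or adjacent features of that type were found under these assumptions.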
This question has been extensively discussed previously at: changing ID in an existing GFF3 file