Question: Help fixing issue with genbank submission (FeatureBeginsOrEndsInGap)
0
gravatar for jmartin77777
11 weeks ago by
jmartin777770 wrote:

I've submitted an annotated assembly to GenBank and after their review they found some problems I need to address. These annotations are on a draft assembly, contigs organized into scaffolds. The annotation was done using MAKER on the scaffold fasta file. The problem GenBank has pointed out is that some of my CDS fall very close to 'gaps'. Ie. my CDS are with 3bp of either the end of a scaffold, or the end of a contig within a scaffold. The contigs are defined by segments of the scaffold split by stretches of 10+ Ns.

GenBank flagged 575 of my CDS with this problem, so its too many to handle with manual fixes. I would like to have a programmatic fix but the code to handle this would be challenging for me, so before I start on a script I thought I would ask to see if anyone else may already have code for this. The annotations I need to correct are in GFF3 format, so I changes to the CDS may also require changes to parent features (exon, mRNA & gene).

Thanks, John

submission annotation • 124 views
ADD COMMENTlink modified 6 weeks ago by Biostar ♦♦ 20 • written 11 weeks ago by jmartin777770

GAG (https://github.com/genomeannotation/GAG) might be able to help- no guarantees. I have used it with MAKER outputted GFF3 files, but you need to do a little scripting to get the functional annotation (i.e., what gene name your predicted genes might below to) to work properly.

ADD REPLYlink modified 6 weeks ago • written 6 weeks ago by jean.elbers1.1k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1976 users visited in the last hour