Help fixing issue with genbank submission (FeatureBeginsOrEndsInGap)
0
0
Entering edit mode
5.2 years ago

I've submitted an annotated assembly to GenBank and after their review they found some problems I need to address. These annotations are on a draft assembly, contigs organized into scaffolds. The annotation was done using MAKER on the scaffold fasta file. The problem GenBank has pointed out is that some of my CDS fall very close to 'gaps'. Ie. my CDS are with 3bp of either the end of a scaffold, or the end of a contig within a scaffold. The contigs are defined by segments of the scaffold split by stretches of 10+ Ns.

GenBank flagged 575 of my CDS with this problem, so its too many to handle with manual fixes. I would like to have a programmatic fix but the code to handle this would be challenging for me, so before I start on a script I thought I would ask to see if anyone else may already have code for this. The annotations I need to correct are in GFF3 format, so I changes to the CDS may also require changes to parent features (exon, mRNA & gene).

Thanks, John

Annotation Submission • 840 views
ADD COMMENT
0
Entering edit mode

GAG (https://github.com/genomeannotation/GAG) might be able to help- no guarantees. I have used it with MAKER outputted GFF3 files, but you need to do a little scripting to get the functional annotation (i.e., what gene name your predicted genes might below to) to work properly.

ADD REPLY

Login before adding your answer.

Traffic: 3443 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6