Entering edit mode
7.2 years ago
Denis
▴
320
Sorry for the weird question. I have GFF and Fasta files. I'm looking for the way to check if they relate to the same project version, i.e. in the GFF file i have correct annotation for the sequences in Fasta file. Is it realistic task?
I think it is easy to spot if a fasta / gff pair are not compatible (different chromosome names, features outside chromosome ranges). However, there is no guarantee the gff and fasta they are from the same project if you don't find incompatibilities.
One additional note: You'll make your and your colleagues life easier in the future if you include the sequence part in the gff directly. Any gff can contain an arbitrary sized sequence section after a
##FASTAdirective See other syntax section in the gff specification