I have a list of GenBank specimen vouchers. Does anyone know some parser or script to get accession numbers and publications information about these sequences?
To make a table like:
Vouchers: VL03F635, WE02491
IDs: EF104615, AY557140, AY556740
References: Wiemers, 2003; Wiemers et al., 2010
It's not difficult to write a python script for that aim, but the problem is in GenBank records. Years and authors appear like different fields and i do not know how to process such a data.