Hi,
I have a large list of NCBI accession codes of complete genome sequences of a species, and need to download a few gene from each one of these sequences (same genes for each sequences, whit slightly different position in different sequences). Being the list large, I cannot manually check for the genes positions in the annotations of each sequence and download the regions of interest, and I was wondering if there is a way of using the NCBI tools to do this in a more automatized fashion. I tried playing with efetch and eUtils, but with no success so far.
Does anybody have any idea how to do this?
I know that the "download all the sequences and align them to find the genes" should work, but few of these sequences have Ns stretches that makes the alignment problematic.
Thanks in advance for the help.
Best