I am trying to acquire the genomic fasta sequence for a large list of orthologous gene ids from NCBI. I figured I could convert them to some other identifier, using biopython or mygene, that I could feed in to batch entrez to acquire the fasta sequence of the whole gene sequence; however the accession that matches to the gene fasta is the accession for the whole chromosome/scaffold So I have a few questions, what is the syntax that batch entrez accepts subregion arguements? is there a better identifier that maps to the genomic sequence? is there an easier way to accomplish my task?
It's okay, I found out that Geneious can do what I want in 5 seconds and I wasted a bunch of hours for nothing, although I am curious how Geneious does it, they probably just know how to parse the xml files properly and query further as needed which is what I was trying to do at one point.