I am using Entrez.efetch in BioPython to fetch sequences from genomic coordinates (chr, start, stop, strand):
handle = Entrez.efetch(db="nucleotide",
record = SeqIO.read(handle,"fasta")
The package requires the GI of the chromosomes, but I don't know how to find these. Tried to manually search in the nucleotide database in ncbi, but this gives a lot of different GIs for each chromosome.
I am looking for the most recent mus musculus genomic sequences.
Any suggestions how to get these GIs or another way to circumvent this problem?