I have a list of protein names and I wanted to get the fasta sequence of all the proteins in the list. In batch entrez it requires GI or accession number, but I have only protein names! Is there is any other way to download sequences in batch!? Please help me
You should be able to provide an search term to esearch, but searching by gene name can be tricky, genes with similar names will also get downloaded. You should check the downloaded data before using it, also remember that you will likely capture all the various isoforms of the protein. You should manually validate the data you've downloaded before proceeding.