Entering edit mode
3.0 years ago
sviatoslav.kendall ▴ 880
I'm trying to do a bulk download from CNGB of all the protein sequences they have for a select species. I can find these sequences on their website and know that they number about 30K in total (although at least some are cross-listed on other repositories like Uniprot). But trying to find any of them in their FTP directory seems hopeless without some kind of index telling you what directory to search in.
The documentation seems pretty sparse so I'm not even sure whether this is possible. Has anyone else figured out a way to do bulk downloads from CNGB?