Entering edit mode
8 hours ago
Ananaya
•
0
I'm Struggling to automate bulk downloads of ~36k+ metagenome assemblies from JGI's IMG database. The web interface works but isn't feasible at scale. What I've Tried:
- jgi-query (Python)- https://lnkd.in/erXHH5SM
- get_jgi_genomes (Perl)- https://lnkd.in/e_5keBQx
- Custom scripts using JGI's XML API
Issues I'm hitting: "Portal not found" errors for many IMG IDs Authentication cookies expiring mid-download Tape storage timeouts Variable portal naming (IMG_, Ga0, Gp0*)
I know others have faced this (Biostars thread), but haven't found a reliable solution for 2025. Has anyone: Successfully automated bulk IMG downloads recently? Found updated tools that work with current JGI API?
Any guidance appreciated!
Please post direct links to software you mention. Not sure why you are using LinkedIn short URL's.
Historically downloads from JGI have been riddled with problems as noted in prior posts.
Looks like there is a new data portal now online at JGI : https://data.jgi.doe.gov/ Try your luck there.