Question: Download PubChem chemical compound fingerprints
emanismail.92 wrote (6 months ago):

I want to download the fingerprint of a PubChem compound by its CID. I used the PubChem API: https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/cid/42628049/property/Fingerprint2D/xml

However, I have a huge list of CIDs and need the corresponding fingerprint for each one. Is there any way to download them in bulk, or to speed up retrieving the data?


Hello emanismail.92!

It appears that your post has been cross-posted to another site: https://bioinformatics.stackexchange.com/questions/4788

This is typically not recommended as it runs the risk of annoying people in both communities.

Comment by h.mon, 6 months ago.
h.mon wrote (6 months ago):

Not elegant, but works:

# Read one CID per line and save each fingerprint to its own XML file
while read -r CID; do
    curl -L "https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/cid/${CID}/property/Fingerprint2D/xml" -o "${CID}.xml"
done < CIDs.txt

Where CIDs.txt is a file with the CIDs of interest, one per line.
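
On the bulk part of the question: PUG REST also accepts a comma-separated list of CIDs in a single URL, so one request can return many fingerprints. A rough sketch of building such batched URLs is below. The function name batch_urls and the batch size of 100 are my own assumptions, not anything PubChem prescribes, and very long CID lists may run into URL length limits, so keep batches modest.

```shell
#!/usr/bin/env bash
# Sketch: group CIDs into comma-separated batches, one request URL per batch.
# batch_urls and the default batch size are illustrative choices (assumptions).

BASE="https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/cid"

# Read CIDs from stdin (one per line) and print one URL per batch.
# $1 = number of CIDs per batch (defaults to 100).
batch_urls() {
    local size="${1:-100}"
    local cids
    # xargs echoes up to $size CIDs per output line; tr turns the spaces into commas
    xargs -n "$size" | tr ' ' ',' | while read -r cids; do
        [ -n "$cids" ] || continue   # skip the empty line some xargs emit on empty input
        printf '%s/%s/property/Fingerprint2D/CSV\n' "$BASE" "$cids"
    done
}
```

Running batch_urls 100 < CIDs.txt prints the URLs; each one can then be fetched with curl -L, pausing between requests. The CSV output has one row per CID, so the results from several batches can be concatenated (each batch repeats the header line).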

edit: pay attention to NCBI limits:

Request Volume Limitations

All PubChem web pages (or requests to NCBI in general) have a policy that users should throttle their web page requests, which includes web-based programmatic services. Violation of usage policies may result in the user being temporarily blocked from accessing PubChem (or NCBI) resources. The current request volume limits are:

No more than 5 requests per second.

No more than 400 requests per minute.

No longer than 300 second running time per minute.
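
One simple way to stay under these limits is to pause between requests. This is only a sketch: fetch_throttled, DELAY and FETCH are names I made up for illustration, and the 0.25 s default is just a conservative choice (at most 4 requests per second, 240 per minute).

```shell
#!/usr/bin/env bash
# Sketch: fetch URLs one at a time with a fixed pause between requests.
# fetch_throttled, DELAY and FETCH are illustrative names (assumptions).

DELAY="${DELAY:-0.25}"       # seconds between requests; fractional sleep works with GNU and BSD sleep
FETCH="${FETCH:-curl -sL}"   # command used to fetch a URL; override it for a dry run

# Read URLs from stdin (one per line), fetch each, then wait DELAY seconds.
fetch_throttled() {
    local url
    while read -r url; do
        [ -n "$url" ] || continue                      # ignore blank lines
        # $FETCH is left unquoted on purpose so "curl -sL" splits into command and option
        $FETCH "$url" || echo "failed: $url" >&2
        sleep "$DELAY"
    done
}
```

For example, printf '%s\n' "https://pubchem.ncbi.nlm.nih.gov/rest/pug/compound/cid/42628049/property/Fingerprint2D/xml" | fetch_throttled writes the response to standard output; setting FETCH=echo first shows which URLs would be requested without contacting the server.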
