Question: all unique compound structure (SMILES) of PubChem Database
1
gravatar for ajingnk
4.4 years ago by
ajingnk130
United States
ajingnk130 wrote:

Hi everyone,

I want to get all unique compound structures of PubChem Database. I have download SDF file for PubChem, but it is 45G after gzip. If I convert all SDF file to SMILES, that won't be easy... Is there any way to retrieve all SMILES for the whole PubChem?

 

 

Thanks,Jing

pubchem compound • 2.8k views
ADD COMMENTlink modified 4.4 years ago • written 4.4 years ago by ajingnk130
1
gravatar for ajingnk
4.4 years ago by
ajingnk130
United States
ajingnk130 wrote:

In case someone also needs it, PubChem has InCHI data for all compound.

The FTP InCHI data can be downloaded from the following FTP directory: ftp://ftp.ncbi.nlm.nih.gov/pubchem/Compound/Extras/CID-InChI-Key.gz

ADD COMMENTlink written 4.4 years ago by ajingnk130
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1140 users visited in the last hour