Mapping Drug/Chemical Names To Cas Number
3
2
Entering edit mode
11.2 years ago
Nims ▴ 90

I would like to know it there is any way to map a list of drug/chemical names to their corresponding CAS RN, if only names of drug/chemicals are known.

drug target • 7.2k views
ADD COMMENT
1
Entering edit mode

Singh, what numbers are you talking about and what do you want to do with the CAS RNs when youve got them ?

ADD REPLY
1
Entering edit mode
11.2 years ago
cdsouthan ★ 1.9k

PubChem have these in the CID synonym lists. They are somewhat noisy because the name and RN mappings done by different secondary sources (in the SIDs) are not necessarily concordant with SciFinder direct (salt-parent cross-overs are common in fact) But why not use InChIKey ? these are openly provenanced.

You can get some background here http://www.slideshare.net/cdsouthan/southan-bio-it2012drugnames

ADD COMMENT
0
Entering edit mode
ADD REPLY
0
Entering edit mode

So to be more precise - I have only drug/chemical names and nothing else. And I want to fins their corresponding CAS number.

ADD REPLY
0
Entering edit mode

You can use MGs list, exept you might choose "filtered". If you only have names (from where?) your mapping results will be noisy but there is actually no way to resolve these cleanly.

ADD REPLY
1
Entering edit mode
11.2 years ago
SimonCB765 ▴ 150

If your drug is in the Therapeutic Target Database, then you can use their file of cross-references to get the CAS number (along with other identifiers).

ADD COMMENT
0
Entering edit mode

Thanks a ton.I just missed the file you pointed out. Is there also something that could co-relate TTD targets to corresponding NCBI gene (I could only see UniProt IDs)?

ADD REPLY
0
Entering edit mode

Unfortunately, I never found anything that did that. I ended up using UniProt accessions as my 'true' ID, and mapped TTD target IDs and NCBI gene IDs to UniProt accessions (along with all other IDs I needed). Not perfect, but going between UniProt and NCBI gene IDs was relatively simple.

ADD REPLY
0
Entering edit mode

So if I am not wrong - you are getting NCBI gene ID indirectly from UniProt, probably mapping first to a UniProt record and then NCBI gene ID.

ADD REPLY
0
Entering edit mode

The question is which mapping you trust more/has greater coverage. Do you trust UniProt to record the NCBI IDs more accurately and completely, or NCBI to record the UniProt accession more accurately and completely. The way I did it was to go from all my sources to UniProt. In the case of NCBI gene IDs I believe I used the gene2accession file (ftp://ftp.ncbi.nih.gov/gene/DATA/) to map from Gene IDs to UniProt accessions.

ADD REPLY
0
Entering edit mode
11.2 years ago

One good source is WikiPedia, which has been working with Chemical Abstracts, resulting in the http://commonchemistry.org/ project.

ADD COMMENT

Login before adding your answer.

Traffic: 1376 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6