Open Chemistry Data In Rdf
2
4
Entering edit mode
13.3 years ago
Nico Adams ▴ 460

Dear All,

I am currently looking to put together a comprehensive list of chemical data serialised in RDF. To be included in the list, the data has to be (a) open, (b) relating to chemical entities and (c ) available in an RDF serialisation. Without wanting to be too prescriptive, when I say "chemical" I mean small molecular entities or synthetic polymers - I am less interested in proteins or DNA etc.. My question is, which data providers the community knows that fit the criteria. So far, I have

Which other data sources can you guys think of? I’d be grateful for your answers.

For your information, I have also asked the same question over on the BlueObelisk site.

chemoinformatics • 4.2k views
ADD COMMENT
1
Entering edit mode

Hi Egon, why not - should we maybe convert this into a community wiki thing and try and keep it up-to-date?

ADD REPLY
0
Entering edit mode

The Open in bioinformatics does not really seem to stretch all the way down to small molecules :(

ADD REPLY
0
Entering edit mode

Nico, do you like to extend this to half-Open data, like that the GNU or CC licenses?

ADD REPLY
1
Entering edit mode
13.3 years ago

OpenTox has an increasing amount of chemistry data. I have also RDF solubility data from the Open Notebook Science project, and a RDF version of the ChemPedia substances (on Science 3.0). And there is also Chem2Bio2RDF.

ADD COMMENT
1
Entering edit mode
13.2 years ago

What you are looking for is also one of the core goals of the OpenPhacts project that is part of Innovative Medicine Initiative (IMI) that just started (no website yet, but I could bring you in contact with the developers). It is also important for the large knowledge collider project ([?]LarKC[?]) that is in fact one of the partners in OpenPhacts. One of the other partners is [?]RSC[?], the Royal Society of Chemistry, the intend of the OpenPhacts project is to also provide (part of) the [?]ChemSpider[?] content in RDF.

As far as I know there is no RDF serialisation for [?]HMDB[?], but I would be surprised if they would not be interested in providing one. So it might worth to contact David Wishart about that.

ADD COMMENT
0
Entering edit mode

Yes, I'm aware of this project, and looking forward to what the bring to the community. Any insight yet to the license or waiver?

ADD REPLY
0
Entering edit mode

I don't know for LarKC, but for OpenPhacts the answer is no. License development is thought to be part of the sustainability plan development. After all this is a multi million investment in an open reasoning and knowledge environment for (primarily) drug development. You don't want to see that go away when the project finishes. I personally expect that the answer will be different for different parts of the project:

ADD REPLY
0
Entering edit mode
  • The toolbox is developed as an OpenSource project. There might be parts that will be developed under a dual licensing model, but since a lot of that extends on things that are already available under for instance Creative Commons and Apache licenses those are likely to persist.
  • The available content (the collected knowledge in the form of concepts and nanopublications/semantic triples) will also be available, but it may very well be dual licensed. (Meaning that if you want to make money using it, you will have to pay to keep it maintained).
ADD REPLY
0
Entering edit mode
  • The questions used to reason with (so not the content but the questions you want to answer using that content) will be closed and the aim is to offer a secure environment for that. After all this is for drug development, and you don't want to see drugs not being developed because they are no longer patentable.
ADD REPLY

Login before adding your answer.

Traffic: 1439 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6