Question: Text-Mining Clinicaltrials.Gov For Drug-Gene Interactions
gravatar for Obi Griffith
7.4 years ago by
Obi Griffith18k
Washington University, St Louis, USA
Obi Griffith18k wrote:

I have little direct experience with text-mining tools. Can anyone suggest a good tool or approach for text-mining drug-gene relationships from clinical trials available at They provide xml files for each clinical trial record but unfortunately gene information is not a standard field (but often mentioned in free-form descriptive fields). I would have a list of genes and a list of drugs and want to know when they co-occur in a clinical trials record. However, it would be nice to get more than just simple co-occurrence. Is anyone aware of a tool that could rank co-occurrences in some reasonable way based on term incidence, proximity, natural language processing concepts, etc. Here is an example record to give some context.

gene drug • 2.4k views
ADD COMMENTlink written 7.4 years ago by Obi Griffith18k
gravatar for Arun
7.4 years ago by
Arun2.3k wrote:

There was a recent blog post from here: It mentions some of the best resources available for text mining. Does this help at all, at least to get you started?

ADD COMMENTlink written 7.4 years ago by Arun2.3k
gravatar for Mary
7.4 years ago by
Boston MA area
Mary11k wrote:

What you are trying reminds me of the XplorMed tool.

It used to have more features, but it might still work for you with your input data. It used to be able to start with a keyword, ID, or PubMed query and look for co-occurrence of terms, with ranking. Currently it asks for abstracts but you might be able to fake it out with the Clinical Trials xml records instead. At least it's worth a try.

It would probably help to read their publications about how they did it even if it doesn't work, and you might be able to get the software and tweak it yourself if the abstract trick doesn't work.

ADD COMMENTlink written 7.4 years ago by Mary11k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1969 users visited in the last hour