MESH terms for list of pubmed IDs
1
0
Entering edit mode
3.9 years ago
bioguy ▴ 50

I have a list of pubmed IDs and am trying to use literally any tool to access any 1) paper authors 2) date of publication and 3) MESH terms (or other identifying characteristics, e.g. field of study etc) for each paper. Ideally would end up with everything in csv or JSON format, but I'll take whatever.

Looking here (How do I get full text AND MeSH terms from Entrez on the PMC database in biopython?) but having a hard time reproducing the solution. Can anyone out there assist? Been playing with biopython entrez library but having limited luck. Example code to get list of 10 random IDs:

get list of IDs

def search(query):
Entrez.email = 'your.email@example.com'
handle = Entrez.esearch(db='pubmed', 
                        sort='relevance', 
                        retmax='10',
                        retmode='xml', 
                        term=query)
results = Entrez.read(handle)
return(results)

id_list = search('infectious disease')

MESH terms text mining pubmed esearch entrez • 2.7k views
ADD COMMENT
2
Entering edit mode

Figured it out: can download full data from here, https://mbr.nlm.nih.gov/Download/2019/Data/, and intersect with pubmed ID

ADD REPLY
1
Entering edit mode
3.9 years ago

If you have a lot of pubmed IDs, maybe you can consider downloading all available pubmed data from ftp://ftp.ncbi.nlm.nih.gov/pubmed/baseline. It's a really huge directory, but every pmid and their annotation (author, citation, MeSH, title, etc ...) are in.

Hope it helps !

ADD COMMENT
0
Entering edit mode

This is magnificent, thank you so much.

ADD REPLY

Login before adding your answer.

Traffic: 2521 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6