Gene And Transcript Id Conversion
1
1
Entering edit mode
11.8 years ago
Davy ▴ 410

Hi All,

So I'm trying to basically get a list of all Entrezgenes and their corresponding gene names, and transcript ids. I've tried biomart ensembl and UCSC genome browser but had no luck with either. Ensembl gives me lots of missing rows of data, and ucsc doesn't appear to offer any entrez ids at all. Anyone know where I could download, or query a table of all entrezgenes?

• 4.6k views
ADD COMMENT
3
Entering edit mode
11.8 years ago

All the information you're looking is available under: ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/

ADD COMMENT
0
Entering edit mode

Excuse my ignorance, but any suggestions for the best way to deal with these files? I was looking into parsing them with python, but that seems very DIY for something that should be relatively routine.

ADD REPLY
1
Entering edit mode

I would use the ASN1 files, convert it to XML using asn2xml ( http://www.ncbi.nlm.nih.gov/IEB/ToolBox/XML/ncbixml.txt ) and parse it with a (python-based) sax parser

ADD REPLY

Login before adding your answer.

Traffic: 2560 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6