Remove/Identify pseudogenes from a list of Entrez IDs programmatically
8.8 years ago
salamandra ▴ 550

Given a list of Entrez IDs, how can one programmatically identify those that are pseudogenes, so that they can be removed from the list and only the IDs corresponding to genes are kept?

entrez pseudogenes • 2.4k views
8.8 years ago

You can download the gene_info file for your organism of interest from the NCBI FTP site: ftp://ftp.ncbi.nih.gov/gene/DATA/GENE_INFO/

The column definitions are in ftp://ftp.ncbi.nih.gov/gene/README (see the gene_info section).

The second and tenth columns hold the gene ID ("GeneID") and the gene type ("type_of_gene"), respectively. You will have to write a small script that checks the tenth column for each ID in your list.
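A minimal sketch of such a script in Python follows. It assumes a tab-separated gene_info file in which pseudogenes carry the value `pseudo` in the type_of_gene column (check the README for the current list of values). The function names `pseudogene_ids` and `drop_pseudogenes` and the example file name are hypothetical, chosen only for illustration.

```python
def pseudogene_ids(gene_info_lines):
    """Collect GeneIDs (column 2) whose type_of_gene (column 10) is 'pseudo'.

    gene_info_lines: any iterable of tab-separated text lines,
    e.g. an open file handle on a gene_info file.
    """
    pseudo = set()
    for line in gene_info_lines:
        # Skip the header line (starts with '#') and blank lines.
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        # Assumption: pseudogenes are labelled 'pseudo' in type_of_gene.
        if len(fields) >= 10 and fields[9] == "pseudo":
            pseudo.add(fields[1])
    return pseudo


def drop_pseudogenes(entrez_ids, pseudo):
    """Return the input IDs, in order, minus those found in `pseudo`."""
    return [gid for gid in entrez_ids if str(gid) not in pseudo]


# Example usage (file name is an assumption; use the file you downloaded):
#
#   import gzip
#   with gzip.open("Homo_sapiens.gene_info.gz", "rt") as handle:
#       pseudo = pseudogene_ids(handle)
#   kept = drop_pseudogenes(my_entrez_ids, pseudo)
```

IDs that do not appear in the gene_info file at all are kept by this sketch, so it may be worth checking for those separately.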

Thank you!

Do you know if there's a similar way of identifying deprecated Entrez IDs and replacing old IDs with the corresponding new ones?
