Question: Retired Ensembl genes - should I identify their successors?
gravatar for maureen.pittman
4.3 years ago by
United States
maureen.pittman0 wrote:


I am running WGCNA analysis on public RNA-seq expression data. Some of the gene IDs are in Entrez, others in Ensembl. I want them all in Entrez. I tried using both NCBI and BioMart to convert the Ensembl IDs, but it seems that any way you cut it, around 3000 out of 20,000 genes don't have matches. After a little investigating, I discovered that these IDs are "retired" (example).

Should I hunt harder to match up these retired Ensembl IDs to their current equivalent Entrez ID? Or is it safe to assume that these are "fringe elements" whose status as gene was revoked, and it's okay to leave them out of my analysis?

Any insight is appreciated.



ADD COMMENTlink written 4.3 years ago by maureen.pittman0
gravatar for WouterDeCoster
4.3 years ago by
WouterDeCoster44k wrote:

I just checked your example, and by clicking on 'Latest Version' I get to;g=ENSG00000006074;r=17:34391640-34399392, which looks pretty real to me:;g=ENSG00000275385;r=17:36064280-36072032

So just based on this this seems worth investigating further. Some people from Ensembl are active here and will probably be able to give you a better answer.

ADD COMMENTlink written 4.3 years ago by WouterDeCoster44k

Wouter is right. There are often legitimate reasons why we would change the ID of a gene, maybe we've split it into two genes, merged two together, but the gene would still exist. There are also cases where we have retired the genes because they're dodgy. It's worth investigating them, you'll probably gain some and lose some in the process though.

ADD REPLYlink written 4.3 years ago by Emily_Ensembl21k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 603 users visited in the last hour