Question: question on genes with ensembl gene ID, but without associated gene name and corresponding Entrez ID
0
gravatar for Yizhao
5.8 years ago by
Yizhao10
China
Yizhao10 wrote:

HI,

Now, i encounter a confusion about gene info mapping between ensembl ID(database accession in Ensembl) and ENTREZ ID(gene accession in NCBI GENE). I've found that not all genes with ensembl ID possess corresponding ENTREZ IDs, but still annotate to specific GO terms. Could anybody  help me make it clear? I want to know why, why did the  researchers tolerate this kind of gene annotation? 

Any answer or hint will be appreciated. Thx in advance.

 

ensembl gene feature • 4.9k views
ADD COMMENTlink modified 5.7 years ago • written 5.8 years ago by Yizhao10
2
gravatar for Devon Ryan
5.8 years ago by
Devon Ryan96k
Freiburg, Germany
Devon Ryan96k wrote:

The various organizations use different criterion and algorithms to call genes, so there's no reason to expect any two of them to agree on every gene.

ADD COMMENTlink written 5.8 years ago by Devon Ryan96k

Due to your reply, there's no exactly answer to leo.ng's question? Sometime i also have the same problem like this. I've found that not all genes with ensembl ID possess corresponding ENTREZ IDs, but still annotate to specific GO terms. Can someone help me to make it clear?

ADD REPLYlink written 5.8 years ago by delta.realestate.vn0

Sure, there are a number of ways of associating a given gene with a GO term. Among these are finding a similar gene with known function, in which case you can borrow it's GO terms (other possibilities include using gene covariation modules to functionally group genes, though I don't know how often this is used in practice).

ADD REPLYlink written 5.8 years ago by Devon Ryan96k
1
gravatar for Magali_Ensembl
5.8 years ago by
United Kingdom
Magali_Ensembl130 wrote:

As has already pointed out, the different gene sets from the different resources do not fully overlap.

To overcome this, we (Ensembl) try to map our gene models to as many external sources as possible.

This way, a gene might not have a corresponding match in RefSeq (EntrezGene) but it can map a Uniprot entry. This allows us to assign GO terms from different sources.

 

Hope that helps,

Magali

ADD COMMENTlink written 5.8 years ago by Magali_Ensembl130

OK, but it would help if the HGNC, UniProt, Entrez Gene and Ensembl teams got together to sort out which "genes" where and why they don't overlap, at least for human proteins.

ADD REPLYlink modified 5.8 years ago • written 5.8 years ago by cdsouthan1.8k

In a way, isn't this the point of gencode? It's a collaboration between Ensembl, UCSC, etc. etc. to annotate genomes. That's about the most definitive source you'll get. I'll also add the for mouse and human, the new addition of TSLs to Ensembl is quite nice in this regard.

ADD REPLYlink written 5.8 years ago by Devon Ryan96k
0
gravatar for cdsouthan
5.8 years ago by
cdsouthan1.8k
cdsouthan1.8k wrote:

This question comes up all the time. While Ryan is right, what you can do practically  is generate consensus sets.  However the numbers depend on which portal, what starting points and the sequence in which you make the intersects.

This simple UniProt query

http://www.uniprot.org/uniprot/?query=database%3A%28type%3Aensembl%29+AND+reviewed%3Ayes+AND+organism%3A%22Homo+sapiens+%28Human%29+[9606]%22+AND+database%3A%28type%3Ageneid%29&sort=score

tells you that 18,324 protein "genes"  (in the cannonical SwissProt sense) agree with both EGIDs and Ensembl IDs

Coming from the EGID side

http://www.ncbi.nlm.nih.gov/gene/?term=%22Homo+sapiens%22[porgn]+AND+%22matches+ensembl%22[Properties]

gives 21569 but these are not all proteins

The Ensembl coding side gives 20,364 (incl. 509 readthrough)

Note if the Ensembl/Havanna gene build includes an ORF that EGID does not  (I think) it may sometimes  get a GO term via IntePro-to-GO

ADD COMMENTlink modified 5.8 years ago • written 5.8 years ago by cdsouthan1.8k
0
gravatar for Yizhao
5.7 years ago by
Yizhao10
China
Yizhao10 wrote:

Hi, all,

thx for all your enthusiastic reply.

Sorry for the late follow-up because of my trip.

Anyway, Thanks .

ADD COMMENTlink written 5.7 years ago by Yizhao10
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1659 users visited in the last hour