Question: genes not mapping to ENTREZID
gravatar for adR
7 weeks ago by
adR50 wrote:

Hi Dear Scientists

I want to ask a simple question but important to me. My question is, I have a list of DEG between the two conditions. While converting gene Symbole to their biological IDs(ENTREZID) few genes failed to map. Since they don't have ENTREZID, they aren't related to any KEGG pathways. Therefore, can I consider this unmapped list of genes as candidates to study their novel functions related to my conditions?

Thank you for sharing your knowledge!

entrezid kegg pathway gene • 143 views
ADD COMMENTlink modified 7 weeks ago by waeemarjuriya0 • written 7 weeks ago by adR50

How did you come up with your gene list? Do you expect the genes to be coding/novel? Try BLASTing the sequences that aren't found, or if you only care about KEGG annotations, you can enter a (protein) sequence in Pfam and look at the domain annotations which you can then relate to a KEGG pathway.

ADD REPLYlink written 7 weeks ago by joe180

Thanks, Joe for your answer! Actually, I did not design the study originally but they aim to answer both. Could please direct me on how to do BLASTing?. Where to access it!

ADD REPLYlink written 7 weeks ago by adR50

So "BLASTing" a sequence will help to identify the source (organism, engineered, etc) and comes in 2 main varieties - by DNA sequence and by protein sequence. BLAST will return a list of proteins/DNA with similarity to your sequence.

Alternatively/additionally, Pfam has a "sequence search" option which allows you to identify domains within a protein sequence, which can help you identify what type of protein you have if there are no similar proteins found by BLAST.

ADD REPLYlink written 7 weeks ago by joe180
gravatar for Kevin Blighe
7 weeks ago by
Kevin Blighe71k
Republic of Ireland
Kevin Blighe71k wrote:

They may be non-coding RNAs or just predicted genes, or something else. Can you paste a few IDs? It is expected that some genes will be recorded in one database but not another. Each database has different 'rules' about what should be included | excluded

ADD COMMENTlink written 7 weeks ago by Kevin Blighe71k

Hi Kevin, Thanks for answering my question. Here are three genes that I did not find their KEGG IDs or their ENTREZID. ADCK3, AES, AF09010

ADD REPLYlink modified 7 weeks ago • written 7 weeks ago by adR50

The problem seems to be that these are aliases of what is the official gene symbol. For example:

You can try to avail of limma's alias2Symbol(), if you want:

Otherwise, try to get the source annotation that was used, which may still have the Entrez mappings.

ADD REPLYlink modified 7 weeks ago • written 7 weeks ago by Kevin Blighe71k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1585 users visited in the last hour