Question: Microarray Probe IDs to ENSEMBL ID
gravatar for Scott McKay
6 months ago by
Scott McKay20
Scott McKay20 wrote:

I am currently converting a list of microarray probe ids to ENSEMBL IDs through biomaRt. I am currently in excel and I am seeing the same probe ID mapped to two different ENSEMBL IDs. Any idea what this means? I thought ENSEMBL IDs were based on the genes, not splicing variates or isoforms. Thanks!

ADD COMMENTlink modified 6 months ago by Ben_Ensembl1.2k • written 6 months ago by Scott McKay20
gravatar for Ben_Ensembl
6 months ago by
Ben_Ensembl1.2k wrote:

Hi Scott McKay,

In the annotation process, Ensembl annotates transcripts and groups them together to form genes. We use stable IDs for all of our annotated transcripts and genes (and other annotated features too). We use ENSG# IDs for human genes, and ENST# IDs for human transcripts. There is further documentation about the stable IDs in Ensembl on the following pages: [1] [2]

Then, Ensembl maps microarray probes to the individual transcripts of a gene:

Best wishes,

Ben Ensembl Helpdesk

ADD COMMENTlink written 6 months ago by Ben_Ensembl1.2k

Hi Ben,

biomaRt gave me each probe ID mapped to multiple ENSG# IDs. As in 1 probe had been assigned to multiple genes. I would understand if one gene ID had multiple array IDs, but that does not seem to be the case. Any insight on this?

ADD REPLYlink written 6 months ago by Scott McKay20


Could you share you biomaRt query?

ADD REPLYlink written 6 months ago by Ben_Ensembl1.2k


Sorry for the late reply. The query was as follows: library(biomaRt) ensembl = useMart("ensembl",dataset="hsapiens_gene_ensembl") probeids=read.table(file.choose()) ##single column txt file of probe ids getBM(attributes=c('affy_hugene_1_0_st_v1', 'ensembl_gene_id'), filters = 'affy_hugene_1_0_st_v1', values = probeids, mart = ensembl)

it returns me a list of probe ids and ensembl gene ids, but probe ids will be listed twice with 2 DIFFERENT gene ids

ADD REPLYlink written 5 months ago by Scott McKay20


No problem. Individual probes are approx. 25 bp in length and are grouped together to form probesets. The individual probes can be used in different probesets to assess the expression of more than one gene, since the 25bp sequence can map to more than one region in the genome.

Best wishes


ADD REPLYlink written 5 months ago by Ben_Ensembl1.2k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 696 users visited in the last hour