Question: Microarray Probe IDs to ENSEMBL ID
0
gravatar for Scott McKay
10 weeks ago by
Scott McKay20
Scott McKay20 wrote:

I am currently converting a list of microarray probe ids to ENSEMBL IDs through biomaRt. I am currently in excel and I am seeing the same probe ID mapped to two different ENSEMBL IDs. Any idea what this means? I thought ENSEMBL IDs were based on the genes, not splicing variates or isoforms. Thanks!

ADD COMMENTlink modified 10 weeks ago by Ben_Ensembl1.0k • written 10 weeks ago by Scott McKay20
0
gravatar for Ben_Ensembl
10 weeks ago by
Ben_Ensembl1.0k
EMBL-EBI
Ben_Ensembl1.0k wrote:

Hi Scott McKay,

In the annotation process, Ensembl annotates transcripts and groups them together to form genes. We use stable IDs for all of our annotated transcripts and genes (and other annotated features too). We use ENSG# IDs for human genes, and ENST# IDs for human transcripts. There is further documentation about the stable IDs in Ensembl on the following pages: [1] http://www.ensembl.org/info/genome/stable_ids/index.html [2] http://www.ensembl.org/info/genome/stable_ids/prefixes.html

Then, Ensembl maps microarray probes to the individual transcripts of a gene: http://www.ensembl.org/info/genome/microarray_probe_set_mapping.html

Best wishes,

Ben Ensembl Helpdesk

ADD COMMENTlink written 10 weeks ago by Ben_Ensembl1.0k

Hi Ben,

biomaRt gave me each probe ID mapped to multiple ENSG# IDs. As in 1 probe had been assigned to multiple genes. I would understand if one gene ID had multiple array IDs, but that does not seem to be the case. Any insight on this?

ADD REPLYlink written 9 weeks ago by Scott McKay20

Hi,

Could you share you biomaRt query?

ADD REPLYlink written 9 weeks ago by Ben_Ensembl1.0k

Hi,

Sorry for the late reply. The query was as follows: library(biomaRt) ensembl = useMart("ensembl",dataset="hsapiens_gene_ensembl") probeids=read.table(file.choose()) ##single column txt file of probe ids getBM(attributes=c('affy_hugene_1_0_st_v1', 'ensembl_gene_id'), filters = 'affy_hugene_1_0_st_v1', values = probeids, mart = ensembl)

it returns me a list of probe ids and ensembl gene ids, but probe ids will be listed twice with 2 DIFFERENT gene ids

ADD REPLYlink written 9 weeks ago by Scott McKay20

Hi,

No problem. Individual probes are approx. 25 bp in length and are grouped together to form probesets. The individual probes can be used in different probesets to assess the expression of more than one gene, since the 25bp sequence can map to more than one region in the genome.

Best wishes

Ben

ADD REPLYlink written 9 weeks ago by Ben_Ensembl1.0k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 759 users visited in the last hour