I am currently converting a list of microarray probe ids to ENSEMBL IDs through biomaRt. I am currently in excel and I am seeing the same probe ID mapped to two different ENSEMBL IDs. Any idea what this means? I thought ENSEMBL IDs were based on the genes, not splicing variates or isoforms. Thanks!
Hi Ben,
biomaRt gave me each probe ID mapped to multiple ENSG# IDs. As in 1 probe had been assigned to multiple genes. I would understand if one gene ID had multiple array IDs, but that does not seem to be the case. Any insight on this?
Hi,
Could you share you biomaRt query?
Hi,
Sorry for the late reply. The query was as follows:
library(biomaRt) ensembl = useMart("ensembl",dataset="hsapiens_gene_ensembl") probeids=read.table(file.choose()) ##single column txt file of probe ids getBM(attributes=c('affy_hugene_1_0_st_v1', 'ensembl_gene_id'), filters = 'affy_hugene_1_0_st_v1', values = probeids, mart = ensembl)
it returns me a list of probe ids and ensembl gene ids, but probe ids will be listed twice with 2 DIFFERENT gene ids
Hi,
No problem. Individual probes are approx. 25 bp in length and are grouped together to form probesets. The individual probes can be used in different probesets to assess the expression of more than one gene, since the 25bp sequence can map to more than one region in the genome.
Best wishes
Ben