biomaRt external_gene_name same entry multiple times when mapping to homolog mouse gene
0
0
Entering edit mode
3.2 years ago
nhaus ▴ 300

Hello,

I am trying to convert human gene names to mouse gene names using biomaRt. Here is the code I am using:

library("biomaRt")

ensembl = useMart("ensembl")
ensembl = useDataset("hsapiens_gene_ensembl",mart=ensembl)

all <- getBM(attributes = c("external_gene_name","mmusculus_homolog_associated_gene_name"), mart=ensembl)

The problem here is, that for some genes that I am interested in, there are two mmusculus_homolog_associated_gene_name. One contains the right homolog, the other is simply empty. One such gene is PTPRD.

When I want to perform the actual conversation with this code:

mapping <- getBM(attributes = c("external_gene_name","mmusculus_homolog_associated_gene_name"),
             filters = "external_gene_name",
             values = GENES-OF-INTEREST,
             mart=ensembl)

The entry of PTPRD is picked, where the homolog is empty and not Ptprd. This happens for many different genes, and messes up my conversation table.

If anyone can help me figure out this problem or tell me why there are multiple external_gene_name in the first place, I'd be very happy.

EDIT: Added image to emphasize what I am seeing when analyzing the all object

ensembl mapping homology • 856 views
ADD COMMENT
0
Entering edit mode

NaN generally stands for not-a-number as in "undefined" or "unrepresentable". I don't think there is a gene called nan. There is one called Nans.

ADD REPLY
0
Entering edit mode

Sorry, its not actually NaN. Its just empty. Ill specify this in my question.

ADD REPLY
0
Entering edit mode

You can find a list of human-mouse gene homologs at this link from MGI/Jax.

ADD REPLY

Login before adding your answer.

Traffic: 2049 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6