I'd assume that Mouse C57BL/6NJ is closest to the previous mmusculus
Be cautious, if you need Ensembl gene / transcript identifiers (such as ENSG.., ENST...), the "Mouse C57BL/6NJ genes" dataset will not do the trick :
library('biomaRt')
mc57bl6nj = useMart("ENSEMBL_MART_MOUSE",
dataset="mc57bl6nj_gene_ensembl")
mc57bl6nj_infos <- getBM(attributes=c('ensembl_transcript_id',
'ensembl_gene_id',
'external_gene_name'),
mart = mc57bl6nj)
head(mc57bl6nj_infos)
ensembl_transcript_id ensembl_gene_id external_gene_name 1
MGP_C57BL6NJ_T0004927 MGP_C57BL6NJ_G0004247 A930041C12Rik 2
MGP_C57BL6NJ_T0088905 MGP_C57BL6NJ_G0033948 Polb 3
MGP_C57BL6NJ_T0062354 MGP_C57BL6NJ_G0027864 Chrnb2 4
This info is in the "mmusculus_gene_ensembl" dataset, which is somewhat difficult to query atm (the exact same command will either fail or work ...) :
> mmusculus = useMart("ENSEMBL_MART_ENSEMBL",
+ dataset="mmusculus_gene_ensembl")
Error in checkDataset(dataset = dataset, mart = mart) :
The given dataset: mmusculus_gene_ensembl , is not valid. Correct dataset names can be obtained with the listDatasets() function.
> mmusculus = useMart("ENSEMBL_MART_ENSEMBL",
+ dataset="mmusculus_gene_ensembl")
Error in checkDataset(dataset = dataset, mart = mart) :
The given dataset: mmusculus_gene_ensembl , is not valid. Correct dataset names can be obtained with the listDatasets() function.
> mmusculus = useMart("ENSEMBL_MART_ENSEMBL",
+ dataset="mmusculus_gene_ensembl")
When it's working :
mmusculus = useMart("ENSEMBL_MART_ENSEMBL",
dataset="mmusculus_gene_ensembl")
mmusculus_infos <- getBM(attributes=c('ensembl_transcript_id',
'ensembl_gene_id',
'external_gene_name'),
mart = mmusculus)
head(mmusculus_infos)
ensembl_transcript_id ensembl_gene_id external_gene_name
ENSMUST00000082423 ENSMUSG00000064372 mt-Tp
ENSMUST00000082422 ENSMUSG00000064371 mt-Tt
ENSMUST00000082421 ENSMUSG00000064370 mt-Cytb
I hope this get fixed soon
I am getting the same error when using hsapiens_gene_ensembl. Any help would be appreciated.
It seems to be there currently :
Another post is referring to this : bioconductor
I just tried this:
The output is different every time, so you don't always get the same species. Very odd.