NCBI H.Sapiens OrgDb missing many EntrezIDs
1
0
Entering edit mode
2.1 years ago
Nathan • 0

I'm using ArchR to analyze a H.sapiens PBMC scATAC dataset I have and decided to use Ensembl's GRCh38, Release 103 genome as my reference.

In order to do this I needed to use ArchR's createGenomeAnnotation & createGeneAnnotation and to define the genome. For createGeneAnnotation an OrgDb object was need which I used AnnotationHub to access.

The specifications of the extracted OrgDb

Once I loaded in this OrgDb I realized there were GENEIDs (EntrezIDs) present in my GTF for GRCh38, Release 103 that were missing from the OrgDb. I thought this meant it wasn't up to date, but realized only one was returned when I queried annotation hub for it like this query(hub, c("Homo sapiens","OrgDb")).

Is there somewhere I can get a more up to date version of the H.sapiens OrgDb? I was under the impression it was regularly updated so had trouble believing that it was missing so many IDs.

Any and all guidance would be greatly appreciated

AnnotationHub AnnotationDbi R ArchR • 673 views
ADD COMMENT
0
Entering edit mode
2.1 years ago
Gordon Smyth ★ 7.0k

Cross-posted (and answered) on Bioconductor https://support.bioconductor.org/p/9142452/

ADD COMMENT

Login before adding your answer.

Traffic: 2309 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6