Total number of genes in hg38
3
0
Entering edit mode
6.5 years ago

I am studying RNA-Seq data and I need to know how many genes are incorporated in reference genome h38 build (UCSC) ?

RNA-Seq • 3.7k views
ADD COMMENT
2
Entering edit mode

This was this article in 2015.

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4339237/

See Figure 4 there. Three databases had their own opinions.

28442 for UCSC at that time

ADD REPLY
0
Entering edit mode

Hi Natasha, thank you for sharing the paper. I now have the answer to my question.

ADD REPLY
1
Entering edit mode

Hi Glory Basumata,

If an answer was helpful you should upvote it, if the answer resolved your question you should mark it as accepted.
Upvote|Bookmark|Accept

Cheers,
Wouter

ADD REPLY
0
Entering edit mode

Thanks for the update Wouter. I am a new user here in this community, so I didn't know about upvote :) Cheers!

ADD REPLY
0
Entering edit mode

What have you tried?

ADD REPLY
5
Entering edit mode
6.5 years ago

locate refGene file, uncompress it, look for the 13th column containing gene names, make sure there are no repetitions, and count them:

curl http://hgdownload.cse.ucsc.edu/goldenPath/hg38/database/refGene.txt.gz \
| zcat | cut -f13 | sort -u | wc -l
28054
ADD COMMENT
0
Entering edit mode

Thank you Jorge. This helped me.

ADD REPLY
4
Entering edit mode
6.5 years ago

These metrics are summarized by Ensembl (for their annotation) at https://www.ensembl.org/Homo_sapiens/Info/Annotation

ADD COMMENT
0
Entering edit mode

Thank you for your help WouterDeCoster.

ADD REPLY
4
Entering edit mode
6.5 years ago
GenoMax 148k

GENCODE has a statistics page for this information.

ADD COMMENT
0
Entering edit mode

Hi genomax, thank you for sharing the weblink. I now have rough estimate of the no. of genes to work with.

ADD REPLY

Login before adding your answer.

Traffic: 1479 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6