convert pseudogene IDs in Gencode releases to HGNC gene symbols
1
0
Entering edit mode
8.9 years ago
biocyberman ▴ 860

The pseudogene file I downloaded from GeneCode looks like this:

chr10   Yale_UCSC       transcript      5304431 5304865 .       -       .       gene_id "PGOHUM00000289731"; transcript_id "PGOHUM00000289731"; gene_type "pseudogene"; gene_status "UNKNOWN"; gene_name "PGOHUM00000289731"; transcript_type "pseudogene"; transcript_status "UNKNOWN"; transcript_name "PGOHUM00000289731"; level 3; tag "2way_pseudo_cons"; yale_id "PGOHUM00000289731"; ucsc_id "NM_000987.3-9"; parent_id "ENSG00000161970";
chr10   Yale_UCSC       transcript      6071517 6071921 .       -       .       gene_id "PGOHUM00000289732"; transcript_id "PGOHUM00000289732"; gene_type "pseudogene"; gene_status "UNKNOWN"; gene_name "PGOHUM00000289732"; transcript_type "pseudogene"; transcript_status "UNKNOWN"; transcript_name "PGOHUM00000289732"; level 3; tag "2way_pseudo_cons"; yale_id "PGOHUM00000289732"; ucsc_id "NM_000994.3-10"; parent_id "ENSG00000144713";
chr10   Yale_UCSC       transcript      6293104 6294274 .       -       .       gene_id "PGOHUM00000289733"; transcript_id "PGOHUM00000289733"; gene_type "pseudogene"; gene_status "UNKNOWN"; gene_name "PGOHUM00000289733"; transcript_type "pseudogene"; transcript_status "UNKNOWN"; transcript_name "PGOHUM00000289733"; level 3; tag "2way_pseudo_cons"; yale_id "PGOHUM00000289733"; ucsc_id "NM_001007070.1-2"; parent_id "ENSG00000137575";
chr10   Yale_UCSC       transcript      7295121 7295279 .       -       .       gene_id "PGOHUM00000289735"; transcript_id "PGOHUM00000289735"; gene_type "pseudogene"; gene_status "UNKNOWN"; gene_name "PGOHUM00000289735"; transcript_type "pseudogene"; transcript_status "UNKNOWN"; transcript_name "PGOHUM00000289735"; level 3; tag "2way_pseudo_cons"; yale_id "PGOHUM00000289735"; ucsc_id "NM_004374.3-4"; parent_id "ENSG00000164919";

How do I convert the gene_id (e.g PGOHUM00000289731) to HGNC symbols? Biomart and BioDBnet do not seem to offer this

pseudogene gene genecode • 2.3k views
ADD COMMENT
1
Entering edit mode
8.9 years ago
Ashis ▴ 90

You can probably download the complete HGNC dataset. Here the "pseudogene.org" column contains gene ids similar to your ones. You might want to write a small program to get corresponding hgnc symbols.

ADD COMMENT
0
Entering edit mode

The answer did not point exactly to what I was looking for, but it is sufficient to get me there. The file I need to download is this one: ftp://ftp.ebi.ac.uk/pub/databases/genenames/new/tsv/locus_groups/pseudogene.txt

ADD REPLY

Login before adding your answer.

Traffic: 2104 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6