Question: misc_RNA in Ensembl
0
gravatar for Martombo
6.2 years ago by
Martombo2.6k
Seville, ES
Martombo2.6k wrote:

Do you know which are the criteria used to classify a gene as "misc_RNA" by Ensembl? I couldn't find an answer on the Ensembl page describing non-coding RNA: 

http://www.ensembl.org/info/genome/genebuild/ncrna.html.

A few example of such genes retrieved from BioMart:

http://www.ensembl.org/Homo_sapiens/Gene/Summary?g=ENSG00000207157;r=13:23726725-23726825;t=ENST00000384428

http://www.ensembl.org/Homo_sapiens/Gene/Summary?g=ENSG00000242037;r=13:95351479-95351756;t=ENST00000470538

http://www.ensembl.org/Homo_sapiens/Gene/Summary?g=ENSG00000223298;r=13:95963084-95963209;t=ENST00000411366

these examples show that they are pseudogenes. why aren't they associated to the "pseudogene" gene type? what makes them misc_RNA?

thank you!

gene type biomart ensembl genome • 4.3k views
ADD COMMENTlink modified 6.2 years ago by Emily_Ensembl21k • written 6.2 years ago by Martombo2.6k
3
gravatar for Emily_Ensembl
6.2 years ago by
Emily_Ensembl21k
EMBL-EBI
Emily_Ensembl21k wrote:

misc_RNA is defined as any ncRNA that we can't categorise as anything else.

If you look at your genes of interest here, the word 'pseudogene' is found in the gene name ('RNA, Ro-associated Y3 pseudogene 4 [Source:HGNC Symbol;Acc:42488]'), which we lift directly from HGNC. However, these genes do not fit our definition of pseudogenes so are not classified as such. We can't change the official HGNC name, but we will only annotate genes as what we believe them to be.

ADD COMMENTlink written 6.2 years ago by Emily_Ensembl21k
1

There is some cryptic circularity going on here.

For example I can't provenance RNY3P4 as anything. It appears to be a RefSeq prediction of something http://www.ncbi.nlm.nih.gov/gene?cmd=Retrieve&dopt=full_report&list_uids=100873808 and points back to HGNC - but I wasn't aware they did predictions of psedogenes (they might annotate) so where did this come from?

But ENSG00000207157.1 says "No overlapping RefSeq" clips the RefSeq down from 301 to a 101 exon? on the basis of an Rfam model?

But nothing from Havana/Vega in this location?

ADD REPLYlink modified 7 months ago by RamRS28k • written 6.2 years ago by cdsouthan1.8k
1

We got it from an RFam record. And no, no manual annotation on these guys.

ADD REPLYlink modified 7 months ago by RamRS28k • written 6.2 years ago by Emily_Ensembl21k

We're into serious "what is a gene" territory here... I might pose it as a general question. It's getting crucial as more equivocal automated and manual annotations keep stacking up.

ADD REPLYlink modified 7 months ago by RamRS28k • written 6.2 years ago by cdsouthan1.8k

Thanks for your reply! What is the difference between "type" and "locus_tag". I am looking for all rRNA sequences from GenBank file. I am a little confused why there are many 5S_rRNA are labeled as "misc_RNA"? Such as:

gene            493631..493747
                     /gene=ENSDARG00000085618
                     /locus_tag="5S_rRNA"
                     /note="5S ribosomal RNA [Source:RFAM;Acc:RF00001]"
     misc_RNA        493631..493747
                     /gene="ENSDARG00000085618"
                     /db_xref="RFAM_trans_name:5S_rRNA.1275-201"
                     /note="rRNA"
                     /note="transcript_id=ENSDART00000121018"

Thanks in advance.

ADD REPLYlink modified 7 months ago by RamRS28k • written 3.2 years ago by AlicePsyche30
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 954 users visited in the last hour