Question: Annotation Human genome transcripts, Transcript Support Level meaning
gravatar for MarVi
16 months ago by
MarVi20 wrote:

Dear all,

I have som few questions related to the annotation of non-coding transcripts. Hopefully, someone here knows some or all the answers.

Does someone know what does it mean the Transcript support level: NA in some of the transcripts in the Gencode/Ensembl annotation gtf? In their web site, they specified that those transcripts were not analyzed for several reasons transcript quality tags. Then, those non-coding transcripts ou pseudogenes might be suspect and not valid. But, how those transcripts were annotated in their very beginning? What does it mean that they weren't analyzed? How these non-coding transcripts are included in the annotation files? I found that some of them are predicted by algorithms, those algorithms are based on transcripts assemblies alignments or...? and when it says the transcript is 'manually' annotated, what manually is?

I thank in advance the answers.

ADD COMMENTlink modified 16 months ago by Emily_Ensembl21k • written 16 months ago by MarVi20
gravatar for Emily_Ensembl
16 months ago by
Emily_Ensembl21k wrote:

As it says on the piece of documentation you linked to, TSL:NA means that the transcript was not analysed for one of the following reasons:
* pseudogene annotation, including transcribed pseudogenes
* human leukocyte antigen (HLA) transcript
* immunoglobin gene transcript
* T-cell receptor transcript
* single-exon transcript (will be included in a future version)

TSL is analysed after the transcripts are annotated, not as part of the annotation. These kinds of transcripts, which are different to most transcripts and therefore cannot be analysed in the same way are excluded.

The processes of automatic and manual annotation are all described in the documentation. This paper describes the automatic annotation in detail and this paper has more detail on the manual annotation.

ADD COMMENTlink written 16 months ago by Emily_Ensembl21k

Thank you very much for your kind response!

I have been revising the literature that you cited here, and it is very useful! However, I still have a question, why there are some transcripts that are pseudogenes but are assigned to a TSL different from "NA"?

ADD REPLYlink written 16 months ago by MarVi20
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1358 users visited in the last hour