Question: Genome annotation using transcriptome data
13 months ago, KG (10) wrote:


We have generated a new version of the genome assembly for a yeast species. We have also sequenced the transcriptome. Now we would like to annotate the genome. My questions are:

  1. How can we use the transcriptome data for annotation?
  2. Can you recommend a genome annotation pipeline that uses transcriptome data for functional validation of the annotated features?

Thank you for your time and help.

written 13 months ago by KG (10)

Are you looking for a validation of predicted transcripts? You could try a genome-guided transcript assembly, then map the transcripts back to the genome and compare the predicted to the assembled transcripts.
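To make the comparison step concrete, here is a minimal Python sketch. It assumes that both the predicted gene models and the assembled transcripts are available as GTF text with `exon` features and a `transcript_id` attribute, and it reports, for each predicted transcript, the fraction of its exonic bases overlapped by assembled exons on the same sequence and strand. The function names and the toy records are made up for illustration; dedicated comparison tools do considerably more (for example, matching intron chains).

```python
from collections import defaultdict


def parse_transcripts(gtf_text):
    """Collect exon intervals per (seqname, strand, transcript_id) from GTF text."""
    exons = defaultdict(list)
    for line in gtf_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.split("\t")
        if len(fields) < 9 or fields[2] != "exon":
            continue
        seqname, start, end, strand = fields[0], int(fields[3]), int(fields[4]), fields[6]
        # Pull transcript_id out of the attribute column
        transcript_id = None
        for attr in fields[8].split(";"):
            attr = attr.strip()
            if attr.startswith("transcript_id"):
                transcript_id = attr.split(" ", 1)[1].strip('"')
        if transcript_id is not None:
            exons[(seqname, strand, transcript_id)].append((start, end))
    return exons


def merge(intervals):
    """Merge overlapping or adjacent 1-based closed intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return merged


def covered_fraction(pred_exons, merged_asm_exons):
    """Fraction of predicted exonic bases overlapped by merged assembled exons."""
    total = sum(end - start + 1 for start, end in pred_exons)
    covered = 0
    for start, end in pred_exons:
        for asm_start, asm_end in merged_asm_exons:
            lo, hi = max(start, asm_start), min(end, asm_end)
            if lo <= hi:
                covered += hi - lo + 1
    return covered / total


def support_per_transcript(pred_gtf, asm_gtf):
    """Map each predicted transcript_id to its covered exonic fraction."""
    asm_by_locus = defaultdict(list)
    for (seqname, strand, _tid), exons in parse_transcripts(asm_gtf).items():
        asm_by_locus[(seqname, strand)].extend(exons)
    merged = {key: merge(exons) for key, exons in asm_by_locus.items()}
    return {
        tid: covered_fraction(exons, merged.get((seqname, strand), []))
        for (seqname, strand, tid), exons in parse_transcripts(pred_gtf).items()
    }


def gtf_line(seqname, source, start, end, strand, gene_id, transcript_id):
    """Build one toy GTF exon record (illustrative helper only)."""
    attrs = f'gene_id "{gene_id}"; transcript_id "{transcript_id}";'
    return "\t".join([seqname, source, "exon", str(start), str(end), ".", strand, ".", attrs])


# Toy data: two predicted single-exon genes, one assembled transcript
PRED_GTF = "\n".join([
    gtf_line("chrI", "pred", 100, 199, "+", "g1", "g1.t1"),
    gtf_line("chrI", "pred", 500, 599, "+", "g2", "g2.t1"),
])
ASM_GTF = gtf_line("chrI", "asm", 90, 149, "+", "a1", "a1.1")

if __name__ == "__main__":
    for tid, fraction in support_per_transcript(PRED_GTF, ASM_GTF).items():
        print(tid, fraction)
```

Predicted transcripts with a low covered fraction are either not expressed in the sampled conditions or possibly mispredicted, so they are the ones worth inspecting by hand.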

With respect to "functional validation": are you referring to gene function? Validated functional annotation would require functional assays such as knock-down, knock-out, localization, overexpression, binding assays, etc. Or do you want to infer function from gene expression patterns?

modified 13 months ago • written 13 months ago by Michael Dondrup (46k)

I think Augustus can take transcript data as input. I would go with Michael's approach (genome-guided transcript assembly) as well as de novo transcript assemblies with different assemblers, and then maybe have a look at Mikado.

Once you have a set of transcripts supported by your RNA-seq data, you can follow the traditional bioinformatics route for functional annotation (although, as Michael says, this is really functional prediction). That is, you predict ORFs (e.g. with TransDecoder) and can then use InterProScan for the functional annotation. Additionally, if these are not already included in InterProScan, you could run hmmscan and blastp on the translated ORFs, run blastn on the untranslated (nucleotide) sequences to find ncRNAs, etc.
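To illustrate what the ORF prediction step produces, here is a minimal Python sketch that scans all six reading frames of a transcript and returns the longest complete ORF (ATG to stop) as a protein sequence. This is a naive longest-ORF scan, not TransDecoder's method. It also assumes the standard genetic code, which does not hold for every yeast (some species translate CTG as serine), so check the correct translation table for your species. The function names and the example sequence are made up for illustration.

```python
# Standard genetic code, built in TCAG order; "*" marks stop codons
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def longest_orf(seq):
    """Return the longest complete ORF (M ... stop) over six frames as protein.

    Returns an empty string if no complete ORF is found.
    """
    seq = seq.upper()
    best = ""
    for strand_seq in (seq, reverse_complement(seq)):
        for frame in range(3):
            current = None  # protein being extended, or None outside an ORF
            for i in range(frame, len(strand_seq) - 2, 3):
                # Codons with ambiguous bases are translated as "X"
                amino_acid = CODON_TABLE.get(strand_seq[i:i + 3], "X")
                if current is None:
                    if amino_acid == "M":
                        current = "M"
                elif amino_acid == "*":
                    if len(current) > len(best):
                        best = current
                    current = None
                else:
                    current += amino_acid
    return best


if __name__ == "__main__":
    # Toy transcript containing ATG GCT TGG AAA TAG in the third forward frame
    print(longest_orf("CCATGGCTTGGAAATAGCC"))
```

The resulting protein sequences are what you would pass on to InterProScan, hmmscan, or blastp. A real run would also apply a minimum length cut-off and keep more than one ORF per transcript.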

modified 13 months ago • written 13 months ago by cschu1811.8k