Question: Genome annotation using transcriptome data
KG wrote (6 months ago):

Hi,

We have generated a new version of the genome assembly for a yeast species. We have also sequenced its transcriptome. Now we would like to annotate the genome. My questions are:

  1. How can the transcriptome data be used for annotation?
  2. Can you recommend a genome annotation pipeline that uses transcriptome data for functional validation of the annotated features?

Thank you for your time and help.


Are you looking for a validation of predicted transcripts? You could try a genome-guided transcript assembly, then map the transcripts back to the genome and compare the predicted to the assembled transcripts.
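One way to picture the comparison step is the minimal Python sketch below. It assumes both transcript sets are available as GTF text with `exon` records and a `transcript_id` attribute, and it calls a predicted multi-exon transcript "supported" when an assembled transcript on the same sequence and strand has an identical intron chain; single-exon predictions only need to overlap an assembled transcript. The function names and the example records are hypothetical, and dedicated comparison tools apply more nuanced rules than this.

```python
import re
from collections import defaultdict


def read_exons(gtf_text):
    """Collect exon intervals per (seqname, strand, transcript_id) from GTF text."""
    exons = defaultdict(list)
    for line in gtf_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.split("\t")
        if len(fields) < 9 or fields[2] != "exon":
            continue
        match = re.search(r'transcript_id "([^"]+)"', fields[8])
        if match is None:
            continue
        key = (fields[0], fields[6], match.group(1))
        exons[key].append((int(fields[3]), int(fields[4])))
    return exons


def intron_chain(exon_list):
    """Return the introns (1-based, inclusive) between consecutive sorted exons."""
    ordered = sorted(exon_list)
    return tuple(
        (left_end + 1, right_start - 1)
        for (_, left_end), (right_start, _) in zip(ordered, ordered[1:])
    )


def supported_transcripts(predicted_gtf, assembled_gtf):
    """Map each predicted transcript id to True/False (supported by the assembly)."""
    predicted = read_exons(predicted_gtf)
    assembled = read_exons(assembled_gtf)

    assembled_chains = set()
    assembled_spans = defaultdict(list)
    for (seqname, strand, _), exon_list in assembled.items():
        chain = intron_chain(exon_list)
        if chain:
            assembled_chains.add((seqname, strand, chain))
        span = (min(s for s, _ in exon_list), max(e for _, e in exon_list))
        assembled_spans[(seqname, strand)].append(span)

    support = {}
    for (seqname, strand, tid), exon_list in predicted.items():
        chain = intron_chain(exon_list)
        if chain:
            # Multi-exon: require an identical intron chain.
            support[tid] = (seqname, strand, chain) in assembled_chains
        else:
            # Single-exon: any overlap on the same strand counts (a simplification).
            start, end = exon_list[0]
            support[tid] = any(
                s <= end and start <= e for s, e in assembled_spans[(seqname, strand)]
            )
    return support


def _gtf_line(seqname, start, end, strand, tid):
    """Build one exon record for the made-up example data below."""
    attrs = f'transcript_id "{tid}";'
    return "\t".join([seqname, "example", "exon", str(start), str(end), ".", strand, ".", attrs])


# Made-up example records, for illustration only.
PREDICTED = "\n".join([
    _gtf_line("chr1", 100, 200, "+", "predA"),
    _gtf_line("chr1", 300, 400, "+", "predA"),
    _gtf_line("chr1", 1000, 1200, "+", "predB"),
    _gtf_line("chr1", 2000, 2100, "+", "predC"),
    _gtf_line("chr1", 2200, 2300, "+", "predC"),
])
ASSEMBLED = "\n".join([
    _gtf_line("chr1", 90, 200, "+", "asmX"),
    _gtf_line("chr1", 300, 450, "+", "asmX"),
    _gtf_line("chr1", 1100, 1300, "+", "asmY"),
    _gtf_line("chr1", 2000, 2100, "+", "asmZ"),
    _gtf_line("chr1", 2250, 2300, "+", "asmZ"),
])

support = supported_transcripts(PREDICTED, ASSEMBLED)
```

Matching on intron chains rather than exact exon boundaries tolerates differing transcript ends, which tend to vary between a gene predictor and an RNA-seq assembler. Unstranded single-exon assemblies (strand `.`) would need extra handling that this sketch leaves out.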

With respect to "functional validation", are you referring to gene function? A validated functional annotation would require functional assays such as knock-down, knock-out, localization, overexpression, binding assays, etc. Or do you want to infer function from gene expression patterns?

written 6 months ago by Michael Dondrup

I think Augustus can take transcript data as input. I would go with Michael's approach (genome-guided transcript assembly) and also run de novo transcript assemblies with several different assemblers, then maybe have a look at Mikado to reconcile the resulting transcript sets.

When you have a set of transcripts supported by your RNA-seq data, you can go down the traditional bioinformatics route for functional annotation (although, as Michael says, this is really functional prediction). That is, you predict ORFs (e.g. with TransDecoder) and can then run InterProScan on them for functional annotation. Additionally (if these are not already covered by InterProScan), you could run hmmscan and blastp on the translated ORFs, and blastn on the untranslated (nucleotide) sequences to find ncRNAs, etc.
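To illustrate what "predicting ORFs" means at its simplest, here is a small Python sketch that scans all six reading frames of a transcript for the longest ATG-to-stop open reading frame and translates it. It is not how TransDecoder works internally (that tool also scores coding potential); the function names are hypothetical, and the standard genetic code is an assumption that should be checked, since some yeasts use an alternative nuclear code in which CTG is read as serine.

```python
from itertools import product

# Standard genetic code (assumption: verify the correct table for your species).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def longest_orf(seq, min_aa=3):
    """Return (strand, start, end, protein) for the longest ATG..stop ORF, or None.

    start/end are 0-based, end-exclusive, and refer to the scanned strand
    (for "-" they index the reverse complement). end includes the stop codon.
    """
    seq = seq.upper()
    best = None
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon == "ATG":
                    start = i
                elif start is not None and CODON_TABLE.get(codon) == "*":
                    # Translate from the start codon up to (not including) the stop.
                    protein = "".join(
                        CODON_TABLE.get(s[j:j + 3], "X") for j in range(start, i, 3)
                    )
                    if len(protein) >= min_aa and (best is None or len(protein) > len(best[3])):
                        best = (strand, start, i + 3, protein)
                    start = None
    return best


# Toy transcript, for illustration only.
orf = longest_orf("CCATGGCTAAATGACC")
```

The translated protein from such a step is what would then be passed to InterProScan, hmmscan or blastp. A real run would use a much larger `min_aa` than the toy default here.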

written 6 months ago by cschu1811.5k