Tutorial: InsideDNA: Improving gene prediction with protein profiles in Augustus for annotation of vision-related genes in Zebrafish
gravatar for shilparaopradeep
3.9 years ago by
shilparaopradeep190 wrote:

Gene prediction is one of the most common tasks in bioinformatic analysis of newly sequenced genomes. AUGUSTUS is an excellent gene prediction tool which works with eukaryotic genomes. It allows to predict genes ab initio (de novo) or based on some hints (e.g. RNA-seq/EST, protein alignments, synthetic genomic alignment). In this tutorial we explain how to use protein profiles to improve gene search in the genomic fasta files. For this purpose, we discuss AUGUSTUS protein profile extension (PPX) and explain all steps necessary to run a prediction with an addition of a protein profile.

PPX extension allows to supplement gene prediction procedure with the information about protein family conservation. Information about protein family conservation normally comes from so called protein block profiles. Normally, protein profile files contain position-specific frequency matrices that model conserved regions in a multiple sequence alignment (MSA) with no indels. When PPX extension is used for gene prediction, those genes that match provided profiles are predicted with much higher prediction accuracy then the rest of the genes predicted ab-initio.

Read more:http://bit.ly/1UPvdTG

tutorial bioinformatics genome • 1.8k views
ADD COMMENTlink modified 3.6 years ago by Devon Ryan94k • written 3.9 years ago by shilparaopradeep190
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1050 users visited in the last hour