Hi,
I am new to Augustus and I used Augustus for fungal genome analysis.
I used command:
augustus --species=human --UTR=on sequence.fasta > sequence_augustus.gff
I got result
----- prediction on sequence number 1 (length = 11239, name = contig00001) -----
# Constraints/Hints:
# Predicted genes for sequence number 1 on both strands
# start gene g1
contig00001 AUGUSTUS gene 1476 4367 1 + . g1
contig00001 AUGUSTUS transcript 1476 4367 . + . g1.t1
contig00001 AUGUSTUS tss 1476 1476 . + . transcript_id "g1.t1"; gene_id "g1";
contig00001 AUGUSTUS exon 1476 1559 . + . transcript_id "g1.t1"; gene_id "g1";
contig00001 AUGUSTUS exon 2030 4367 . + . transcript_id "g1.t1"; gene_id "g1";
contig00001 AUGUSTUS start_codon 2378 2380 . + 0 transcript_id "g1.t1"; gene_id "g1";
contig00001 AUGUSTUS CDS 2378 3223 . + 0 transcript_id "g1.t1"; gene_id "g1";
contig00001 AUGUSTUS stop_codon 3221 3223 . + 0 transcript_id "g1.t1"; gene_id "g1";
contig00001 AUGUSTUS tts 4367 4367 . + . transcript_id "g1.t1"; gene_id "g1";
# protein sequence = [MLDLEIAKNCADGELDGKMVEEHPGVDNKDGSHTDSKGGNKAKGEADWAGQAELPSHVTQTPIETELPLTIAPAIDAH
# TATEGVVSAAVVAANTRATASIGTSNALGNLSKLPEISRSLIYHFVTAETDFPCVNSECRPPRLLSTISKTIKKEVELYYRCNHRTLMVELNPFEFHH
# LEDSFPAWSLEEYRAYMAPYMKPVTSRVHDLSIVDEVIIHLADDGPDCLTLLYTLALDHTDPTIPGQSDHEPFRLTLEYASAEGDETRDMHNHEYCTR
# FRTFFEP]
# end gene g1
My question what above result indicates? And what about protein sequence? If this protein correspond to the first Nucleotide sequence (contig 1) in my complete fasta file (contains many contigs), should I use this protein sequence and do the BATCH CD search for annotation? Or any other tools for annotation for fungal genome?
Any suggestions.
Regards!
Shashank
Thank you for the information about Augustus.
Actually my purpose of doing this is to annotate the fungal genome. I am not bioinformatician, I have seen MAKER, it requires many dependencies (somehow I am unable to make it work).
Is there any server and easy software where I can annotate fungal genome?
Best!
Shashank