I'm really a beginner and wonder how to get predicted protein sequence from a genome?
Here is a Genbank assembly of a genome (quite small), and inside there is no available protein sequence to download (e.g. XXX.pep.faa ). Then I try to predict protein sequence by myself useing software like EVM. But when I download the gff3 file of this genome, the gff3 doesn't seem like a standard gff3 file.Instead of seeing the information of CDS or exon, Isee a lot of 'Genbank: URL....'.
Some others suggest me to predict the protein sequence by predicting the ORF first and translate using standard codon.
Do you have any idea about this? Thanks for your time!