Search genomes for hits from protein HMM
1
0
Entering edit mode
2.4 years ago
Gio ▴ 10

Hello all,

My goal is to build a phylogenetic tree of a selected set of genomes based on proteins solely related to RM systems.

Thus I have aligned all of my RM system proteins. Selected the relevant genomes. Built an HMM from the RM protein sequences using HMMER3.

I now need to search the genomes (DNA sequences in FASTA format) using this HMM built from proteins.

However HMMER doesn't seem to have a search function across alphabets (DNA/AA). How can this be accomplished? Do I have to translate my >200 genomes into proteins using a program that finds and translates ORF? Any recommendations for such a program?

Thank you

hmm phylogeny • 904 views
ADD COMMENT
1
Entering edit mode
2.4 years ago
Mensur Dlakic ★ 30k

Yes, you will have to do gene finding first.

https://github.com/hyattpd/Prodigal

prodigal -i genome.fa -a proteome.faa -o /dev/null -q
ADD COMMENT

Login before adding your answer.

Traffic: 4096 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6