Hmmer for protein prediction
1
0
Entering edit mode
18 months ago

Hello, Regarding using the Hmmer (3.1b1) my requirement is to search amino acid sequences of predicted protein products against an HMM database prokaryotic viral orthologous groups. Which I know can be done using hmmscan.

The user guide says hmmpress has to be performed to prepare a database to run hmmscan on the the test file. But the vog file from the website ( http://dmk-brain.ecn.uiowa.edu/pVOGs/downloads.html ) downloads a tar.gz file and hmmpress asks for a Hmmer3 file. I have no idea about this step and getting confused over it.

Can anyone help me in this reagrds,Please?

Assembly genome next-gen • 544 views
ADD COMMENT
1
Entering edit mode

you will for sure have to unpack the tar.gz file you downloaded (it will normally contain a number of other files)

tar -zxvf <download.tar.gz>

should do the trick

ADD REPLY
0
Entering edit mode

Yes, I had already done that. Thank you.!

ADD REPLY
1
Entering edit mode

ok, so what was in the un-zipped tar file then? I'm guessing nothing that could be used for resolving your issue?

ADD REPLY
0
Entering edit mode

Thank you for the help.

ADD REPLY
3
Entering edit mode
18 months ago
Mensur Dlakic ★ 12k

hmmscan scores a database of HMMs vs. a database of sequences. An HMM database is what needs to be processed by hmmpress.

Download:

wget http://dmk-brain.ecn.uiowa.edu/pVOGs/downloads/Ampullaviridae/AmpullaviridaevogHMMprofiles.tar.gz

Unpack:

tar -zxvof AmpullaviridaevogHMMprofiles.tar.gz

Concatenate HMMs into a database:

cat AmpullaviridaevogHMMprofiles/*.hmm > Ampullaviridae_all.hmm

Press:

hmmpress Ampullaviridae_all.hmm

Scan:

hmmscan Ampullaviridae_all.hmm your_sequences.fasta > search_output.txt

You may want to delete a directory with individual HMMs:

rm -rf AmpullaviridaevogHMMprofiles
ADD COMMENT
0
Entering edit mode

Thank you so much sir. You're a gem, I was struggling with this since 3 days.

Thank you so much

ADD REPLY

Login before adding your answer.

Traffic: 1682 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6