Question: HMMER: hmmscan ---> hmmsearch
13 months ago by
Metaxo0 wrote:

I'm using HMMER to find domains in a set of viral contigs (in a FASTA file) by comparing them against the Pfam database. I'm not using MSAs because most of the contigs are unrelated and cannot be aligned. I tried hmmscan first, but it runs far too slowly. I wanted to use hmmsearch instead, but as I understand it the query has to be in HMM profile format. Converting the FASTA file to an HMM with hmmbuild would require an MSA, so I'm stuck.

So my question is: is there any way I can use hmmsearch instead of hmmscan with a non-MSA FASTA file?

hmmer protein pfam hmm • 1.4k views
modified 9 months ago by h.mon (30k) • written 13 months ago by Metaxo (0)

You could switch to HHSuite instead.

written 13 months ago by Joe (17k)

Will that be faster than hmmscan?

written 13 months ago by Metaxo (0)

I don't have speed comparisons to hand, but HHsuite is fast.

It will accept a list of FASTA files as input, though, so it solves your input issue. If you're using long sequences, I think it will be reasonably slow no matter what you do; it's more common to compare gene-sized sequences.

written 13 months ago by Joe (17k)

Thanks! I will try that right now.

written 13 months ago by Metaxo (0)
9 months ago by
h.mon30k wrote:

I know the question is a bit old, but I didn't see it before. You don't need to convert the FASTA files to HMMs; you can certainly use hmmsearch with the viral contigs. Remember, though, that HMMER does protein-protein or DNA-DNA comparisons, but not DNA-protein comparisons, so to search against Pfam the contigs must first be translated into protein sequences (ORFs). The format of the command line is the same for both programs:

# hmmscan
hmmscan Pfam.hmm myorfs.fa

# hmmsearch
hmmsearch Pfam.hmm myorfs.fa
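As a slightly fuller sketch of the hmmsearch call above, the snippet below saves the tabular outputs so they can be post-processed like hmmscan results. The file names (`Pfam.hmm`, `myorfs.fa`), the output prefix, the function name `run_hmmsearch`, and the choice of 4 threads are assumptions for illustration; `--cpu`, `--tblout`, `--domtblout` and `-o` are standard HMMER 3 options.

```shell
#!/bin/sh
# Hypothetical input/output names; adjust to your own files.
HMM_DB="Pfam.hmm"             # Pfam profile HMM file
ORFS="myorfs.fa"              # protein sequences (translated ORFs), FASTA
OUT_PREFIX="myorfs_vs_pfam"   # prefix for the result files

run_hmmsearch() {
    # $1 = profile HMM file, $2 = protein FASTA, $3 = output prefix
    hmmsearch --cpu 4 \
        --tblout "$3.tbl" \
        --domtblout "$3.domtbl" \
        -o "$3.out" \
        "$1" "$2"
}

# Only run when the program and both input files are actually present.
if command -v hmmsearch >/dev/null 2>&1 && [ -f "$HMM_DB" ] && [ -f "$ORFS" ]; then
    run_hmmsearch "$HMM_DB" "$ORFS" "$OUT_PREFIX"
else
    echo "hmmsearch or input files not found; skipping run" >&2
fi
```

In the hmmsearch tables the roles of the columns are swapped relative to hmmscan: the target is the sequence and the query is the Pfam profile, so any downstream parsing should pick the columns accordingly.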

The post "hmmscan vs. hmmsearch speed: the numerology" explains in detail when to use one or the other. It states:

hmmscan and hmmsearch are doing exactly the same compute, at heart: comparing one profile to one sequence at a time. Their bit score results are identical. You can save hmmsearch tabular output files and use ’em just the same way you were going to use the hmmscan files. (Um, watch out for E-values: remember that E-values depend on the size of the database you search.)
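To make the E-value caveat concrete: an E-value is the per-comparison P-value multiplied by the number of comparisons Z (the search space size), so the same hit gets a different E-value when Z changes. The sketch below only illustrates that arithmetic; the P-value and the two Z values are made-up numbers, and the helper name `evalue` is hypothetical.

```shell
#!/bin/sh
# E-value = P-value * Z, where Z is the number of comparisons in the search.
# $1 = P-value of a hit, $2 = Z (search space size)
evalue() {
    awk -v p="$1" -v z="$2" 'BEGIN { printf "%.3g\n", p * z }'
}

# Same hypothetical hit (P = 1e-6) evaluated against two search space sizes.
evalue 1e-6 1000     # prints 0.001
evalue 1e-6 17000    # prints 0.017
```

If E-values from hmmscan and hmmsearch runs need to be comparable, one option is to fix the search space size explicitly with HMMER's `-Z` option in both programs, or to filter on bit scores, which do not depend on database size.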

modified 9 months ago • written 9 months ago by h.mon (30k)