Hi,
Can somebody please put me out of my misery and let me know what the '-B' parameter in hhblits is supposed to be doing? Officially, it specifies the 'maximum number of alignments in alignment list'. So, I assumed that increasing the -B integer would increase the max. number of HMM/MSA database hits to my query HMM, and by extension the total number of sequences that would be included in my output MSA.
However, I've tried to systematically vary -B from 1 to 50, 75, 100, and 200 and the total number of sequences in the output MSA does not change. I also do not discern clear changes in the output summary file (.out). It does not seem to be having any effect at all?
Thank you in advance.
Thank you very much for the reply. It makes sense but there is something that I am still confused by; I will go into the specifics.
I am using an E-value threshold of 1e-10 and there are 137 MSA/HMMs that pass this threshold and from this I get 1159 sequences in my final output MSA. I therefore assumed that setting -B to 100 would restrict the number of MSA/HMMs (from 137 to 100) that would contribute to the number of sequences in my final output MSA. In every case I still get 1159 final sequences.
You say that it specifies the number of alignments that will be reported, but what does this mean exactly if it is not affecting the summary list in the .out file? Is it some other visual output that does not affect my final result MSA file? Thank you again.
I think both -B and -Z affect only the visual side, meaning only what is in the report. They should not affect how many sequences you get in the output file. If you want fewer sequences, I suggest you go with hhfilter and play with the diversity switch (-neff).
Thanks. That was my impression also but I just wanted to make sure before starting my analysis. That makes sense.
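For anyone landing here later, the hhfilter suggestion above can be sketched roughly as follows. This is a minimal sketch, not a tested pipeline: the file names and the -neff target of 5 are made-up placeholders, and the helper functions are hypothetical. The script only builds the hhfilter command line (it does not run it) and counts sequences by counting '>' header lines, which is how the 1159 figure above would typically be checked.

```python
import shlex


def count_sequences(msa_path):
    """Count sequences in an A3M/FASTA-style MSA by counting '>' header lines.

    Note: this counts every header, including the query and any
    annotation records that happen to start with '>'.
    """
    with open(msa_path) as handle:
        return sum(1 for line in handle if line.startswith(">"))


def hhfilter_command(in_msa, out_msa, neff):
    """Build (but do not run) an hhfilter call with a target diversity.

    -i, -o and -neff are hhfilter options; the values passed in here
    are illustrative assumptions.
    """
    return ["hhfilter", "-i", str(in_msa), "-o", str(out_msa), "-neff", str(neff)]


if __name__ == "__main__":
    # Hypothetical file names and Neff target, for illustration only.
    cmd = hhfilter_command("query.a3m", "query.filtered.a3m", 5)
    print(shlex.join(cmd))
```

Running count_sequences on the MSA before and after filtering shows whether the chosen -neff value actually reduced the alignment; the same check on outputs produced with different -B values should give identical counts if -B only affects the report.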