Hi, I´m using blast+ (blastn) to do a local blast with a database I´ve downloaded. As a result I would like to have only one hit per each read I have, so, I´m using the option "max_target_seqs 1", it happens that sometimes I get more than one hit for each of my query sequences, for what I understand my query reads are matching with the same target sequence in different forms. Anyone knows an option that could give me only one hit? I´ve tried the option max_hsps 1 but this allows each target sequences to be hit only one time, which I don´t want, once several of my reads may probably correspond to the same target read (in my case, the same species).
ha, tricky question nowadays ;) (google for other posts here on biostar with the max_target_seq as keyword)
Long story short: you're better of running the blast with default values and then filter out only a single hit for each query in post-processing. Have a look here, 'trick 6' : https://www.cheatography.com/melissamlwong/cheat-sheets/awk-one-liners-for-blast-results-manipulation/ (haven't tested that one myself though)
You mention 'read', is that read as in NGS or something different? If the former: blast is probably not the best approach then (unless you only have a very limited amount of reads to process)
Thanks lieven.sterck :)