Filtering BLAST-annotated reads on the basis of hits
7.3 years ago

Hello,

I have run BLAST on my query file. Now I want to remove the sequences with no hits. How can I do that?

genome sequence blast alignment • 1.4k views

I am not sure I understand the problem. Is this not a simple matter of 1) parsing the BLAST output to identify which query sequences gave hits, and 2) extracting this subset of sequences from the query FASTA file?

If you use tabular output (-outfmt 6) for your BLAST run, you can get the unique query IDs with cut -f 1 blast_output.txt | sort | uniq > ids.txt. Then search Biostars for, or write, a script that keeps the sequences matching those IDs (or, equivalently, removes the ones that don't).
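For what it's worth, here is a minimal Python sketch of both steps in one go: collect the query IDs from the first column of the tabular BLAST output, then keep only the FASTA records with those IDs. The function names are made up for illustration. It assumes the BLAST query ID is the FASTA header text up to the first whitespace, and that the output is tab-separated (lines starting with # are skipped as comments).

```python
def read_hit_ids(blast_path):
    """Collect the unique query IDs (column 1) from tabular BLAST output."""
    hit_ids = set()
    with open(blast_path) as handle:
        for line in handle:
            # Skip blank lines and comment lines.
            if line.strip() and not line.startswith("#"):
                hit_ids.add(line.split("\t", 1)[0].strip())
    return hit_ids


def filter_fasta(fasta_path, out_path, hit_ids):
    """Write only the records whose ID is in hit_ids; return how many were kept."""
    kept = 0
    keep = False
    with open(fasta_path) as fin, open(out_path, "w") as fout:
        for line in fin:
            if line.startswith(">"):
                # Assumption: the ID is the header text up to the first whitespace.
                fields = line[1:].split()
                keep = bool(fields) and fields[0] in hit_ids
                kept += keep
            # Sequence lines follow the decision made for their header.
            if keep:
                fout.write(line)
    return kept
```

For example, filter_fasta("query.fasta", "query_with_hits.fasta", read_hit_ids("blast_output.txt")) would write the sequences that gave at least one hit to query_with_hits.fasta. The two FASTA file names here are placeholders; swap in your own. To get the no-hit sequences instead, flip the membership test to "not in".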
