Blastp command line to get only full matches of query sequence
0
0
Entering edit mode
7.0 years ago
rlee03 • 0

Hi, I have a downloaded blast database already. Is it possible to blast this database and only get all the sequences in the database that contain EXACTLY this sequence ADARS ?

In other words, if any protein sequence contains ADARS EXACTLY, then I would you in my XML output/, this is my code so far. Thanks.

blastp -outfmt 5 -query /usr/local/projects/heptamer/peptide.fa -word_size 2 -matrix=PAM30 -db /usr/local/projects/heptamer/blastDB/specialBlastDB/BacteriaNR/all_bacteria_nonredundant_protein.faa -out /usr/local/projects/heptamer/blastResults/blastp/Bacteria/BacteriaFTP.xml -evalue 1000000000000000000000000000 -max_target_seqs 5000000

blast blastp commandline exact • 2.7k views
ADD COMMENT
0
Entering edit mode

Have you looked at the qcovs parameter?

ADD REPLY
0
Entering edit mode

What about grep -B 1 ADARS all_bacteria_nonredundant_protein.fa or variants thereof ?

ADD REPLY
0
Entering edit mode

This assumes unformatted fasta, though.

ADD REPLY
0
Entering edit mode

Yes, the rest is covered in the 'variants' part :)

ADD REPLY
0
Entering edit mode

Thank you all for your help. The qcov worked!

ADD REPLY

Login before adding your answer.

Traffic: 2592 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6