Hello,
I am trying to do blastn using NCBI-BLAST BLAST 2.2.31+ using the following 2 following sequences in a file:
>PositiveControl
GGTTGAGGCTAAGCCAG
>PositiveControl_NNNNN
GGTTGANNNNNAGCCAG
against the following sequence:
>SubjectPositiveControl
ACCTCTGCATTAGGACTCTGCATCGACCGTAGCCAGTTCTTAGGCGGTTGAGGCTAAGCCAGGTAGGCCAAGTCTCACTGGAGCCGGTTGCGAGGGATTCCTTCGGGCTGAGGGAGAGTTTATGTGGAGTGGTCTGAAGGTGGTT
My >Positive Control is just copy of a part of >SubjectPositiveControl, and >PositiveControl_NNNNN is same as >PositiveControl but with 5 letters changed to Ns. My blast search finds >PositiveControl but aligns >PositiveControl_NNNNN only partially, where there are exact sequences (eg GGTTGA). Here is the command I used:
blastn -query PositiveControl.txt -subject SubjectPositiveControl.txt -task "blastn-short" -max_target_seqs 1 -word_size 4 -outfmt "6 qseqid sseqid qlen length qseq sseq pident sstrand" > PositiveControlResults.txt
Is there any way to perform this search with BLAST? Or is there any other tool/software/website that can do that? Thank you.
What is your expected alignment ? Can you also post contents of
PositiveControlResults.txt?BLAST+ is currently in v.2.6. Please upgrade if possible.
seqkit locate -d(http://bioinf.shenwei.me/seqkit/usage/#locate) can locate motif containingN, but it's exact not similar search.