Question

Blast With Biopython

0

Entering edit mode

6.9 years ago

neilgupte7 • 0

i wanted to blast two sequences ,but i need the query to be searched with the complement of my subject strand .and how to display such alignment inside biopython using blast?

EXAMPLE:
 AGTC
 ||||
 AGTC

OUTPUT BECOMES-
--AGTC
--||--
TCAG--

 AGTC
 ||||
 TCAG

Help will be appreciated, THANK YOU

blast biopython pairwise alignment • 4.7k views

ADD COMMENT • link updated 6.9 years ago by gb ★ 2.2k • written 6.9 years ago by neilgupte7 • 0

0

Entering edit mode

What is the actual question here?

Do you want to know how to 'render' the alignment visually, or how to do the BLAST, or how to reverse complement? All 3?

ADD REPLY • link 6.9 years ago by Joe 22k

0

Entering edit mode

rendering part might also help basically i wanted to know,can we do Blast complementary strands inside biopython and visaulise the particular output.?

ADD REPLY • link 6.9 years ago by neilgupte7 • 0

0

Entering edit mode

Don’t type entirely in capitals please.

ADD REPLY • link 6.9 years ago by Joe 22k

0

Entering edit mode

sorry ,my bad !did not pay attention towards the case

ADD REPLY • link 6.9 years ago by neilgupte7 • 0

0

Entering edit mode

Standalone blast or online?

ADD REPLY • link 6.9 years ago by gb ★ 2.2k

0

Entering edit mode

standalone BLAST connecting it with a python sript

ADD REPLY • link 6.9 years ago by neilgupte7 • 0

score 3 · Answer 1 · 2018-08-14

3

Entering edit mode

6.9 years ago

gb ★ 2.2k

Something like this:

cline = NcbiblastnCommandline(query="m_cold.fasta", db="nt", strand="plus",evalue=0.001, out="m_cold.xml", outfmt=5)

Or:

cline = NcbiblastnCommandline(query="m_cold.fasta", db="nt", strand="minus",evalue=0.001, out="m_cold.xml", outfmt=5)

With the strand parameter you can change which orientation you want. Probably in your case you need to set it on plus but reverse complement all your input sequences.

For displaying the alignment you can do something like this:

from Bio.Blast import NCBIXML
blast_record = NCBIXML.read(result_handle)
for alignment in blast_record.alignments:
 for hsp in alignment.hsps:
     print("****Alignment****")
     print("sequence:", alignment.title)
     print("length:", alignment.length)
     print("e value:", hsp.expect)
     print(hsp.query[0:75] + "...")
     print(hsp.match[0:75] + "...")
     print(hsp.sbjct[0:75] + "...")

Output will be:

****Alignment****
sequence: >gb|AF283004.1|AF283004 Arabidopsis thaliana cold acclimation protein WCOR413-like protein
 alpha form mRNA, complete cds
length: 783
e value: 0.034
tacttgttgatattggatcgaacaaactggagaaccaacatgctcacgtcacttttagtcccttacatattcctc...
||||||||| | ||||||||||| || ||||  || || |||||||| |||||| |  | |||||||| ||| ||...
tacttgttggtgttggatcgaaccaattggaagacgaatatgctcacatcacttctcattccttacatcttcttc...

I got this code from here:

http://biopython.org/DIST/docs/tutorial/Tutorial.html#htoc98

ADD COMMENT • link 6.9 years ago by gb ★ 2.2k

0

Entering edit mode

I am using Anaconda Spyder to run Biopython but i dont get the output when using NcbiblastnCommandline but when i use NcbiblastpCommandline i get the output ....But the output is in following format

*Alignment* sequence: seq2 seq2 <unknown description=""> length: 149 e value: 0.0138168 CTAGCTCGATCGATCGATGCTAAGCTTACGTAGCT... CTAGCT GATCG C TACGTAGCT... CTAGCT-GATCGTAC-----------TACGTAGCT..

what should i do to get the output as follows

tacttgttgatattggatcgaacaaactggagaaccaacatgctcacgtcacttttagtcccttacatattcctc... ||||||||| | ||||||||||| || |||| || || |||||||| |||||| | | |||||||| ||| ||... tacttgttggtgttggatcgaaccaattggaagacgaatatgctcacatcacttctcattccttacatcttcttc...

ADD REPLY • link 6.9 years ago by neilgupte7 • 0

0

Entering edit mode

Please use the code formatting button (it has101010 written on it in the bar above) to improve your posts readability.

ADD REPLY • link 6.9 years ago by Joe 22k

0

Entering edit mode

Not sure if I understand because you comment is hard to read. But when you do a blastp you do a protein alignment. So you have matches and positives. You need to check what's going wrong with blastn, blastn is for nucleotide alignments.

ADD REPLY • link 6.9 years ago by gb ★ 2.2k