Blast Contigs To A Reference Sequence And Obtain An E-Value
4
1
Entering edit mode
10.6 years ago
ant0422ant ▴ 10

Well, I'm a newbie of blast, so...hope I'm not asking a stupid question....

I've already have a set of contigs which is done by de novo assembler. Format of these contigs is fasta. Now I want to align these contigs to a reference sequence by using blast. And obtain only single e-value or a value shown the similarity between contigs and reference sequence.

I used blastn on linux system with "-subject reference.fa" command, but it only showed identity and e-value for each contig. And it would be problematic for summarizing these values.

So, is it possible for blast to do such a job? I mean, calculate a single value which can present similarity between contigs and reference sequence.

I've searched for hours and can't seem to find a straight answer. Please do help me....

blast assembly alignment • 6.1k views
ADD COMMENT
2
Entering edit mode
10.6 years ago
Neilfws 49k

You're aligning multiple query sequences to one reference. A single measure of similarity for multiple queries does not make sense. So the answer is: you don't want to do that, using BLAST or anything else.

ADD COMMENT
0
Entering edit mode

Well, I just want to calculate how similar between contigs I assembled and the reference. I think there are still some metrics which can complete the task~ Anyway thanks a lot!!

ADD REPLY
1
Entering edit mode
10.6 years ago
chefer ▴ 350

Maybe the best would be to generate a density plot or histogram of the e-value distribution from all the assembled contigs? BLAST does not have the functionality you describe.

ADD COMMENT
1
Entering edit mode
10.6 years ago

You could use BLAST -m8 output and tabulate the bit scores - that might be a somewhat meaningful metric but it wouldn't really stand up to scrutiny

perhaps MAUVE would give you some kinda single metric: http://gel.ahabs.wisc.edu/mauve/

ADD COMMENT
0
Entering edit mode

I will give a try on using MAUVE. And is "BLAST -m8" used on blastall?

Thanks so much for the answer!!!

ADD REPLY
1
Entering edit mode
10.6 years ago
Rohit ★ 1.5k

First thing is there is nothing called a stupid question.

The tabular outfmt 7 of blast combined with pident (check the stand-alone manual), gives best results but do use the bit scores for comparison.

If you would like to use the results for further data merging then I guggest you to use XML output (outfmt 5) and then parse whatever you need.

ADD COMMENT
0
Entering edit mode

And I've tried these two formats, but too many results to summarize. Anyway thanks your answer!! B-)

ADD REPLY

Login before adding your answer.

Traffic: 3093 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6