Question: Question about blastp result
0
gravatar for huiyus97
3 months ago by
huiyus970
huiyus970 wrote:

Hi,

I need to blast several proteomes against several databases, and I found that I got way too many blastp result compared to the blastp result we got from a company using the same proteome and against the same database. For example, I blasted a proteome against the PHI database using this code:

$blastp -in input.fasta -evalue 1e-5 -outfmt 6 -out outfile -db database.fasta -max_target_seqs 1

As a result, I got 4000 proteins with a e-value=1e-5, however, the company got 1400 proteins with the same e-value.

Could someone please tell me what causes this problem?

Thanks in advance!

blastp • 167 views
ADD COMMENTlink modified 5 weeks ago by Biostar ♦♦ 20 • written 3 months ago by huiyus970

hard to tell without having more detailed info.

are you running the same parameter settings? same blast version? same database version? ...

One thing already is that using -max_target_seqs is quite tricky (do some searching on this forum as well as internet for this and you will quickly notice)

Also, why do you assume you're getting too much hits, perhaps the company is reporting way too few hits?

ADD REPLYlink modified 3 months ago • written 3 months ago by lieven.sterck9.1k

Did you use an identical copy of the database, blast program and parameters? This may not be a problem at all. If one or more of the things mentioned in first sentence differ then your result will also be different.

ADD REPLYlink written 3 months ago by GenoMax92k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2158 users visited in the last hour