Hi all ,
I have performed a local protein blast and it gave me an output which contains a lot of duplication . However, I sorted out the result w.r.t - e-value( <=0.01) , -query coverage (>= 70%), -%id (>= 30%) , -higher bit score
Now, coming to my Problem-
- I'm seeing that the query_Start_position for multiple sequences are different (for eg. 158 and 178) but the End_position is same (let's say 840) and in many cases it's vice versa.
How do I sort it out? Which one to consider ? And if one hit shows query sequence position from 5 - 100 and other shows 10 - 90 (which is in between 5 - 100 ) , which one should I consider?
Thanks in advance. Wang