Question: How to get the hit sequences of all HSPs from tblastn and connect them?
4.2 years ago by
chenmy2007525 • 10 wrote:

I have a tblastn tabular output with millions of hits. P, Q and R were used as queries, and Con, Don and Eon are the contigs I searched against. The tblastn output contains many HSPs per query-contig pair. For example, when I ran tblastn with P against Con, I got four HSPs, p1, p2, p3 and p4, whose corresponding protein sequences in Con are C1, C2, C3 and C4. How can I connect C1, C2, C3 and C4 and remove the overlapping sequence? I want to use the connected protein sequences for sequence alignment.

P ------------------ ---- -
p1---- p2 ------ p4---


C1---- C2 ---- C4---
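
To make the task concrete, here is a minimal Python sketch of one way the HSPs could be joined. It assumes the search was rerun with a custom tabular format that includes the aligned sequences, for example `-outfmt "6 qseqid sseqid qstart qend qseq sseq"`, since the default 12-column table does not contain them. The function name `stitch_hsps` and the column order are assumptions made for illustration. The sketch orders HSPs by query start and drops any subject residue aligned to a query position that an earlier HSP already covers; it ignores strand, frame, and the order of HSPs on the contig, so it is a starting point and not a complete solution.

```python
from collections import defaultdict


def stitch_hsps(rows):
    """Join the subject (hit) sequences of all HSPs per query/contig pair.

    rows: iterable of (qseqid, sseqid, qstart, qend, qseq, sseq), where
    qseq and sseq are the aligned sequences, gaps written as '-'.
    Returns a dict mapping (qseqid, sseqid) to the stitched hit sequence.
    """
    groups = defaultdict(list)
    for qid, sid, qstart, qend, qseq, sseq in rows:
        groups[(qid, sid)].append((int(qstart), int(qend), qseq, sseq))

    stitched = {}
    for key, hsps in groups.items():
        # Order HSPs along the query; for equal starts take the longer first.
        hsps.sort(key=lambda h: (h[0], -h[1]))
        covered_end = 0  # last query position already represented
        pieces = []
        for qstart, qend, qseq, sseq in hsps:
            qpos = qstart - 1
            kept = []
            # Walk the alignment column by column so that gaps in either
            # sequence do not shift the query coordinates.
            for qchar, schar in zip(qseq, sseq):
                if qchar != "-":
                    qpos += 1
                # Keep only subject residues beyond the covered query range.
                if qpos > covered_end and schar != "-":
                    kept.append(schar)
            pieces.append("".join(kept))
            covered_end = max(covered_end, qend)
        stitched[key] = "".join(pieces)
    return stitched


# Toy example (made-up sequences): two overlapping HSPs and one gapped HSP.
example_rows = [
    ("P", "Con", "1", "5", "MKTAY", "MKSAY"),
    ("P", "Con", "4", "8", "AYIAK", "AYLAK"),   # overlaps query positions 4-5
    ("P", "Con", "10", "12", "Q-RL", "QGRL"),   # gap in the query
    ("P", "Don", "1", "3", "MKT", "MRT"),
]
result = stitch_hsps(example_rows)
print(result[("P", "Con")])
print(result[("P", "Don")])
```

With a real file, each tab-separated line would be split into the six fields and passed in as `rows`; the stitched sequences can then be written out as FASTA for the alignment step.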

blast sequence alignment genome • 912 views

