How to get hits sequences of all hsps from tblastn and connect them?
0
1
Entering edit mode
7.6 years ago

I have a tblastn tabular output with millions of hits.P,Q and R were used as query. Con,Don and Eon were my contigs used to be blast. The tblastn output had many HSPs. For example, I used P to tblastn Con. p1,p2,p3 and p4 are four HSPs, and corresponding protein sequences in Con are C1,C2,C3 and C4. How can I connect C1,C1,C2,C3 and C4, and remove the overlapping sequence? I want to use the connected protein sequences to do sequence alignment.

P ------------------ ---- -
p1---- p2 ------ p4---

         p3------

C1---- C2 ---- C4---

        C3-----
blast alignment sequence genome • 1.4k views
ADD COMMENT

Login before adding your answer.

Traffic: 1778 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6