Consensus Sequence And Assembly Of Multiple Group Of Sequences
0
0
Entering edit mode
8.1 years ago

Hi,

I've a little question about consensus sequences. So I've a bunch of small overlapping sequences from different groups and I want to regroup sequences from the same group together and compute the consensus sequences. The thing is that I don't know from which group sequences are coming

A little example is better to understand (here there are thre groups - line 1-4 : g1 ; line 5-8 : g2 ; line 9-12 : g3):

AAATTTGGGCCC
AAATTTGGG
AAATTTG
   TTTGGGCCCAAA
ATGCATGCAT
ATGCATGC
  GCATGCATGC
 TGCATGCAT
ACGTACGTACGT
ACGTACGTA
  GTACGTACGTAC

And the expected output would be :

Group1 : AAATTTGGGCCCAAA

Group2 : ATGCATGCATGC

Group3 : ACGTACGTACGTAC

The problem is to cluster the sequences together to form the groups. After the consenus sequence is pretty simple to do.

Anyone has an idea ?

Thanks

N.

consensus • 1.8k views
ADD COMMENT
1
Entering edit mode

If you are just looking for Clustering then blastclust would do.

ADD REPLY
0
Entering edit mode
ADD REPLY
0
Entering edit mode

"After processing" with what?

ADD REPLY
0
Entering edit mode

I edited my question.

ADD REPLY
0
Entering edit mode

Maybe use CAP3?

ADD REPLY

Login before adding your answer.

Traffic: 2430 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6