Multiple Alignment (~1400 sequences)
1
0
Entering edit mode
7.0 years ago
l.souza ▴ 80

Hello,

I need to align some sequences from a virus. The virus has 7 serotypes, and I got about 200 sequences of each sorotype. I have to determine the identity between the sequences of the same sorotype, and to find out the phylogeny between the sorotypes.

What is the best way to align the sequences in each of this situations?

muscle sequence multiple-alignment mafft • 1.6k views
ADD COMMENT
0
Entering edit mode

Are the sequences from different genes? Whole viral genomes?

ADD REPLY
0
Entering edit mode

They are the whole viral genome

ADD REPLY
1
Entering edit mode
7.0 years ago
h.mon 35k

Assuming you have thousands of somewhat related assembled virus genomes, you should:

  1. align them with MAFFT.
  2. reconstruct phylogeny with RAxML or ExaML.
ADD COMMENT
0
Entering edit mode

Is there an specific reason to use RAxML or ExaML?

ADD REPLY
0
Entering edit mode

They are fast and deal easily with large datasets. There are other software available though, I recommended two I've worked before.

ADD REPLY
0
Entering edit mode

Ok. Thank you so much! Can't wait to try it...

ADD REPLY
0
Entering edit mode

Is there any similar option to RAxML that allows me define other substition model than GTR family?

ADD REPLY
0
Entering edit mode

Long time since I used it, but I think for DNA only GTR or CAT (a fast approximation to GTR, for large datasets). There are more option for protein models.

ADD REPLY
0
Entering edit mode

I've already solved it... I'm gonna use GTR with a gamma correction!

Thank you, even so!

ADD REPLY

Login before adding your answer.

Traffic: 3225 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6