Question: RAxML protein partition files
0
gravatar for Moses
6 months ago by
Moses120
united states/ Bloomingtion/ Indiana University Bloomington
Moses120 wrote:

Hi All,

Im using the supermat function in R to generate a super matrix (i.e. concatenate the multiple sequence alignments) and run RAxML over the concatenated MSA file, to do that supermat function is also producing the partitions files. when input the partition file to run RAxML using the -q parameter, the program is giving me the following error:

ERROR: Bad base (I) at site 4 of sequence 1
Printing error context:

 400  6099
1   AAMIALKCETDFVAKNADFVALTQ

Problem reading alignment file

When I run however without the -q parameter, over the same concatenated multiple sequence alignment file, the program starts running without errors, first few lines of my alignment file in phylip format looks like this:

400  6099
1   AAMIALKCETDFVAKNADFVALTQAILDAAIANKCQTLDDVKALPM-GSGT----IADAIVERSGITGEKTELDGYFFVSGA-----CTAVYNHMNKNQ-----

and then the first few lines of the partition file looks like this:

DNA, EF_TS_NCBI400Genes_msa = 1 - 370
DNA, EFG_C_NCBI400Genes_msa = 371 - 463
DNA, EFG_II_NCBI400Genes_msa = 464 - 538
DNA, EFG_IV_NCBI400Genes_msa = 539 - 669
DNA, EFP_N_NCBI400Genes_msa = 670 - 728

Now I notice that it specifies the partitions as DNA, whereas my sequences are amino acids, is that the only problem? I have tried replaceing DNA by protein/PROT/PROTEIN etc still gave the same error, is there a flag that I should use in the supermat function so that I can tell it it;s a protein sequence?

Any help is appreciated. Thanks.

ADD COMMENTlink modified 6 months ago by h.mon30k • written 6 months ago by Moses120
1
gravatar for h.mon
6 months ago by
h.mon30k
Brazil
h.mon30k wrote:

You are incorrectly specifying the partition type. The RAxML manual says:

For DNA data this just means that you have to add DNA to each line in the partition file, for AA data this is done by specifying the respective AA substitution matrix (e.g., WAG or LG) you want to use for a partition. For binary data you'd specify BIN and for multi-state data MULTI.

ADD COMMENTlink modified 6 months ago • written 6 months ago by h.mon30k

okay yes that did the trick. I should read this manual more carefully. Thank you for your time and answer!

ADD REPLYlink written 6 months ago by Moses120
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 892 users visited in the last hour