Question: Sequence Evolution Library In Java
0
gravatar for Rob
8.3 years ago by
Rob3.3k
United States
Rob3.3k wrote:

Hello,

I'm writing a biological evolution simulator. Currently, all of my code is written in Python. For the most part, this is great and everything works sufficiently well. However, there are two steps in the process which take a long time and which I'd like to rewrite in Scala.

The first problem area is sequence evolution. Imagine you're given a phylogenetic tree which relates a large set of proteins. The length of each branch represents the evolutionary distance between the parent and child. The root of the tree is seeded with a single sequence, and then an evolutionary model (e.g. http://en.wikipedia.org/wiki/Models_of_DNA_evolution) is used to evolve the sequence along the tree structure; taking into account the branch lengths. PyCogent takes a long time to perform this step, and I believe that a reasonable Java/Scala implementation would be significantly faster. Do you know of any libraries that implement this type of functionality. I want to write the application in Scala, so, due to interoperability, any Java library will suffice.

The second problem area is the comparison of the generated sequences. The problem is, given a set of sequences for the proteins in a number of different extant species, attempt to use the sequence to reconstruct the phylogenetic tree which relates the species. This problem is inherently computationally demanding, because one must basically do a pairwise comparison between all sequences in the extant species. Here again, however, I feel like a Java/Scala implementation would perform significantly faster than a Python one, if for nothing else than the unfortunately slow speed of looping in Python. This part I could write from scratch more easily than the sequence evolution part, but I'd be willing to use a library for it as well if a good one exists.

Thanks, Rob

model python java evolution • 2.4k views
ADD COMMENTlink written 8.3 years ago by Rob3.3k

For your second problem area, I wouldn't recommend trying to re-implement phylogenetic inference in java. It is so computationally intensive that generally this is done in C. I only know of one tool that does part of this in java, and that's treefinder (http://www.treefinder.de/).

ADD REPLYlink written 8.3 years ago by Rvosa570

I have never heard that loops are slow in python. Aren't you confusing it with R?

ADD REPLYlink written 8.2 years ago by Giovanni M Dall'Olio26k
7
gravatar for Botond Sipos
8.3 years ago by
Botond Sipos1.7k
United Kingdom
Botond Sipos1.7k wrote:

Check out the Phylogenetic Analysis Library.

ADD COMMENTlink written 8.3 years ago by Botond Sipos1.7k

Excellent! This library doesn't look like it has been updated in a while, but it seems to contain all of the functionality I need. I'll probably give it a try.

ADD REPLYlink written 8.3 years ago by Rob3.3k

My friend is using pal. He is very satisfied with that.

ADD REPLYlink written 7.2 years ago by lh331k
3
gravatar for Rvosa
8.3 years ago by
Rvosa570
Leiden, the Netherlands
Rvosa570 wrote:

Check out JEBL: http://sourceforge.net/projects/jebl/, which can roughly be seen as the successor to PAL, at least to the extent that some of the same people are involved (who seem to have abandoned PAL).

ADD COMMENTlink written 8.3 years ago by Rvosa570
2
gravatar for Jeremy Leipzig
8.3 years ago by
Philadelphia, PA
Jeremy Leipzig18k wrote:

I wouldn't be surprised to see improvements if the slowest elements of Pycogent were moved to c - seems like that group already has people like Daniel McDonald who know how to wrap c, since it looks like some of it is already in c:

find . | grep "\.c$"
./cogent/align/_compare.c
./cogent/align/_pairwise_pogs.c
./cogent/align/_pairwise_seqs.c
./cogent/evolve/_likelihood_tree.c
./cogent/maths/_matrix_exponentiation.c
./cogent/maths/_period.c
./cogent/maths/eigen.c
./cogent/maths/matrix_invert.c
./cogent/maths/spatial/ckd3.c
./cogent/struct/_asa.c
./cogent/struct/_contact.c
ADD COMMENTlink written 8.3 years ago by Jeremy Leipzig18k

Yea, some parts of what I'm doing are clearly already in C, but others aren't quite there yet. I'll be sure to keep an eye out for library updates.

ADD REPLYlink written 8.3 years ago by Rob3.3k
0
gravatar for Audriusa
7.2 years ago by
Audriusa10
Switzerland
Audriusa10 wrote:

There is an interesting Java - based package designed to model evolution of all kinds (JGap). It provides the good framework (chromosomes, selection, inheritance, evolution) and is normally used as a tool to solve various difficult problems by the method of simulated evolution. JGap also offers some interesting visualization tools. It is old, mature project with hundreds of downloads per week. I think I can surely recommend it.

ADD COMMENTlink written 7.2 years ago by Audriusa10
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1778 users visited in the last hour