Question

Well-resolved Phylogenetic Dataset

1

Entering edit mode

8.1 years ago

jnf3769 ▴ 40

Hello all,

I am looking to test an alignment-free phylogenetic tree building algorithm I wrote. It can perform both gene and species trees. I have already tested it on a single gene primate tree, but I need some more data to further characterize the algorithm. I know there is a lot of data on TreeBASE, but I am having a hard time pulling data down. Additionally, I am generally unaware of which trees are considered well-resolved.

Any info would help greatly

phylogenetics dataset phylogeny data • 2.4k views

ADD COMMENT • link updated 8.1 years ago by kloetzl ★ 1.1k • written 8.1 years ago by jnf3769 ▴ 40

score 2 · Accepted Answer · 2016-03-08

2

Entering edit mode

8.1 years ago

kloetzl ★ 1.1k

You might want to use data sets already used in other papers on alignment-free comparisons. Here you can download the data from andi (shameless self-plug). I also have the roseobacter data set from the spaced words paper. Send me a mail, if you are interested.

ADD COMMENT • link 8.1 years ago by kloetzl ★ 1.1k

0

Entering edit mode

I have a followup question about the 109 E. coli ST131 strains. In the assemblies (ordered or not), there are multiple nodes per fasta file. Am I right in assuming that that means there are more than one contig per file (that is, the genome was not closed)?