Question: Compute resources for running ALLPATHS-lg
gravatar for bio_d
6 months ago by
bio_d0 wrote:


I am trying a denovo assembly of a non-model reptile using ALLPATHS-lg. I have 204GB of paired-end and mate-pair data.

Since we have limited computational resources we will be applying for computational resources (e.g from XSEDE). Could anyone suggest the compute hours and disk space in terms of Service Units(S.U) that should be necessary for completing the assembly process?

Thanks in advance.

sequence assembly • 178 views
ADD COMMENTlink written 6 months ago by bio_d0

What is the expected genome size? What is the proportion of paired-end to mate-pairs, how many sizes of mate-pairs? Do you know if the genome / sample being sequenced has high polymorphism rate?

Did you check the Assemblathon paper?

ADD REPLYlink written 6 months ago by h.mon24k

The genome size is approximately 2.6G.

We have 200 bp paired-end libraries and the following sizes for mate-pair libraries: 3kB, 5.2kB, 10kB and 20kB.

Unfortunately, I have not come across the Assemblathon paper.

ADD REPLYlink written 6 months ago by bio_d0
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2030 users visited in the last hour