Question: SAVAGE is taking too long to run the haplotype reconstruction
1
gravatar for fernandalpcosta
3 months ago by
Campinas/UNICAMP
fernandalpcosta0 wrote:

Hello all,

I'm running SAVAGE (https://bitbucket.org/jbaaijens/savage/src/master/) to get the haplotypes of a big virus (almost 3x the HIV virus' size) and it took over 72h hours to get to stage b of the SAVAGE pipeline.

The sequencing has a coverage of 20000x and the reference genome has 32KB of size.

I'm using 7 threads to run this analysis on a 8 vCPUs, 52 GB RAM and 10T disk machine.

I also tested HaploClique (https://github.com/cbg-ethz/haploclique) and PredictHaplo (http://bmda.cs.unibas.ch/software.html) softwares on this analysis, but both also took over 72h and never finished.

Is there another software that runs the whole analysis in less than 72h per virus in the conditions/specifications I mentioned above?

Thank you all in advance for any tips or help you may give me,

ADD COMMENTlink written 3 months ago by fernandalpcosta0

Is the ultra high depth of sequencing causing this issue? Do you really need that much coverage.

ADD REPLYlink written 3 months ago by genomax55k

I'm not sure if that much coverage is necessary, but it's not unusual for virus's haplotype reconstruction. You thing that 10000x is enough?

ADD REPLYlink written 3 months ago by fernandalpcosta0

Someone else will have to comment on the re-construction part. You could try different amounts (starting with 1000x) and see if it makes a big difference in results as you go up.

ADD REPLYlink written 3 months ago by genomax55k

That's a great idea! I will try that! Thank you.

ADD REPLYlink modified 3 months ago • written 3 months ago by fernandalpcosta0
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 561 users visited in the last hour