Question: Assembler suggestions for ~90Mb genome (using single-end Illumina & PacBio data only)
gravatar for jmartin
3 months ago by
jmartin0 wrote:

I have a large pool of single-ended Illumina data (HiSeq 2500) and PacBio RS data and I am looking for suggestions of the best assemblers to use in this situation.

I was first interested in MaSuRCA because it would run a hybrid assembly using both the short reads and the long reads in the same assembly operation (as opposed to building an assembly w/ Illumina and using the long reads to close gaps via some other tool). But I think MaSuRCA needs paired end illumina data to work. I am not 100% sure on that since they also give instructions for using single-ended data, and that single-ended data is entered as 'paired end' data in the config. So its possible they actually meant that Illumina data is the requirement (ie. it can't work on just long reads). But I think that's wishful thinking on my part.

So, are there any good assemblers that can use short Illumina SE + long PacBio RS data? What is the best assembly strategy for this kind of input data? Unfortunately I am not in a position to generate PE data for this organism.

assembly • 191 views
ADD COMMENTlink modified 3 months ago by colindaven1.8k • written 3 months ago by jmartin0
gravatar for h.mon
3 months ago by
h.mon28k wrote:

The fastest - and probably best in this case - strategy is to assemble using only PacBio (using wtdbg2 or flye, for example), then polish (several rounds) with Illumina, or Illumina+PacBio.

ADD COMMENTlink written 3 months ago by h.mon28k
gravatar for colindaven
3 months ago by
Hannover Medical School
colindaven1.8k wrote:

h.mon's answer seems very reasonable.

I would also just assemble with Pacbio, perhaps using Canu or wtdbg2.

Even SPADES might be worth a try since the genome is not huge. I would forget the hybrid assemblers and just use the illumina reads for polishing using RACON etc, if SE reads are useful.

ADD COMMENTlink written 3 months ago by colindaven1.8k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1969 users visited in the last hour