Question: Short-read genome assembly with related reference genome
gravatar for Benni
3.0 years ago by
Benni30 wrote:

I have paired-end Illumina data of a bacteria (1) and a reference genome of the same bacteria (2). However, the reference bacteria is not the same as (1), because (1) was treated differently and changed it`s genome. Now I want to assemble the genome of (1) using the reference genome, since de novo assembly with only Illumina reads produces a lot of contigs.

I mapped the reads of (1) to the reference genome and got 99% Pairwise identity and 35.9% Identical sites. What is the best way to get the correct genome of (1)?

I could modify the reference genome until it fits all the reads. Regions of low/high coverage indicate, that the reference genome is the wrong representation of (1) at this spot. To correct this I could try to find contigs by de novo assembly of (1) that explain this area better.

This is my idea, but I want to ask you experiences people, how you would approach this problem and what tools I may use.

assembly sequencing ngs illumina • 2.1k views
ADD COMMENTlink modified 3.0 years ago by jean.elbers1.6k • written 3.0 years ago by Benni30
gravatar for jean.elbers
3.0 years ago by
jean.elbers1.6k wrote:

You might try SPAdes ( followed by ragout ( and could possibly compare synteny between your genome and the reference with (

ADD COMMENTlink written 3.0 years ago by jean.elbers1.6k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1913 users visited in the last hour