Question: Compare A Denovo Assembly To The Reference Genome
2
gravatar for Rohit
3.7 years ago by
Rohit1.2k
European union
Rohit1.2k wrote:

Hello.

I was working on a pipeline for genome assembly and I have used the human paired-end NGS data.

I want to compare my denovo assembly to the available human genome which is based on Sanger sequencing mainly. How can I compare my denovo assembly to the sanger sequencing inorder to check -

  1. How much of the genome was covered in the denovo assembly?

  2. How many regions of the genome have I missed in the denovo contigs?

  3. Do I compare the contigs or scaffolds to the reference which is ordered as chromosomes?

I know Quast is used to test different genome assembly qualities, but what about comparing a denovo assembly to a reference genome???

ngs comparison reference denovo • 4.9k views
ADD COMMENTlink modified 3.7 years ago by Buttonwood30 • written 3.7 years ago by Rohit1.2k
4
gravatar for cts
3.7 years ago by
cts1.5k
Pasadena
cts1.5k wrote:

Quast also works with a reference genome, just set the reference with -R. According to their manual, Quast has tests that will give you answers to points 1 & 2. As for point 3, Quast has a --scaffolds option which, if given scaffolds will break them back into contigs. You could run it with and without the --scaffolds option to see the difference in the metrics. There is also ALE for looking at assembly metrics although I'm not very familiar with it

ADD COMMENTlink written 3.7 years ago by cts1.5k

I'm afraid Quast is not favorable for bigger genomes. Good for bacterial I guess.

ADD REPLYlink modified 3.7 years ago • written 3.7 years ago by Rohit1.2k
2
gravatar for lh3
3.7 years ago by
lh330k
United States
lh330k wrote:

There are quite a few multi-genome aligners. Mauve is a recent one which is extensively used in assemblathon. Bwa-mem and bwa-sw can do pairwise alignment if you prefer SAM as the output. It seems to me that Quast is primarily designed for small genomes. It calls MUMmer to do the actual alignment. As I remember, MUMmer does not work with genomes longer than 512Mb and even if it does, it will require huge RAM to hold the suffix tree. I do not know if Quast is able to split the genome.

ADD COMMENTlink written 3.7 years ago by lh330k

Can BWA do pairwise alignment of two genomes? I thought it was used for reads to reference alignment

ADD REPLYlink written 3.7 years ago by Hranjeev1.4k
1

The first BWA algorithm only works with short reads. You can align bacteria genomes with bwa-mem and bwa-sw. You can also align ~1Mbp contigs to human genomes with them.

ADD REPLYlink written 3.7 years ago by lh330k

Quast uses Mummer and I don't think it splits the genome in anyway. It runs till you are out of memory, no matter how huge your RAM is (~800Gb).

The problem I had with Mauve was that I did not know how to use it from the command-line. Its best to figure out now.

Or test it with BWA, though I donno how well I can use the SAM output

ADD REPLYlink modified 3.6 years ago • written 3.7 years ago by Rohit1.2k
1
gravatar for Buttonwood
3.7 years ago by
Buttonwood30
China
Buttonwood30 wrote:

I think you can refer to Lastz. http://www.bx.psu.edu/miller_lab/dist/README.lastz-1.02.00/README.lastz-1.02.00a.html

ADD COMMENTlink modified 3.7 years ago • written 3.7 years ago by Buttonwood30
0
gravatar for HG
3.7 years ago by
HG970
Germany
HG970 wrote:

You can have a look GAGE -B paper: Supplementary Table S13. for question 1 and for question 3 progressiveMauve may help you to find the answer.

ADD COMMENTlink written 3.7 years ago by HG970
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 605 users visited in the last hour