Question

Evaluating alignment performance in vg

1

Entering edit mode

4.6 years ago

cmirchan ▴ 10

Hi,

I am currently trying to compare the alignment performance between vg and BWA-Mem. However, what I have found is not what I expected, vg does more or less the same as BWA in terms of number of mapped reads and number of correctly mapped reads. For inputs I have a VCF of ~1000 samples, a linear reference FASTA, and a VCF of a sample not in the large VCF. I have followed the workflows outlined in the wiki for construction, indexing, simulation and mapping.

The script map-sim seems to do what I am trying to achieve, and I have tried to emulate the workflow laid out in the script, but some parts have my confused. I am most interested in the simulation part of the script, particularly how the sim-base, sim-ref, and hap-base graphs are constructed.

Thanks in advance,

Cade

vg vgteam • 1.2k views

ADD COMMENT • link updated 4.4 years ago by glenn.hickey ▴ 520 • written 4.6 years ago by cmirchan ▴ 10

score 1 · Answer 1 · 2019-10-27

I've also had struggles getting the map-sim script to work -- simlarly to you I was stuck at creating the sim-base and hap-base graphs.

I figured it out in the end, and have made a simple tool here based on the map-sim script basically just requires an individual vcf file (e.g. the HG002 file that the vg paper used in their benchmarks) as input, and it will let you simulate reads on a graph from that individual (the simulation is actually done with some custom Python code because I needed to simulate many reads and then vg sim was too slow):

https://github.com/ivargr/graph_read_simulator

This code is not very well tested, but I think it should work if you get the dependencies right by running pip install on the setup file.

score 1 · Answer 2 · 2019-11-27

1

Entering edit mode

4.4 years ago

glenn.hickey ▴ 520

toil-vg has scripts to generate haplotpye sequences to simulate from. There's an (old) example here: https://github.com/vgteam/toil-vg/wiki/Chr21-simulation-experiment-on-AWS

ADD COMMENT • link 4.4 years ago by glenn.hickey ▴ 520