What is the best way to generate vcf for vg toolkit from raw sequences
1
1
Entering edit mode
22 months ago
Sumit ▴ 10

Hi,

I am trying to generate variation graph (vg) from raw SARS-CoV2 sequences (~16k sequences). What would be the best way to generate VCF given these raw sequences? And, does the method works as the data scales up?

Thanks,

vg vgteam variation_graph • 792 views
ADD COMMENT
0
Entering edit mode
16 months ago

You can try PGGB for this purpose, since you can enter raw fasta sequences. Not sure if it will scale to that number of genomes though.

https://github.com/pangenome/pggb

It will give you odgi and VCF output, you can further convert from there if needed.

ADD COMMENT

Login before adding your answer.

Traffic: 2270 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6