Hi, I'm running vg autoindex --workflow giraffe to try to create a graph reference using a VCF of variants I have that I later want to align some reads to with giraffe. I tested it out with a few variants on one chromosome, and this took a very long time with a lot of memory. I was wondering if it was possible to parallelize constructing a full graph reference by chromosome and merging the results into one reference at the end.
I tried looking through vg combine --help but it seems to only support combining .vg files rather than the .gbz, .dist, and .min files produced using the vg autoindex command.
What's the best way to scale up creating a graph reference from a VCF using the tool? Would combine still work, or is there another subtool to help here?
Thanks!
Thanks, I didn't realize
autoindexwas already running in parallel. I increased the memory for my task and it was able to complete.