Merging genomic databases from different intervals (GenomicsDBImport)
11 months ago

Hi all. I'm trying to put a large dataset through GenomicsDBImport, but it's not finishing within my cluster's 72 hr kill time. I know I can make this more efficient by running different parts of the genome simultaneously using -L to specify intervals. But I don't see a tool to merge the resulting interval-based genomic databases afterwards. How can I do that? Or can I just direct everything to the same directory? Thanks!

gatk • 431 views

I don't think you can merge GenomicsDB workspaces.

Instead, run GenomicsDBImport once per interval so that each interval gets its own database, genotype each database with GenotypeGVCFs to get one VCF per interval, then merge the VCFs (for example with MergeVcfs or GatherVcfs).
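
To make the three steps concrete, here is a rough sketch that only builds the command lines, one import and one genotyping command per interval plus a single merge at the end. The helper name `build_commands`, the file names (`samples.map`, `ref.fasta`), the `out` directory, and the interval names are all placeholders I made up for illustration. Check the flags against your GATK version before submitting anything to the cluster.

```python
from typing import List, Tuple

Command = List[str]


def build_commands(
    intervals: List[str],
    sample_map: str,
    reference: str,
    out_dir: str = "out",
) -> Tuple[List[Command], List[Command], Command]:
    """Build per-interval import/genotype commands and one final merge command.

    Nothing is executed here; the lists are meant to be handed to a job
    scheduler or to subprocess.run(). All paths are hypothetical.
    """
    import_cmds: List[Command] = []
    genotype_cmds: List[Command] = []
    vcfs: List[str] = []

    for interval in intervals:
        # Make a file-system-friendly tag, e.g. "chr1:1-1000" -> "chr1_1_1000".
        tag = interval.replace(":", "_").replace("-", "_")
        workspace = f"{out_dir}/genomicsdb_{tag}"  # one workspace per interval
        vcf = f"{out_dir}/{tag}.vcf.gz"

        import_cmds.append([
            "gatk", "GenomicsDBImport",
            "--sample-name-map", sample_map,
            "--genomicsdb-workspace-path", workspace,
            "-L", interval,
        ])
        genotype_cmds.append([
            "gatk", "GenotypeGVCFs",
            "-R", reference,
            "-V", f"gendb://{workspace}",
            "-O", vcf,
        ])
        vcfs.append(vcf)

    # One -I argument per interval VCF, in the order the intervals were given.
    merge_cmd: Command = ["gatk", "MergeVcfs"]
    for vcf in vcfs:
        merge_cmd += ["-I", vcf]
    merge_cmd += ["-O", f"{out_dir}/merged.vcf.gz"]

    return import_cmds, genotype_cmds, merge_cmd


if __name__ == "__main__":
    imports, genotypes, merge = build_commands(
        ["chr1", "chr2"], "samples.map", "ref.fasta"
    )
    for cmd in imports + genotypes + [merge]:
        print(" ".join(cmd))
```

Each import/genotype pair can run as its own cluster job, since the intervals are independent. Only the final merge has to wait for all of them.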
