I have a group of samples (150 bp paired-end DNA sequencing reads) ranging in sequencing depth from ~10X to ~100X.
I understand that for the samples to be comparable, their FQs could be downsampled to uniform coverage.
But, I'd like to use all of the data per sample to call variants, rather than downsampling reads.
Is there a tool to downsample VCFs to similar coverage distributions with a desired median coverage?
If it is possible, is this a reasonable way to accomplish the same thing as downsampling reads?
Alternatively, can I filter the VCFs to render these samples comparable?