Hi,
I have a multisample VCF file with variable mean sequencing depths for subsets of the samples as follows. 60 samples @ 8X; 15 samples @ 20X; 1 sample @ 2X mean coverage and 45% missing genotypes.
I normally filtered my VCF with minimum and maximum depth range based on half and double the mean depth. In this case that criteria will be different for the different groups.
Is there a way to filter the VCF based only on one subset of samples at a time without splitting the VCF? For example keep all samples but filter variants so the 60 samples @ 8X have minimum depth of 4X and maximum of 16X.
The sample at 2X is an old museum sample and quite important. How can I filter the VCF to keep all samples but only this one sample has no missing variants?
I've searched the forums but can't quite find the answers I'm looking for.