Question: Exclude sites on the basis of the proportion of missing data using BCFTOOLS
0
gravatar for GabrielMontenegro
17 months ago by
United Kingdom
GabrielMontenegro560 wrote:

I want to do something very simple. I want to keep sites that do not have any missing genotypes (i.e. 100% present across all samples).

In vcftools I usually do --max-missing 1.0 as indicated in the manual, but I want to use BCFTOOLS as it is much quicker. However, I cannot find the equivalent flag to do this. I have only found F_MISSING, which I think removes individuals and not sites.

vcftools bcftools vcf genome • 1.1k views
ADD COMMENTlink modified 17 months ago • written 17 months ago by GabrielMontenegro560
3
gravatar for finswimmer
17 months ago by
finswimmer14k
Germany
finswimmer14k wrote:
$ bcftools view -e 'GT[*] = "mis"' multi-sample.vcf

This would exclude sites where any genotype is missing.

ADD COMMENTlink modified 17 months ago • written 17 months ago by finswimmer14k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1601 users visited in the last hour