Question: Exclude sites on the basis of the proportion of missing data using BCFTOOLS
GabrielMontenegro (United Kingdom) wrote, 11 months ago:

I want to do something very simple. I want to keep sites that do not have any missing genotypes (i.e. 100% present across all samples).

In vcftools I usually do --max-missing 1.0 as indicated in the manual, but I want to use BCFTOOLS as it is much quicker. However, I cannot find the equivalent flag to do this. I have only found F_MISSING, which I think removes individuals and not sites.

finswimmer (Germany) wrote, 11 months ago:
$ bcftools view -e 'GT[*] = "mis"' multi-sample.vcf

This excludes every site where at least one sample has a missing genotype, so only sites with complete genotype data are kept.
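To check on a small file what this filter is doing, the same idea can be sketched in plain awk. This is only an illustration, not a replacement for bcftools: `keep_complete_sites` is a hypothetical helper name, and the sketch assumes an uncompressed, tab-delimited VCF on standard input with GT as the first FORMAT subfield. It treats any genotype containing "." (including half-missing calls such as `./1`) as missing, which may not match how bcftools handles those cases.

```shell
# Hypothetical helper (not part of bcftools): pass header lines through and
# keep only sites where no sample has a missing genotype.
# Assumes an uncompressed VCF on stdin with GT as the first FORMAT subfield.
keep_complete_sites() {
  awk -F'\t' '
    /^#/ { print; next }              # header lines are kept unchanged
    {
      for (i = 10; i <= NF; i++) {    # sample columns start at field 10
        split($i, f, ":")             # GT is the first colon-separated subfield
        if (f[1] ~ /\./) next         # a "." in GT means missing: drop the site
      }
      print                           # every sample had a called genotype
    }'
}
```

It would be run as `keep_complete_sites < multi-sample.vcf > complete-sites.vcf`. On the F_MISSING point in the question: as documented in the bcftools filtering expressions, F_MISSING is the fraction of missing genotypes at a site, so it filters sites rather than individuals, and `bcftools view -e 'F_MISSING > 0' multi-sample.vcf` should be another way to express the same filter.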
