filter 1000 genome vcf file to keep common snps
1
0
Entering edit mode
3 months ago
shersky • 0

I want to filter "1000 genome vcf file" to keep only common SNPs. Can someone explain the difference between

1) Filtering by allele frequency AF greater than 0.05 (which I found yields more variants)

bcftools view -i 'AF>0.05'  variants.vcf -o common_variants.vcf

2) Filtering by MAF greater than 0.05

bcftools view -i 'MAF>0.05' variants.vcf -o common_variants.vcf

Which one is correct?

bcftools SNPs • 310 views
ADD COMMENT
1
Entering edit mode
3 months ago
bk11 ★ 2.4k

AF is the frequency for an alternate allele. If AF < 0.5, then AF is equal to MAF. Rare variants generally has AF or MAF < 0.05. MAF (Minor allele frequency) refers to the minor allele (least frequent) frequency. Common SNPs have MAF > 0.05. Your filtering method 2 is the correct one to filter common SNPs.

bcftools view -i 'MAF>0.05' variants.vcf -o common_variants.vcf
ADD COMMENT

Login before adding your answer.

Traffic: 2532 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6