Question: How To Calculate A Population Specific Allele Freq In 1000 Genomes Release 20101123 Using Vcftools?
gravatar for Noel
8.9 years ago by
Noel90 wrote:

Hi all,

Following Lars question,

is there a way to calculate the population specific allele frequencies from the 20101123 1000 genome release using tabix and vcftools?

Currently I am just able to extract the genotypes with tabix for defined chromosomal regions and calculate their frequency with vcftools. However I know those frequencies are underestimated due to the mixture of populations·

I am running:

$ tabix -f -h 19:53344800-53344900 > LIG1.vcf

$ vcftools --vcf LIG1.vcf --freq --out freq-LIG1 suggested by Stephen.

Thank you!!

ADD COMMENTlink modified 8.9 years ago by Raony Guimarães1.1k • written 8.9 years ago by Noel90
gravatar for Raony Guimarães
8.9 years ago by
Dublin / Ireland
Raony Guimarães1.1k wrote:

You could get the individuals from the population you want to calculate from this file :

and calculate the frequencies with the command vcf-subset to get the genotypes only for this individuals:


$tabix -f -h 19:53344800-53344900 > LIG1.vcf

$vcf-subset -c NA0001,NA0002 LIG1.vcf > pop.LIG1.vcf

$vcftools --vcf pop.LIG1.vcf --freq --out freq-LIG1

ADD COMMENTlink written 8.9 years ago by Raony Guimarães1.1k

Thanks a lot!


ADD REPLYlink written 8.9 years ago by Noel90
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1408 users visited in the last hour