Question: How To Calculate A Population Specific Allele Freq In 1000 Genomes Release 20101123 Using Vcftools?
Asked 8.0 years ago by Noel:

Hi all,

Following Lars's question,

Is there a way to calculate population-specific allele frequencies from the 20101123 1000 Genomes release using tabix and vcftools?

Currently I can only extract the genotypes for defined chromosomal regions with tabix and calculate their frequencies with vcftools. However, I know those frequencies are underestimated because the populations are mixed.

I am running:

$ tabix -f -h ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20101123/interim_phase1_release/ALL.chr19.phase1.projectConsensus.genotypes.vcf.gz 19:53344800-53344900 > LIG1.vcf

$ vcftools --vcf LIG1.vcf --freq --out freq-LIG1

...as suggested by Stephen.

Thank you!!

Answer by Raony Guimarães (Dublin / Ireland), 8.0 years ago:

You can find the individuals that belong to the population you are interested in from this panel file:

ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20101123/interim_phase1_release/interim_phase1.20101123.ALL.panel

Then use vcf-subset to keep the genotypes of only these individuals, and calculate the frequencies on the resulting subset.

Example (replace NA0001,NA0002 with the sample IDs of your population):

$ tabix -f -h ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20101123/interim_phase1_release/ALL.chr19.phase1.projectConsensus.genotypes.vcf.gz 19:53344800-53344900 > LIG1.vcf

$ vcf-subset -c NA0001,NA0002 LIG1.vcf > pop.LIG1.vcf

$ vcftools --vcf pop.LIG1.vcf --freq --out freq-LIG1
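
Typing sample IDs by hand does not scale to a whole population, so the list can be pulled from the panel file instead. The following is a minimal sketch, not a tested pipeline: it assumes the panel file is tab-separated with the sample ID in column 1 and the population code in column 2, and the helper name `pop_samples`, the population code `GBR`, and the output file names are only illustrative.

```shell
# Print the sample IDs that belong to one population.
# Assumption: the panel file is tab-separated, with the sample ID in
# column 1 and the population code in column 2.
pop_samples() {
    panel="$1"   # path to a local copy of the panel file
    pop="$2"     # population code to select (e.g. GBR)
    awk -F '\t' -v pop="$pop" '$2 == pop { print $1 }' "$panel"
}

# Hypothetical usage (file names and population code are illustrative):
#   pop_samples interim_phase1.20101123.ALL.panel GBR > GBR.samples
#   vcftools --vcf LIG1.vcf --keep GBR.samples --freq --out freq-LIG1-GBR
```

vcftools can restrict the analysis to the individuals listed in a file with `--keep`, which would make the intermediate vcf-subset step optional here. Repeating the last two commands for each population code should give one frequency file per population.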


Thanks a lot!

Noel
