Question: 1000 Genome Derived Allele Frequency
gravatar for chungchau.hon
3.8 years ago by
chungchau.hon10 wrote:

Hi all,

I am looking for a way to extract derived allele frequency (DAF) of varirants from 1000 genome vcf: ALL.wgs.phase3_shapeit2_mvncall_integrated_v5a.20130502.sites.vcf.gz


1    10177    rs367896724    A    AC    100    PASS    AC=2130;AF=0.425319;AN=5008;NS=2504;DP=103152;EAS_AF=0.3363;AMR_AF=0.3602;AFR_AF=0.4909;EUR_AF=0.4056;SAS_AF=0.4949;AA=|||unknown(NO_COVERAGE);VT=INDEL

Does the "AF=0.425319" in the INFO field equivalent to the derived allele frequency?

Thanks in advance.





ADD COMMENTlink modified 3.8 years ago by lh331k • written 3.8 years ago by chungchau.hon10
gravatar for lh3
3.8 years ago by
United States
lh331k wrote:

Please read the vcf spec first. AF is the alternate allele frequency. AA is the ancestral allele. If both present, you can use it to get the derived allele frequency by flipping AF.

ADD COMMENTlink written 3.8 years ago by lh331k


 I would like to ask you by new example for extraction of the DAF from 1000 genomes vcf.

In prior post, example .vcf line does not contain ancestral allele information (shown as "unknown"). It is supposed that if the ancestral allele is not addressed at the variant, we cannot extract DAF from the .vcf line. 

>1 10177 rs367896724 A AC 100 PASS AC=2130;AF=0.425319;AN=5008;NS=2504;DP=103152;EAS_AF=0.3363;AMR_AF=0.3602;AFR_AF=0.4909;EUR_AF=0.4056;SAS_AF=0.4949;AA=A|||;VT=INDEL

If the variant has ancestral allele such as AA=A in the new example, ancestral and reference alleles are same base in the variant.

Does the "AF=0.425319" equivalent to the DAF in this case? I think it is not required for flipping AF (1 - AF). 



ADD REPLYlink written 3.4 years ago by suimye0
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 674 users visited in the last hour