Question: What Is Ad (Allelic Depth) In 1000Genomes Vcf?
12
gravatar for Chronos
9.7 years ago by
Chronos600
Germany
Chronos600 wrote:

The VCF header says

AD: Allelic depths for the ref and alt alleles in the order listed

I understand what "read depth" and "allele" are, but I do not quite understand what "allelic depth" means.

I've read both VCF4.0 specification and VCF development history, but none of them had the answer.

genome read allele • 17k views
ADD COMMENTlink modified 3.8 years ago by anderspitman60 • written 9.7 years ago by Chronos600
8
gravatar for Casbon
9.7 years ago by
Casbon3.2k
Casbon3.2k wrote:

Allele specific depth. i.e. if ref is 'A' and alt is 'G' and AD is '6,9' you got 6 A reads and 9 G reads.

If this is produced by GATK look in the header output - it usually explains each flag.

EDIT:

Checked GATK, read depth is only those used in calling.

ADD COMMENTlink modified 9.6 years ago • written 9.7 years ago by Casbon3.2k
1

So, to clarify:

DP is equal to the number of reads used for calling the variant

AD numbers are the total number of reads with each allele (including reads not used for calling due to low quality or whatever).

Seems un-intuitive to me, but I'll take it.

ADD REPLYlink written 9.6 years ago by Michael.James.Clark560

I thought so initially, but then 6+9 should equal DP, correct? However, this is not the case for 1000 genomes data: there the sum of ADs is >= DP.

Or does AD represent a count of per-allele unfiltered reads? DP is for filtered reads only, so it would explain why Sum(AD) >= DP.

ADD REPLYlink written 9.7 years ago by Chronos600

Interesting, sounds like a bug to me.

ADD REPLYlink written 9.7 years ago by Casbon3.2k

Might be, I'm not sure (thus asked). Here's an example line fragment :

GT:AD:DP:GQ:PL  0/0:1,0:1:3.01:0,3,330/0:4,0:2:6.01:0,6,63    0/0:2,1:1:3:0,3,28      ./.     0/0:1,0:1:2.37:0,2,8    0/0:3,0:1:3.01:0,3,33   0/0:7,0:3:9.03:0,9,109./.     ./.     0/0:11,0:3:9.01:0,9,85  ./.     0/0:37,0:1:2.99:0,3,23  ./.     0/0:9,0:5:15.03:0,15,158        ./.     ./.  0/1:16,1:2:21.91:29,0,22 ./.     ./.     0/0:3,0:1:3.01:0,3,33   0/0:2,1:2:6.01:0,6,59

This is X:60009 A>C from ALL.chrX.BI_Beagle.20100804.genotypes.vcf.gz.

ADD REPLYlink modified 12 months ago by RamRS30k • written 9.7 years ago by Chronos600
3
gravatar for anderspitman
3.8 years ago by
anderspitman60
United States
anderspitman60 wrote:

From the GATK forum:

AD and DP : Allele depth and depth of coverage.

These are complementary fields that represent two important ways of thinking about the depth of the data for this sample at this site.

AD is the unfiltered allele depth, i.e. the number of reads that support each of the reported alleles. All reads at the position (including reads that did not pass the variant caller’s filters) are included in this number, except reads that were considered uninformative. Reads are considered uninformative when they do not provide enough statistical evidence to support one allele over another.

DP is the filtered depth, at the sample level. This gives you the number of filtered reads that support each of the reported alleles. You can check the variant caller’s documentation to see which filters are applied by default. Only reads that passed the variant caller’s filters are included in this number. However, unlike the AD calculation, uninformative reads are included in DP.

ADD COMMENTlink modified 3.8 years ago • written 3.8 years ago by anderspitman60

I think you've actually got it backwards. From it looks like DP is all reads (informative and uninformative) whereas AD is only informative reads.

ADD REPLYlink written 8 months ago by lincoln.harris0
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1175 users visited in the last hour