How to correctly calculate Alternative Allele Frequency (AAF) from Samtools mpileup
0
0
Entering edit mode
2.5 years ago
Daniel ▴ 10

I have a hard time to figure it out how correctly calculate the Alternative Allele Frequency (AAF) from Samtools mpileup.

I received the code which using samtools 1.8 mpileup (following code) get the following format:

.... samtools mpileup \
    -l ... capture_targets.bed  \
    -t DP,AD,ADF,ADR,SP,INFO/AD,INFO/ADF,INFO/ADR \
    -d100000000 \
    --output-BP \
    --output-MQ \
    --output-QNAME \ ...

Later in the code it, samtools mpileup data were converted to long format, clean up and add AAF As can be seen in the following code, the "DP4"which is " Number of high-quality ref-forward, ref-reverse, alt-forward and alt-reverse bases, FORMAT"was divided into 4 parts of "AD1", "AD2", "AD3", "AD4".

finally used the AD2/DP to calculate the AAF?

 ... %>% separate(AD,   c("AD1", "AD2", "AD3", "AD4")) %>% 
             mutate(AAF = as.numeric( **AD2**)/as.numeric(DP))

However, to me "**AD2**" is the "ref-reverse" and I think it should be "AD3" as "Alt-forward" for calculation of the AAF?

... %>% mutate(AAF = as.numeric( AD3 )/as.numeric(DP))

`

I think following code should be correct?

Can someone help me please!

Thanks a lot!

AAF samtools SNP mpileup VCF • 531 views
ADD COMMENT

Login before adding your answer.

Traffic: 2905 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6