Question: The Variance in HWE
2.1 years ago
United Kingdom
GabrielMontenegro510 wrote:

I'm reading Design, Analysis, and Interpretation of Genome-Wide Association Scans by Daniel O. Stram.

In chapter 2 I found:

If we have a sample of N unrelated individuals from a population, the total count of A alleles in the sample follows a binomial distribution with number of trials = 2N and A allele frequency = p

p can be estimated as:

p = ( 1/(2N) ) * SUM (niA)

Where niA = number of A alleles in individual i

And the variance of this estimate is:

Var(p) = ( 1/(2N)^2 ) * SUM Var (niA)

But I do not understand why 2N is squared in the second equation.

Thank you.

hwe
written 2.1 years ago by GabrielMontenegro510
2.1 years ago
United States
atks10 wrote:

It's a property that follows from the definition of variance: for a constant a, Var(aX) = a^2 Var(X).

written 2.1 years ago by atks10

OK, it's a property, but why that particular number?

written 2.1 years ago by GabrielMontenegro510

The estimate of the population allele frequency is p^{hat} = sum_i{niA} / (2N), where niA is the number of copies of the A allele for individual i and N is the number of individuals. You divide by 2N because you assume that each individual is diploid, so N individuals carry 2N alleles in total.

So Var(p^{hat}) = Var( sum_i{niA} / (2N) ) = (1/(2N))^2 Var( sum_i{niA} ) [by the above-mentioned property, Var(aX) = a^2 Var(X) with a = 1/(2N)] = (1/(2N))^2 sum_i Var(niA) [because the individuals are independent]
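
If it helps to see this numerically, here is a minimal simulation sketch of the derivation above. It assumes Hardy-Weinberg equilibrium, so each niA is drawn as Binomial(2, p) and Var(niA) = 2p(1-p). The function names and the values N = 50, p = 0.3 are made up for illustration and do not come from the book.

```python
import random
import statistics


def simulate_p_hat(n_individuals, p, rng):
    """Return the estimated allele frequency for one simulated sample."""
    # Assumption: under HWE each individual's count niA is Binomial(2, p),
    # i.e. two independent allele draws, each being A with probability p.
    total_a = sum(
        (rng.random() < p) + (rng.random() < p) for _ in range(n_individuals)
    )
    return total_a / (2 * n_individuals)


def theoretical_variance(n_individuals, p):
    """(1/(2N))^2 * sum_i Var(niA), with Var(niA) = 2p(1-p) for every i."""
    var_single = 2 * p * (1 - p)
    return (1 / (2 * n_individuals)) ** 2 * n_individuals * var_single


rng = random.Random(0)   # fixed seed so the run is repeatable
N, p = 50, 0.3           # illustrative values only
estimates = [simulate_p_hat(N, p, rng) for _ in range(20000)]

empirical = statistics.pvariance(estimates)
expected = theoretical_variance(N, p)
print("empirical variance of p_hat:", empirical)
print("formula (1/(2N))^2 * sum Var(niA):", expected)
```

With these values the formula simplifies to p(1-p)/(2N) = 0.0021, and the empirical variance over many simulated samples should land close to it. Without the square on 2N, the formula would give a value 2N times too large.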

written 2.1 years ago by atks10
