Question: Extrapolating allele frequencies from gnomad
gravatar for oakhamwolf
6 months ago by
oakhamwolf0 wrote:

Hi all,

I am playing around with gnomad data and wondering if there is an accepted way to extrapolate allele frequencies from gnomad out to get an approximate number of people affected in a given population.

My efforts have been based on using the Hardy Weinberg formula but I don't think I am using it correctly.

For example, in gnomad the variant ABCA4:c.5461-10T>C which is linked to the autosomal recessive Stargardts disease has a minor allele frequency of 0.0002272 in the non-Finnish European population. A quick google gave me over 700M for Europe and 6M in Finland leaving me with 694M individuals. Applying the Hardy Weinberg equation to this data I get the following:

F(q) = 0.0002272; F(p) = 0.9997728

F(pp) = 0.99954; F(pq) = 0.00022714; F(qq) = 0.00000000516

So taking the predicted hom_alt (autosomal recessive) and applying that to the 694M individuals I get 36 individuals. 1 in 8-10 thousand people have Stargardt disease according to so this should be closer to 87000.

Would someone be able to point me in the right direction or maybe tell me whether or not this is feasible? Many thanks for any help in advance.


gnomad allele frequency • 251 views
ADD COMMENTlink written 6 months ago by oakhamwolf0

As a first quick comment: Hardy weinberg equilibrium is only valid for alleles on which no natural selection happens. Therefore you probably shouldn't use it for disease causing variants.

ADD REPLYlink written 6 months ago by WouterDeCoster43k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1696 users visited in the last hour