Formula To Work Out How Many Individuals To Sequence To Get X% Of Common Variation
1
3
Entering edit mode
13.0 years ago
Pi ▴ 520

Greetings

I am working on a small bioinformatics project. Using the common disease, common variant hypothesis I am assuming that the common disease I am looking at is caused by common variants. Is there a formula which describes how many individuals need to be sequenced to find a specific percentage of the common variants in that species? So for example, if I have the sequence of 100 individuals I would like to estimate how much of the common variation (where common is defined as occuring in 1% of the population) might be present in the data (and hence how much of the common variation might not be present). I don't know if such a formula exists of whether it is too difficult to predict because of the variables (e.g sequencing depth, exome vs whole genome). I looked at the 1000 genome project which estimates to have discovered 95% of common variation. The different sequencing approaches in this project had a different power to detect variants.

Thank-you for your time

genome variant • 2.1k views
ADD COMMENT
5
Entering edit mode
13.0 years ago

You can perform various statistical power estimation using OPERA - Optimal design and Power Estimation for Rare variant Association (OPERA).

Also see Genetic Power Calculator and Stephen's post on the topic here that provides details about tools like CaTS Power, PS Power and G* Power.

ADD COMMENT

Login before adding your answer.

Traffic: 2710 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6