What should be the frequency of a Single Nucleotide variation to be called as pathogenic variant for a rare disease?
2
0
Entering edit mode
8.5 years ago
garimav89 • 0

In a rare disease having a small population size, direct sequencing data has given some single nucleotide mutations present in diseased individuals (Samples = 500 , controls = 500)for example). But the novel point mutation which are found is say only in 1/500. Can this mutation be called as novel and snp?

Secondly if this mutation is also present in healthy individuals, then it should be considered or not?

Some point mutations found in affected individuals (say 4/500), but also in controls (3/500), then can it be called as a mutation?

Thanks for your kind help

snp • 1.1k views
ADD COMMENT
4
Entering edit mode
8.5 years ago

If a variant is only found in one person then it is, by definition, a novel SNP. DO NOT REFER TO THIS AS A MUTATION WITHOUT FURTHER EVIDENCE! You will find many variants, few of them will be mutations.

If a variant is found in 0.2% of patients and a similar rate in controls then it's not causative. That should be obvious. There is no statistical difference between 4/500 and 3/500 (the p-value is 1 if you do a Fisher's test).

Why don't you summarize by genes rather than variants. If you have a bunch of variants in one gene (or a group of genes in a particular complex or pathway) then do the summarization and statistics in that way.

ADD COMMENT
0
Entering edit mode

I have bunch of variants in one gene but the information regarding their presence in same patient or not is not provided in the research papers, I went through.

ADD REPLY
0
Entering edit mode

Ah, so this isn't your own data. You'll have to contact the authors and ask them.

ADD REPLY
3
Entering edit mode
8.5 years ago

So your question is: Is that SNP associated with the disease?

               mutated        normal
diseased             4           496
healthy              3           497

In that case, it's obvious that your SNP is not significantly more present in diseased samples.

To make sure, you can make a Fisher's test (here in R):

fisher.test(matrix(c(4,496,3,497),nrow=2,ncol=2,),alternative="greater")

The results tells you that your SNP is not significantly more present in diseased samples:

Fisher's Exact Test for Count Data

data:  matrix(c(4, 496, 3, 497), nrow = 2, ncol = 2, )
p-value = 0.5
-----------^^
alternative hypothesis: true odds ratio is greater than 1
95 percent confidence interval: 0.2899126   ->  Inf
odds ratio = 1.335632
ADD COMMENT
1
Entering edit mode

ps : pvalue = 1 as Devon Ryan said if you do a two-sided test, i.e., without adding

alternative="greater"
ADD REPLY
0
Entering edit mode

I can't believe I forget alternative="greater"! Good catch!

ADD REPLY

Login before adding your answer.

Traffic: 2162 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6