Question: Is There A Snp-Caller That Takes In Account For Whether A Snp Has Been Seen Before?
4
gravatar for Jeremy Leipzig
2.4 years ago by
Philadelphia, PA
Jeremy Leipzig12k wrote:

I have not done a survey of SNP-callers but I am wondering do any of them assign confidence after annotating against dbSNP rather than before?

snp
ADD COMMENTlink written 2.4 years ago by Jeremy Leipzig12k

I am interested in this..so far I could not find one...

ADD REPLYlink written 2.4 years ago by Rm6.5k

SOAPsnp can do that, I think.

ADD REPLYlink written 2.4 years ago by lh320k

I guess there could be some problems in doing what you suggest. How much evidence from the data would you be ready to throw away in order not to disagree with dbSNP? And would not already existing - less biased - filters (e.g. quality of the bases, uniqueness of the mapping, observing the variant on both strands, etc...) already be sufficient? And if they're not sufficient should one not rethink the experimental design - (say sequence less regions with higher coverage?) instead of using very informative priors? I am genuinely unsure, I am not claiming what you suggest is unsuitable.

ADD REPLYlink written 2.4 years ago by xenophiliuslovegood150

I am saying common sense would dictate the bar should be lower for calling SNPs that we know exist in our population of interest than for novel SNPs, sometimes called SNV's. As Larry and others have mentioned, the "population of interest" is certainly up for debate.

ADD REPLYlink written 2.4 years ago by Jeremy Leipzig12k
3
gravatar for Larry_Parnell
2.4 years ago by
Larry_Parnell15k
Boston, MA USA
Larry_Parnell15k wrote:

Intriguing idea. I don't know of such an approach being implemented in a functional, available SNP-caller. However, a couple points come to mind.

First, not all SNPs in dbSNP are of the same standard or confidence.

Two, such a SNP-caller as envisioned in the question would need to take into account population in the sense that a called SNP might have non-zero minor allele frequency (MAF) in south Asians, for example, but zero MAF in Africans (or in one population from Africa). Think of the results of calculations for Neandertal genome in Homo sapiens: ~4% in non-Africans, essentially 0% in Africans.

Three, applying such to admixed populations would be - complex, to say the least.

So, perhaps all I've done is outline some concerns or constraints for algorithm design. Maybe if there is such a SNP-caller out there, these are a couple points to apply to it as a test of its robustness.

ADD COMMENTlink written 2.4 years ago by Larry_Parnell15k
3
gravatar for lh3
2.4 years ago by
lh320k
lh320k wrote:

I believe SOAPsnp can do this. MAQ actually has such a sub-tool, but I abandoned it back to 2008. I think this strategy causes more problems than good.

ADD COMMENTlink written 2.4 years ago by lh320k
1
gravatar for Damian Kao
2.4 years ago by
Damian Kao10k
UK
Damian Kao10k wrote:

A quick literature search found: http://bioinformatics.oxfordjournals.org/content/25/1/6.long

Apparently the new software they have is called SliderII: http://www.bcgsc.ca/platform/bioinfo/software/SliderII

You are able to use known SNPs as priors to inform the SNP calling. I haven't read much into it yet. This will probably go on my huge to-read pile...

ADD COMMENTlink written 2.4 years ago by Damian Kao10k

SliderII looks interesting but relies on a reference, which could be from a specific population or not. Thus, the dependency on that references of this tool to call SNPs based on prior means that SNP calling in a "new" population, be that human or another species, may make its implementation difficult.

ADD REPLYlink written 2.4 years ago by Larry_Parnell15k
Please log in to add an answer.

Help
Access
  • RSS
  • Stats
  • API

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.0.0
Traffic: 881 users visited in the last hour