Question

Variant calling for MNP

0

Entering edit mode

9.9 years ago

nilus1432 ▴ 30

Hi All,

I would like to know is there any program for variant calling for multiple-nucleotide polymorphisms. For example, detecting cases like, Reference = 'ATA', Sample = 'GTC'.

Thanks a lot

RNA-Seq SNP • 5.9k views

ADD COMMENT • link updated 2.8 years ago by Ram 45k • written 9.9 years ago by nilus1432 ▴ 30

Ram · Answer 1 · 2015-08-20

RTG Core calls haplotypes such as this. In reply to rbagnall, the benefit of calling as haplotypes rather than as individual SNPs is that the variant caller is able combine the evidence across both sites rather than calling them independently (e.g. read evidence containing all of ATA, ATC, GTA, GTC looks like het A->G and het A->C when looking at the sites separately, but is completely bogus from a haplotype perspective).

Whether the output is represented as multi-base calls or is decomposed into smaller variants is orthogonal and largely personal opinion. Either way, with complex calls you are likely to encounter cases where variants may have multiple alternative, equivalent representations. Even decomposition tools don't get you to a canonical form for all cases, so haplotype-based variant comparison tools such as rtg vcfeval are critical for accurate comparison of complex variants.

Ram · Answer 2 · 2015-08-20

2

Entering edit mode

9.9 years ago

User 59 13k

Freebayes calls MNPs and complex polymorphisms as well as SNPs and indels

ADD COMMENT • link updated 2.8 years ago by Ram 45k • written 9.9 years ago by User 59 13k

0

Entering edit mode

I'm curious,

What is the benefit of calling a multiple-nucleotide polymorphism as shown in @nilus1432 question, rather than calling an A>G and an A>C single nucleotide variant? The latter are easier to annotate (missense, frequency in population controls etc...).

ADD REPLY • link updated 2.8 years ago by Ram 45k • written 9.9 years ago by rbagnall ★ 1.8k

1

Entering edit mode

Because you can annotate them correctly. Calling two adjacent SNPs and passing them to something like VEP will give you two different SNP effects, but if it's a genuine dinucleotide change within say a codon, you need to be able to assess the effect of their presence as a unit.

ADD REPLY • link updated 2.8 years ago by Ram 45k • written 9.9 years ago by User 59 13k

1

Entering edit mode

Thanks Daniel,

Yes that's true and something I hadn't considered. I will have to see if VEP annotates such MNPs; it looks like SNPEff does. Also, the train of thought led me to this paper which describes the issue a bit more for anyone interested, though I haven't tried the described software: http://www.biomedcentral.com/1471-2164/16/569

ADD REPLY • link updated 2.8 years ago by Ram 45k • written 9.9 years ago by rbagnall ★ 1.8k