Question: Predicted Impact Of Indels?
gravatar for Sam
7.8 years ago by
Sam70 wrote:


Does anyone know of any good algorithms for predicting the functional impact of indels? Similar in nature to SIFT, PolyPhen2, Panther, Mapp, etc. (however as far as I am aware, these are limited to nsSNPs).

Thank you for your input,


prediction variant functional • 6.7k views
ADD COMMENTlink modified 4.7 years ago by suleymanvural40 • written 7.8 years ago by Sam70
gravatar for Aaronquinlan
7.8 years ago by
United States
Aaronquinlan10k wrote:

ANNOVAR, the Ensembl Variant Effect Predictor, and the SeattleSeq SNP Annotation site will each indicate whether an INDEL is intergenic, intronic, or exonic, and if exonic, whether or not it causes a frameshift. This is basically the extent of the functional prediction, as frameshift mutations are classically interpreted as loss-of-function. I should note, however, that I have been studying frameshifts in exome datasets and find that most (can't quantify) occur in the terminal exon. In other words, not all LOF frameshifts are created equal, so I would advise augmenting these calls with a sense of where the mutation occurred w.r.t. the transcript and protein domains.

I recall that all three support VCF, though I believe the Seattle requires a special format for INDELs. For what it's worth, I have found ANNOVAR to be accurate and intuitive to use, even for large datasets.

ADD COMMENTlink modified 7.8 years ago • written 7.8 years ago by Aaronquinlan10k

agreed on annovar. though it requires you to convert from vcf to their format

ADD REPLYlink written 7.8 years ago by brentp22k
gravatar for Larry_Parnell
7.8 years ago by
Boston, MA USA
Larry_Parnell16k wrote:

You need to examine the indel sequence both in isolation (the sequence itself) as well as within the context of the surrounding genomic sequence. An insertion could create a novel transcription factor (TF) binding motif, for example, and not analyzing the join between insertion and reference sequences will miss this. My approach is to analyze in parallel the alleles as though they are straight from a genome sequencing project. I'd ask questions like:

-Does the deletion alter an intron splice site or the prediction of such?

-Do the alleles alter predicted TF binding sites?

-Would the mRNA 3'-UTR structure or interaction with microRNAs be allele-specific?

-If the indel is not 140 bp in length, or some even multiple thereof, could it alter nucleosome phasing?

I like (and have voted up) what Aaron provides. I'm just expanding on that and emphasizing that you may need to think beyond available tools to a whole range of functional consequences.

ADD COMMENTlink written 7.8 years ago by Larry_Parnell16k
gravatar for Hmm
7.4 years ago by
Hmm490 wrote:

try snpEff..

ADD COMMENTlink written 7.4 years ago by Hmm490

nice alternative to ANNOVAR

ADD REPLYlink written 7.2 years ago by Prateek1000
gravatar for suleymanvural
4.7 years ago by
United States
suleymanvural40 wrote:

There is also this one, goes beyond annotating the variant even for frameshifting indels

Predciting the effects of frameshifting indels

And this one, but this can only make effect prediction for non-frameshifting indels


ADD COMMENTlink modified 4.7 years ago • written 4.7 years ago by suleymanvural40
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1744 users visited in the last hour