Question: Indel Left/Right Alignment
gravatar for sa9
6.3 years ago by
USA, Cambridge
sa9800 wrote:

When matching indels between different VCF files (generated by different callers), there is this issue with left / right indel alignment, For example:

enter image description here

Here is a real example for one indel (from the same sample) called by:

Samtools --> 1 161047125 CTATA C

GATK --> 1 161047130 TATAG T

I know GATK has a small tool called "LeftAlignIndels" to solve this issue in the BAM files but I can't use it.

I am wondering if someone knows what is the indel alignment direction in samtools, GATK and Dindel? Is there an easy way to correct this at level of VCF files?


indel • 8.0k views
ADD COMMENTlink modified 10 months ago by hugues.sicotte0 • written 6.3 years ago by sa9800
gravatar for lh3
6.3 years ago by
United States
lh331k wrote:

For NGS analysis, the convention is to left align indels. To use GATK and samtools, you should use an aligner that left aligns indels; otherwise at least samtools will have worse performance and accuracy. It is too late to fix the issue in VCF.

EDIT: wait.. In your example, the two callers deleted different bases. The two calls are intrinsically different. You cannot move the indel to make them the same.

ADD COMMENTlink modified 6.3 years ago • written 6.3 years ago by lh331k
gravatar for pd3
6.3 years ago by
pd3340 wrote:

There is the "vcf norm" tool in htscmd which left-aligns and normalizes indels in VCFs. It can be downloaded from github, google for 'htslib'.

ADD COMMENTlink written 6.3 years ago by pd3340

Thanks Ixe, we used similar tool from GATK specifically for VCF files ( but as lh3 pointed out, it is too late to correct in the VCF file.

ADD REPLYlink written 6.3 years ago by sa9800

Why too late in VCF?

ADD REPLYlink written 6 months ago by SmallChess480
gravatar for Erik Garrison
3.7 years ago by
Erik Garrison2.1k
Somerville, MA
Erik Garrison2.1k wrote:

I suggest taking a look at vt: It includes a very nice left alignment routine. They've got a nice paper describing the method for normalizing the representation:

ADD COMMENTlink written 3.7 years ago by Erik Garrison2.1k

Thanks Erik. Vt is awesome. We have been using it for couple of months now.

ADD REPLYlink modified 3.2 years ago • written 3.7 years ago by sa9800
gravatar for hugues.sicotte
10 months ago by
hugues.sicotte0 wrote:

a) Create a reversed reference genome b) Create a tool that reverse variants alleles .. and changes position (subtrack length_of_chrom +1- vcf_pos) c) Use a left-normalizing/shifting tool. d) use step b) again.

Voila.. Right shifting tool. Will post once I have it done.

ADD COMMENTlink written 10 months ago by hugues.sicotte0
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1045 users visited in the last hour