Question: Genotyping diploid organisms from Sanger AB1 files
gravatar for ognjen011
4 weeks ago by
ognjen011150 wrote:


I am trying to automate variant calling on human short exons sequenced with Sanger method. I have the ab1 files, and I want to automate the process for clinical geneticists. The program should be free, I understand that is a huge limitation. Also, it needs to register INDELS in addition to SNPs.

I tried working with tracy, and it works fine but fails silently, and fails to index the reference (I have opened github issues for that). TraceTuner doesn't seem to do anything when I use the -het flag, it just outputs the single consensus fasta (no heterozygous calls). They currently work on Sequencher trial version, but this version doesn't do variant calling. Baseling also doesn't mention automated heterozygous calls. Although there are mentions of writing in ambiguous bases which could be a code for a heterozygous, this becomes a problem for INDELs.

Is there a free resource that somebody uses successfully for human diploid data variant calling? Thanks all!

sanger variant-calling • 120 views
ADD COMMENTlink modified 28 days ago by trausch1.2k • written 4 weeks ago by ognjen011150

Maybe Phred/Phrap can help here, but I am not sure. See this paper.

ADD REPLYlink written 4 weeks ago by Benn6.6k

When I worked on Sanger data I used Phred/Phrap/Polyphred, and it was probably the best tool around. If you are using the results in clinical setting you might investigate the issues brought forward by genomax with your clinical colleagues.

ADD REPLYlink written 4 weeks ago by Fabio Marroni2.1k

While I understand a desire to use a free solution have you checked applicable local administrative policy for this. You may need (or be required) to use a validated/approved (and thus perhaps not free) application, if the data is going to be used for patient diagnostics.

ADD REPLYlink written 4 weeks ago by genomax65k

This is a very valid concern. If all fails, it will be a nice way to screen or double-check.

ADD REPLYlink written 29 days ago by ognjen011150
gravatar for trausch
28 days ago by
trausch1.2k wrote:

The problem with the indexing in tracy was the leading whitespace, right? If there is any other issue please let us know.

tracy decompose -f align -o out -a homo_sapiens -g Homo_sapiens.GRCh38.dna.primary_assembly.fa.gz trace.ab1

Tracy is still work-in-progress so any feedback is very welcome. The above command generates alignments, decomposes InDels, annotates variants with rs identifiers and generates a BCF file for downstream processing. The reference genome index files can be downloaded here and please let us know if your species of interest is missing.

ADD COMMENTlink written 28 days ago by trausch1.2k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 889 users visited in the last hour