Tool: TRAL - Tandem Repeat Annotation Library
8.7 years ago
Elke Schaper ▴ 110

For my Ph.D., I've implemented solutions to a large number of tasks related to sequence tandem repeats. We've now decided to make the code accessible and reusable for others, hoping that it's going to save a lot of time for some of you!

Features

  • Detect nucleic or protein tandem repeats with de novo software. TRAL can be used to run, parse, merge and output results from external tandem repeat detection tools in an output format of choice.
  • Detect tandem repeats with sequence profile HMMs. This is useful when you already know the sequence of your tandem repeat more or less and want either to refine the annotation (e.g. if some repeat units are missing from the annotation) or to search for homologous tandem repeats in other sequences.
  • Statistical significance analysis of putative tandem repeats. We and others have found that specificity is a big issue with many tandem repeat annotation tools. To make sure you can trust your tandem repeat annotations, TRAL ships with ad hoc and model-based statistical tests for nucleic and protein tandem repeats. Using these tests, each tandem repeat is tagged with a p-value, and you can decide the threshold.
  • Overlap detection and filtering. When you merge tandem repeat annotations from several sources, you may want to discard overlapping repeats. Several definitions of overlap are implemented in TRAL.
  • New: Reconstruct tandem repeat unit phylogenies.
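
To make the significance and overlap filtering steps above more concrete, here is a minimal, self-contained sketch. It does not use TRAL's API: the `Repeat` class, the `overlaps` and `filter_repeats` functions, the detector names, and the 0.05 threshold are all hypothetical and chosen for illustration only. The overlap definition used (two repeats share at least one sequence position) is just one of several possible definitions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Repeat:
    """Hypothetical stand-in for one annotated tandem repeat (not TRAL's class)."""
    source: str    # which detector reported the repeat
    begin: int     # 1-based start position, inclusive
    end: int       # 1-based end position, inclusive
    pvalue: float  # significance assigned by a statistical test


def overlaps(a, b):
    """Assumed overlap definition: the repeats share at least one position."""
    return a.begin <= b.end and b.begin <= a.end


def filter_repeats(repeats, threshold=0.05):
    """Keep significant repeats, then greedily drop overlapping ones."""
    significant = [r for r in repeats if r.pvalue <= threshold]
    kept = []
    # Visit the most significant repeats first so they win overlap conflicts.
    for candidate in sorted(significant, key=lambda r: r.pvalue):
        if not any(overlaps(candidate, k) for k in kept):
            kept.append(candidate)
    return sorted(kept, key=lambda r: r.begin)


# Made-up annotations merged from two hypothetical detectors.
repeats = [
    Repeat("detector_a", 10, 60, 0.001),
    Repeat("detector_b", 40, 90, 0.020),    # overlaps the first, less significant
    Repeat("detector_a", 120, 150, 0.300),  # fails the p-value threshold
    Repeat("detector_b", 200, 260, 0.010),
]
result = filter_repeats(repeats)
for r in result:
    print(r.source, r.begin, r.end, r.pvalue)
```

Resolving conflicts in order of p-value is only one design choice; you could just as well prefer the longer repeat or the one with more repeat units.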

Technical details

Tutorials

  • Extensive tutorials are available on GithubIO. Please mail me if you wish for a tutorial for a specific task!

Example

This is a short example of how you can annotate your sequences with TRF in three lines of code:

# Python 3
from tral.sequence import sequence
sequences = sequence.Sequence.create(file="path/to/my/sequences.fa", input_format="fasta", sequence_type="DNA")
tandem_repeats = [i_seq.detect(denovo=True, detection={"detectors": ["TRF"]}) for i_seq in sequences]

More examples are available in the docs.

Your feedback - every comment is helpful!

If you believe TRAL might help your research or save you time, please feel free to contact me or post to the project:

  • Feature requests
  • How to implement specific tasks
  • Bug reports

Publications

Here is some background on TRAL:
