Question: Which Tools do you use for (Tandem) Repeat Detection? Why?
5.3 years ago by
voidnyx10 wrote:


I recently started work on a university project in which I want to analyze repetitive sequences in the genomic DNA of two closely related species. I have no prior experience with this and have read some papers to get started.

I found there are a handful of programs for tandem repeat detection, such as TRF, Mreps, ATRhunter, RepeatMasker and so on. They are based on different approaches, and the articles I read offer different opinions, so I could not find a consensus on which program to use, or even on their distinct (dis)advantages.

What I took away from them is that you have to be extra careful with your parameter settings for most of the programs, and that you basically have to be the author of the program to actually understand what's happening.


So I would like to hear opinions from people who actually use some kind of repeat detection software or have experience with the matter. Which programs do you think are good, and where do you see their advantages or disadvantages?


PS: I am not sure this is the right place for this kind of "open" question, so feel free to move it.

modified 5.2 years ago by Elke Schaper60 • written 5.3 years ago by voidnyx10

Use TRF (this is based on my subjective experience and on the objective number of citations in PubMed):

  • TRF - 929  
  • Mreps - 70
  • ATRhunter - 12
  • RepeatMasker uses TRF


modified 5.3 years ago • written 5.3 years ago by PoGibas4.8k

The definition of a tandem repeat is itself dark magic: how many times must a motif repeat? How many mismatches or gaps do we allow? I guess it is hard to really judge which tool is clearly more accurate than the others. Just stick with what everyone uses: TRF. UCSC shows its settings, which you can copy. If you have already run RepeatMasker, you can take its output.

written 5.3 years ago by lh332k
5.2 years ago by
Elke Schaper60 wrote:

I've worked quite a bit with tandem repeats for my Ph.D. In the very beginning, I learned the same thing you describe: the tool, combined with the chosen parameters, has a large influence on the result.

Check out my own and others' publications on the topic:

Repeat or not repeat?- Statistical validation of tandem repeat prediction in genomic sequences

Detecting short tandem repeats from genome data: opening the software black box


What I've learned from all this is:

- There are more useful and less useful tandem repeat detection tools. TRF might be well cited, but it suffers from very low sensitivity and black-box code.

- It is often useful to collect data from several tandem repeat detectors, to make sure your sensitivity is sufficient.

- For protein tandem repeat detection tools, it is necessary to perform a statistical test on each proposed tandem repeat to control for false positive annotations. I'm not sure whether this is true for genomic tandem repeats also, but I wouldn't be surprised, again given the black box character of many of the tools.
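The second point above, pooling the calls of several detectors, can be sketched roughly as follows. This is only an illustration: the function name `merge_calls` and the tuple layout `(tool, seq_id, start, end)` are assumptions, and it presumes you have already parsed each tool's output into 0-based, half-open coordinates yourself.

```python
from collections import defaultdict


def merge_calls(calls):
    """Union overlapping repeat calls from several tools.

    calls: iterable of (tool, seq_id, start, end), 0-based half-open.
    Returns a list of (seq_id, start, end, tools), where tools is a
    sorted tuple of the tool names supporting the merged region.
    """
    by_seq = defaultdict(list)
    for tool, seq_id, start, end in calls:
        by_seq[seq_id].append((start, end, tool))

    merged = []
    for seq_id in sorted(by_seq):
        cur_start = cur_end = None
        cur_tools = set()
        for start, end, tool in sorted(by_seq[seq_id]):
            if cur_start is not None and start < cur_end:
                # Overlaps the open region: extend it and record the tool.
                cur_end = max(cur_end, end)
                cur_tools.add(tool)
            else:
                # No overlap: close the open region (if any), start a new one.
                if cur_start is not None:
                    merged.append((seq_id, cur_start, cur_end, tuple(sorted(cur_tools))))
                cur_start, cur_end, cur_tools = start, end, {tool}
        if cur_start is not None:
            merged.append((seq_id, cur_start, cur_end, tuple(sorted(cur_tools))))
    return merged
```

Regions supported by only one tool are the ones worth a closer look, or a statistical test, before you trust them.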


However, if you're lucky, you're only interested in non-diverged or perfect tandem repeats. In this case, the detection task is simple, and one tool (but in my experience not TRF) will do, and no additional testing for false positives is necessary.
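To show why the perfect-repeat case is simple, here is a minimal sketch using a regular expression with a backreference. It is not how any of the tools mentioned in this thread work internally; the function name and the defaults `max_unit=6` and `min_copies=3` are arbitrary assumptions that you would need to choose for your own data.

```python
import re


def find_perfect_repeats(seq, max_unit=6, min_copies=3):
    """Find perfect (mismatch-free) tandem repeats in a DNA string.

    Returns a list of (start, end, unit, copies), 0-based half-open.
    The lazy quantifier tries the shortest unit first, so "AAAAAA" is
    reported with unit "A" rather than "AA" or "AAA".
    """
    seq = seq.upper()
    # ([ACGT]{1,max_unit}?) captures the unit; \1{n,} requires at least
    # n further adjacent copies of exactly the same unit.
    pattern = re.compile(r"([ACGT]{1,%d}?)\1{%d,}" % (max_unit, min_copies - 1))
    hits = []
    for m in pattern.finditer(seq):
        unit = m.group(1)
        copies = len(m.group(0)) // len(unit)
        hits.append((m.start(), m.end(), unit, copies))
    return hits
```

For example, `find_perfect_repeats("GTACACACACGTTCAGCAGCAGT")` reports the `AC` run and the `CAG` run and ignores the `TT` dinucleotide, because it has fewer than three copies. As soon as you allow mismatches or indels this approach no longer works, which is where the dedicated tools and their parameters come in.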


Good luck!


written 5.2 years ago by Elke Schaper60

Could you name the tool you would recommend then?

written 4.4 years ago by Biomonika (Noolean)3.1k
5.3 years ago by
h.mon31k wrote:

I am experimenting with MISA now (210 PubMed citations), because it already has some scripts to design primers from its output. So far, I am happy, but I haven't tried any PCR yet.


modified 5.3 years ago • written 5.3 years ago by h.mon31k

Hi h.mon. Can you tell me how you convert the MISA-generated files into a Primer3 input file? I used the Perl program, but it reports the following problem: "use of uninitialized value $count in concatenation or string in line 34 <SRC> Chunk 22205". Could you please help me get rid of this problem?

modified 4.7 years ago • written 4.7 years ago by rakeshmbb0

Which version of Primer3? MISA is quite old and needs the Primer3 version 1 series; it won't work with the version 2 series.

written 4.7 years ago by h.mon31k

Yes, that's right. But it does not give any output in my case. The output will be the input for Primer3. I have both versions of Primer3 anyway.

written 4.7 years ago by rakeshmbb0
