Question: How To Bin Metagenomic Assembled Contigs Using Tetranucleotide Frequency
7.7 years ago by
Shuixia100120 wrote:

Hi there,

Does anyone know of a tool that can calculate the tetranucleotide frequencies of a FASTA file and cluster the sequences based on the results?

I'm trying to bin my assembled contigs in order to develop draft genomes, basically following the methodology of a Science paper: Hess, M., A. Sczyrba, R. Egan, T.-W. Kim, H. Chokhawala, G. Schroth, S. Luo, et al. “Metagenomic Discovery of Biomass-Degrading Genes and Genomes from Cow Rumen.” Science 331, no. 6016 (January 27, 2011): 463–467.

But I ran into difficulty with the tetranucleotide frequency calculation. I did find TETRA, but the software could not handle my dataset of 3G of data.

I have no informatics background, but it seems like quite a simple algorithm, so maybe it has already been implemented in some software package. Please let me know if you know of any tool that could do this, and if possible tell me the basic how-to; otherwise it is too hard for me to get into the language. Thanks a lot.


7.7 years ago by
Sean Davis26k
National Institutes of Health, Bethesda, MD
Sean Davis26k wrote:

You might want to look at the Bioconductor Biostrings package and in particular, the oligonucleotideFrequency method.


Thanks for your quick reply. Could you provide a link to the documentation, or tell me specifically which command I should use for the tetranucleotide frequency calculation? Thanks.

written 7.7 years ago by Shuixia100120

Edited post to include the appropriate method.

written 7.7 years ago by Sean Davis26k
7.0 years ago by
cts1.6k wrote:

Hey, we use tetranucleotide frequencies a lot in our lab. One of our postdocs wrote a simple script that will calculate the frequencies and give you a simple table.

You can find the source code and install instructions here
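
For readers who just want the idea, here is a minimal Python sketch of this kind of calculation. It is not the script mentioned above: the function names (`read_fasta`, `tetra_frequencies`, `write_table`) and the tab-separated output layout are assumptions made for illustration. It reads the FASTA file one record at a time, so only a single contig is held in memory at once.

```python
from collections import Counter
from itertools import product

K = 4
# All 256 possible tetranucleotides, in a fixed column order.
KMERS = ["".join(p) for p in product("ACGT", repeat=K)]


def read_fasta(lines):
    """Yield (name, sequence) pairs from an iterable of FASTA lines."""
    name, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            parts = line[1:].split()
            name, chunks = (parts[0] if parts else ""), []
        elif name is not None:
            chunks.append(line.upper())
    if name is not None:
        yield name, "".join(chunks)


def tetra_frequencies(seq):
    """Return relative frequencies of all 256 tetranucleotides in seq.

    Overlapping windows are counted. Windows containing anything other
    than A, C, G or T (for example N) are ignored.
    """
    counts = Counter(seq[i:i + K] for i in range(len(seq) - K + 1))
    valid = {kmer: counts.get(kmer, 0) for kmer in KMERS}
    total = sum(valid.values())
    return {kmer: (n / total if total else 0.0) for kmer, n in valid.items()}


def write_table(fasta_path, out_path):
    """Write one row per contig: contig name, then 256 frequencies."""
    with open(fasta_path) as fin, open(out_path, "w") as fout:
        fout.write("contig\t" + "\t".join(KMERS) + "\n")
        for name, seq in read_fasta(fin):
            freqs = tetra_frequencies(seq)
            row = "\t".join(f"{freqs[k]:.6f}" for k in KMERS)
            fout.write(name + "\t" + row + "\n")


# Hypothetical usage (file names are placeholders):
# write_table("contigs.fasta", "tetra_table.tsv")
```

The resulting table can be loaded into R or any clustering tool. Note that this sketch counts one strand only; binning methods often also merge each tetranucleotide with its reverse complement, which is left out here.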

7.7 years ago by
doug.rusch0 wrote:

You can try this web tool which provides a visual means to analyze the data:

You can upload your own data here:

The best reference for this would be Inskeep et al., 2010, in PLoS ONE.
