sequence length distribution
4
0
Entering edit mode
4.6 years ago
evashan96 • 0

how to check sequence length distribution of trimmed reads? (fasta). is there any tool to calculate it?

next-gen sequencing • 3.0k views
ADD COMMENT
0
Entering edit mode

What have you tried so far?

why not run FastQC (again) on your cleaned reads? that report includes a length distribution plot.

EDIT. Oh, ok, you're asking it for fasta format as input, then FastQC might not be the answer, unless you still have the fastq files as well then use those as input

ADD REPLY
2
Entering edit mode
4.6 years ago
jean.elbers ★ 1.7k

BBMap/BBTools has a readlength.sh tool that you can you to make a readlength histogram of the reads after trimming.

ADD COMMENT
0
Entering edit mode
4.6 years ago
gb ★ 2.2k

A bit of an old tool but you can also use PRINSEQ. They also have a webversion, but never used it.

website: http://prinseq.sourceforge.net/

ADD COMMENT
0
Entering edit mode
4.6 years ago

Try OSTRFPD: Multifunctional tool for genome-wide short tandem repeat analysis for DNA, transcripts and amino acid sequences with integrated primer designer link: https://github.com/vivekmathema/OSTRFPD

ADD COMMENT
0
Entering edit mode
4.6 years ago
michael.ante ★ 3.8k

Lawrence et al published FAST: FAST Analysis of Sequences Toolbox (paper | github).

There you have a tool called faslen, which annotates the fasta header with its length. To only get the length, you can run:

faslen my_test.fasta | grep \>

Cheers,

Michael

ADD COMMENT

Login before adding your answer.

Traffic: 1763 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6