Quality Trimming 454 Data - Software Recommendation
2
12
Entering edit mode
13.4 years ago

Hi,

I am looking for a powerful and flexible tool to trim 454 sequences. I would like it to be able to remove the following:

  • Tags
  • 454 adaptors
  • Low complexity regions
  • Poly-A/T
  • Low quality regions

The sequences can be either in .sff or .fasta/.qual format.

I'm keen on knowing what you guys would recommend and why :)

Cheers!

next-gen sequencing trimming software • 6.4k views
ADD COMMENT
9
Entering edit mode
13.4 years ago
Rm 8.3k

Try NGS_Backbone: It can clean sanger, 454 and illumina sequences.

http://bioinf.comav.upv.es/ngs_backbone/cleaning.html#clean-reads

and also "FASTX-Toolkit"

For Illumina data. I use fastx toolkit to do some of the above analysis which you are looking.

For example

Quality filter at Q 12 and atleast 50% good bases and End trimming with quality filter at 12 and minimum length (30 bases) of the read to retain the read.

fastq_quality_filter -q 12 -p 50 -i Input_reads.txt | fastq_quality_trimmer\
    -t 12 -l 30 -o filtered_reads.out.txt

my barcodes are 8-bases long

cat filtered_reads.out.txt | fastx_barcode_splitter.pl --bcfile mybarcodes.txt\
    --bol --exact --prefix filtered_reads.out.txt.

fastx_trimmer -f 9 -i filtered_reads.out.txt.Tag1 \
    >filtered_reads.out.txt.Tag1.txt
ADD COMMENT
0
Entering edit mode

Thank you RaghuM :) I'll dig into FASTX-Toolkit more seriously!

ADD REPLY

Login before adding your answer.

Traffic: 2483 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6