Features:

Question

Tool:fastp, the ultra-fast FASTQ preprocessing tool, is now on BioConda

9

Entering edit mode

6.3 years ago

chen ★ 2.5k

fastp is a tool designed to provide ultra-fast all-in-one preprocessing for FastQ files. This tool is developed in C++ with multithreading supported to afford high performance.

Features:

filter out bad reads (too low quality, too short, or too many N...)
cut low quality bases for per read in its 5' and 3' by evaluating the mean quality from a sliding window (like Trimmomatic but faster).
trim all reads in front and tail
cut adapters. Adapter sequences can be automatically detected,which means you don't have to input the adapter sequences to trim them.
correct mismatched base pairs in overlapped regions of paired end reads, if one base is with high quality while the other is with ultra low quality
preprocess unique molecular identifer (UMI) enabled data, shift UMI to sequence name.
report JSON format result for further interpreting.
visualize quality control and filtering results on a single HTML page (like FASTQC but faster and more informative).
split the output to multiple files (0001.R1.gz, 0002.R1.gz...) to support parallel processing. Two modes can be used, limiting the total split file number, or limitting the lines of each split file.
support long reads (data from PacBio / Nanopore devices).
...

Now you can easily install it with BioConda: conda install -c bioconda fastp

For more information about fastp, please take a look at the Github project: https://github.com/OpenGene/fastp

fastp • 7.4k views

ADD COMMENT • link updated 13 months ago by Ram 43k • written 6.3 years ago by chen ★ 2.5k

0

Entering edit mode

Hi All, Is there anyway I can run fastp in batch ? I have fastq from 100 PE samples.

Thanks D

ADD REPLY • link 5.8 years ago by deep.pandya • 0

1

Entering edit mode

See this post for inspiration on how to write the loop: Bash Script Loop Help

Please don't use SUBMIT ANSWER to ask new questions.

ADD REPLY • link 5.8 years ago by GenoMax 141k