Question

Filter reads by calculating the distribution of Phred scores in each individual read

0

5.6 years ago

Nagesh ▴ 10

Dear all, I would like to know whether there is a tool or script that calculates the distribution of Phred scores within each individual read and filters out reads that contain a high proportion of low-quality bases, based on a user-supplied cut-off. Thanks in advance.

next-gen RNA-Seq sequencing • 2.2k views

updated 5.6 years ago by Devon Ryan 104k • written 5.6 years ago by Nagesh ▴ 10

1

Do you mean a preprocessing tool such as FastQC or fastp?

5.6 years ago by Bastien Hervé 5.3k

0

Try FastQC: https://www.bioinformatics.babraham.ac.uk/projects/fastqc/

5.6 years ago by Sej Modha 5.3k

0

Is bbduk what you are looking for?

bbduk.sh in=reads.fq out=clean.fq maq=10

This will discard reads with an average quality below 10. If quality trimming is enabled, the average quality is calculated on the trimmed read.

5.6 years ago by finswimmer 16k

0

Filtering on the average quality may miss reads that contain a moderate proportion of low-quality bases. For example, I want to retain only reads in which fewer than 2% of the bases have a Phred score below 20.

5.6 years ago by Nagesh ▴ 10
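
Editor's note: the per-read criterion described in the comment above (keep a read only when fewer than 2% of its bases fall below Q20) could be sketched as a small standalone script. This is only an illustration, not a method from any of the tools named in this thread. It assumes Phred+33 encoding and strict four-line FASTQ records, and the function names `low_quality_fraction` and `filter_fastq` are hypothetical.

```python
import io


def low_quality_fraction(quality_line, threshold=20, offset=33):
    """Return the fraction of bases whose Phred score is below `threshold`.

    Assumes Phred+33 encoding by default (an assumption, not stated in the thread).
    An empty quality string yields 0.0.
    """
    if not quality_line:
        return 0.0
    low = sum(1 for ch in quality_line if ord(ch) - offset < threshold)
    return low / len(quality_line)


def filter_fastq(handle, max_fraction=0.02, threshold=20, offset=33):
    """Yield (header, sequence, plus, quality) for reads that pass the filter.

    A read passes when its fraction of low-quality bases is strictly below
    `max_fraction`. Assumes each record spans exactly four lines.
    """
    while True:
        header = handle.readline().rstrip("\n")
        if not header:
            break  # end of input
        seq = handle.readline().rstrip("\n")
        plus = handle.readline().rstrip("\n")
        qual = handle.readline().rstrip("\n")
        if low_quality_fraction(qual, threshold, offset) < max_fraction:
            yield header, seq, plus, qual


# Tiny in-memory demo: 'I' encodes Q40 and '#' encodes Q2 under Phred+33.
demo = io.StringIO("@good\nACGT\n+\nIIII\n@bad\nACGT\n+\nII#I\n")
kept = [record[0] for record in filter_fastq(demo)]
print(kept)  # ['@good']
```

For real data, a dedicated tool is likely the better choice. fastp, for instance, exposes `--qualified_quality_phred` and `--unqualified_percent_limit`, which together express a similar per-read rule.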

0

Fastp has an option for that. BTW, in practice base qualities tend to be fairly bimodal.

5.6 years ago by Devon Ryan 104k

score 0 · Answer 1 · 2018-09-25

0

5.6 years ago

Devon Ryan 104k

As mentioned in the comments, bbduk and fastp can both do this.

5.6 years ago by Devon Ryan 104k