Question: difference between 2 software of quality trimming
0
gravatar for Aurélien Béliard
4.2 years ago by
Paris
Aurélien Béliard10 wrote:

hi,

I am trying to improve our DNA sequencing work-flow. I choose a software called AdapterRemoval for quality trimming and merging overlapping reads. I try to benchmark AdapterRemoval against the previous software called sickle with the same parameter and I don't understand why I have so many difference.

this is the sickle command with the minimum quality of 20 and minimum read length of 30 :

sickle pe -t illumina -f multiplex_bad_contam_1.fq -r multiplex_bad_contam_2.fq -q 20 -l 30 -o trimmed_multiplex_bad_contam_1_si.fq -p trimmed_multiplex_bad_contam_2_si.fq -s removed_sample_si.fq

This is the AdapterRemoval command the minimum quality and minimum length are the same:

AdapterRemoval --file1 multiplex_bad_contam_1.fq --file2 multiplex_bad_contam_2.fq --qualitybase 64  --trimqualities --minquality 20 --minlength 30 --output1 trimmed_multiplex_bad_contam_1_ar.fq --output2 trimmed_multiplex_bad_contam_2_ar.fq --singleton removed_sample_ar.fq

example of difference between the trimming software result

if someone have an idea it would be great.

thank for your help,

Aurel

ps: I apologise for the English mistakes but I am not naturally English speaker.

dna alignment software error • 1.2k views
ADD COMMENTlink written 4.2 years ago by Aurélien Béliard10

It's extremely likely that these tools work somewhat differently for determining exactly where to trim things (all trimmers works slightly differently), so I'm not surprised that you get different results with the same settings. You'll need to look through the code for each if the exact reasons are really important to you.

ADD REPLYlink written 4.2 years ago by Devon Ryan95k

It could be worth running post trimming process, and compare end results: length [mean, median, max, min], read/contigs, and other attributes from both tools. Just a thought. Different tools have different approach, same parameters may or may not yield same results.

ADD REPLYlink modified 4.2 years ago • written 4.2 years ago by Bioinformatics_NewComer320
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 910 users visited in the last hour