Question: what read should i use for classifying?
gravatar for life99945
22 months ago by
life9994520 wrote:


I have forward, reverse and extended illumina myseq 16s rRNA v4-v6 amplicon reads. Length of v4-v6 region is about 520-530 bp while my forward and reverse reads length is only 250 bp each. I checked all three of them in fastqc and it appears that forward read is the best. Should i use forward read for classifying or it is better to use an extended, even if it has worse quality.

p.s. How is it possible to get an extended file when forward and reverse reads are not enough long for crossing? And why extended read quality is so bad in the end? Does not the merged file should have a degradation in the middle?

Fastqc reports attached (i don't know how to add html file properly, so i just added these links, sorry for that):




Thank you!

ADD COMMENTlink modified 22 months ago • written 22 months ago by life9994520

I've never used it, so I can't vouch for the results claimed in the paper, but in principle IM-Tornado uses information from both reads even if they don't overlap.

But how come the apparent mode of length for your extended file is 360bp, if the reads don't overlap?

edit: the quality of your reads in all files is very good and shouldn't impact much downstream analyses.

ADD REPLYlink modified 22 months ago • written 22 months ago by h.mon30k

Thank you for the answer! I'v got this extended file from laboratory along with forward and reverse reads, and i should ask them how they made it, but what if there is no big need in extended file? Maybe i can use only forward to classify my sample or it will increase the inaccuracy of the classification?

ADD REPLYlink modified 22 months ago • written 22 months ago by life9994520
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1278 users visited in the last hour