Question: Length of the amplicon after trimming
gravatar for life99945
2.3 years ago by
life9994520 wrote:


I have illumina myseq 16s rRNA amplicon reads 300bp length with 17 bp primer. After trimming length of this amplicon must be 283 bp max. BBDuk says that out of 300k sequences 298k were trimmed. Some sequences length is bigger that 283 bp. I wonder what these sequences are? Some sequences have for example 290 bp after trimming. Is it possible to amplificate sequence from part of the primer?

Thank you.

trimming • 578 views
ADD COMMENTlink modified 18 months ago by lieven.sterck10k • written 2.3 years ago by life9994520
gravatar for lieven.sterck
18 months ago by
VIB, Ghent, Belgium
lieven.sterck10k wrote:

Nothing very abnormal here.

Key thing here is that you have to think from the "other direction", the trimming of the reads usually happens on the 'end' of the read. This is due to something called read-through, meaning that your sequence reaction sequences more bases than what is in your sample and thus ends up in the primer/adapter on the other end of the read (so not the primer the seq-reaction started from, that one indeed you can't have in your read data).

Especially in the case of amplicon sequencing this happens often as the input data is rather short (or from a well defined length) , so there is a high change you end up in the primer/adapter on the other side of your read.

Combine this with the knowledge that if there are for instance only a few bases of the adapter present none of the trimming/cleaning tools will recognise these and thus they will remain in your read data given cleaned/trimmed reads of a variety of lengths.

ADD COMMENTlink written 18 months ago by lieven.sterck10k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2164 users visited in the last hour