Adapter Clipping
1
3
Entering edit mode
12.4 years ago
Steffi ▴ 580

I have a question regarding paired-end sequencing and adapter clipping (Solexa, HiSeq, TruSeq Protocol). I have the sequence of the "PE Read 1 Sequencing Primer" and the "PE Read 2 Sequencing Primer". Are these the sequences to clip away? And if so, at which end? 5' and 3' end?

Best, Stefanie

paired adaptor • 4.8k views
ADD COMMENT
0
Entering edit mode

As sequence data are generated and read from 5' to 3', you want to clip at the 3' side of the primers.

ADD REPLY
0
Entering edit mode

I suppose you mean to clip at the 3' side of the reads. And it never may happen that the sequencing reaction starts already within the 5' adapter?

ADD REPLY
0
Entering edit mode

It's not where the reaction starts, but where the data start that should be a concern. Sometimes, only a portion of the adapter is seen. You need to clip where adapter joins novel seq data.

ADD REPLY
1
Entering edit mode
10.7 years ago
Dan D 7.4k

In short, yes you do want to clip away the sequencing primer sequences. The question about which end to clip depends on you sequencer, but it will most often be the 3' end.

Before I embark on trimming a collection of samples, I usually run run a few FASTQ files through FastQC. It's easy to use and relatively quick. Before you run it, check the included contaminants file and see if your sequencing primers are located in there. If not, add your sequencing primers.

When you run FastQC, the report will have an "Overrepresented Sequence" section. Check that section and see if your sequence is shown there. An overrepresented sequence that matches with an entry in the contaminants file will be labeled accordingly. Based on this information you can tell your favorite trimming application which sequences to look for and remove.

ADD COMMENT

Login before adding your answer.

Traffic: 2551 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6