conceptual question about paired end sequencing
2
0
Entering edit mode
3.1 years ago
nhaus ▴ 300

Imagine I am doing paired end illumina sequencing of this simple DNA.

5'-GTACGATCGTGCATCGATCGTACGTACGTACGTAGCGTACG-3' 
3'-CGTACGCTACGTACGTACGTACGATCGATGCACGATCGTAC-5'

The first read would cover the bases similar to this (i.e. GTACGAT):

read 1    
   -------------->
5'-GTACGATCGTGCATCGATCGTACGTACGTACGTAGCGTACG-3' 
3'-CGTACGCTACGTACGTACGTACGATCGATGCACGATCGTAC-5'

Then, the strand would flip over to neighboring oligos that are on the flow cell and the second read would be synthesized.

My question now is, if the second read covers the bases like this (i.e. GCATGC):

(please ignore the underscores, I don't know how to show white space in the code sample)

   read 1                             read 2
   -------------->                <---------
5'-GTACGATCGTGCATCGATCGTACGTACGTACGTAGCGTACG-3' 
3'-CGTACGCTACGTACGTACGTACGATCGATGCACGATCGTAC-5'

Or like this (i.e. CATGCTA):

   read 1
   -------------->
5'-GTACGATCGTGCATCGATCGTACGTACGTACGTAGCGTACG-3' 
3'-CGTACGCTACGTACGTACGTACGATCGATGCACGATCGTAC-5' 
                                <-----------
                                      read 2

I would appreciate a lot if somebody could clear this up for me.

Cheers!

sequencing illumina • 776 views
ADD COMMENT
1
Entering edit mode

I've cleaned up your post for you. The formatting option you were using is the block quote option, not the code formatting option. The button for the code formatting option is the 101010 button.

code_formatting

ADD REPLY
2
Entering edit mode
3.1 years ago
GenoMax 141k

Thing to keep in mind is sequencing always happens in 5' --> 3' direction irrespective of the strand being sequenced.

ADD COMMENT
0
Entering edit mode

That makes complete sense. So its option 2. In that case I have a quick follow up question. When I align these two reads to the genome (lets say we use BWA), will read 2 be automatically converted to its reverse complement, so that it matches the direction of read 1? I am asking this, because if we download the human genome, its DNA sequence is represented as a single strand, so I guess one of the reads has to be converted.

ADD REPLY
0
Entering edit mode

Aligners will consider both strands when aligning. Some aligners will allow you to restrict alignments to specific strands, if you want that.

ADD REPLY
0
Entering edit mode
3.1 years ago
Carambakaracho ★ 3.2k

I think this is a good video to illustrate the concepts.

Disclaimer: It's an Illumina video, but I think the educational value is pretty high

ADD COMMENT

Login before adding your answer.

Traffic: 2414 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6