Question

conceptual question about paired end sequencing

0

Entering edit mode

3.1 years ago

nhaus ▴ 300

Imagine I am doing paired end illumina sequencing of this simple DNA.

5'-GTACGATCGTGCATCGATCGTACGTACGTACGTAGCGTACG-3' 
3'-CGTACGCTACGTACGTACGTACGATCGATGCACGATCGTAC-5'

The first read would cover the bases similar to this (i.e. GTACGAT):

read 1    
   -------------->
5'-GTACGATCGTGCATCGATCGTACGTACGTACGTAGCGTACG-3' 
3'-CGTACGCTACGTACGTACGTACGATCGATGCACGATCGTAC-5'

Then, the strand would flip over to neighboring oligos that are on the flow cell and the second read would be synthesized.

My question now is, if the second read covers the bases like this (i.e. GCATGC):

(please ignore the underscores, I don't know how to show white space in the code sample)

   read 1                             read 2
   -------------->                <---------
5'-GTACGATCGTGCATCGATCGTACGTACGTACGTAGCGTACG-3' 
3'-CGTACGCTACGTACGTACGTACGATCGATGCACGATCGTAC-5'

Or like this (i.e. CATGCTA):

   read 1
   -------------->
5'-GTACGATCGTGCATCGATCGTACGTACGTACGTAGCGTACG-3' 
3'-CGTACGCTACGTACGTACGTACGATCGATGCACGATCGTAC-5' 
                                <-----------
                                      read 2

I would appreciate a lot if somebody could clear this up for me.

Cheers!

sequencing illumina • 776 views

ADD COMMENT • link updated 3.1 years ago by Ram 43k • written 3.1 years ago by nhaus ▴ 300

1

Entering edit mode

I've cleaned up your post for you. The formatting option you were using is the block quote option, not the code formatting option. The button for the code formatting option is the 101010 button.

code_formatting

ADD REPLY • link 3.1 years ago by Ram 43k

0

Entering edit mode

3.1 years ago

Carambakaracho ★ 3.2k

I think this is a good video to illustrate the concepts.

Disclaimer: It's an Illumina video, but I think the educational value is pretty high

ADD COMMENT • link 3.1 years ago by Carambakaracho ★ 3.2k

score 2 · Accepted Answer · 2021-03-18

2

Entering edit mode

3.1 years ago

GenoMax 141k

Thing to keep in mind is sequencing always happens in 5' --> 3' direction irrespective of the strand being sequenced.

ADD COMMENT • link 3.1 years ago by GenoMax 141k

0

Entering edit mode

That makes complete sense. So its option 2. In that case I have a quick follow up question. When I align these two reads to the genome (lets say we use BWA), will read 2 be automatically converted to its reverse complement, so that it matches the direction of read 1? I am asking this, because if we download the human genome, its DNA sequence is represented as a single strand, so I guess one of the reads has to be converted.

ADD REPLY • link 3.1 years ago by nhaus ▴ 300

0

Entering edit mode

Aligners will consider both strands when aligning. Some aligners will allow you to restrict alignments to specific strands, if you want that.

ADD REPLY • link 3.1 years ago by GenoMax 141k