bowtie2 mapping oritation problem
1
0
Entering edit mode
11 weeks ago
Shaway • 0

Hello everyone, Sorry for post this naive sequencing question. I aligned a paired DNA sample to hg19 by bowtie2 with parameter "-q -N 1 -X 2000 --no-mixed --no-discordant". I saw some strange read pairs in my bam file like below (I deleted some ATCG for illustration). read1 can either be forward and reverse. This confused me.

reference    CCCTAACCCAACCCTAACCCAA


reference    TAACCCAACCCTAACC


My question is :

(1) bowtie2 use --fr in alignment procedure by default. Is this means read1 should come from forward strand and read2 from reverse strand?

(2) How the above phenomenon occured?

(3) If this is normal, how to determine the start and end of the DNA fragment? which is 5' and which is 3'?

Thanks everyone!

bowtie2 WGS DNA sequencing • 205 views
1
Entering edit mode

For paired end sequencing you are sequencing from both ends of a DNA fragment. If the fragment size is small, the reads can overlap each other. For example, if your fragment length is 100 bases, and you are sequencing 75 bases from each end, the forward and reverse reads will have 50 bases of overlap in the middle.

0
Entering edit mode

Yes, I know this. But why read1 can either be from forward and reverse strand. I think that read1 should be the leftmost segment of the DNA fragment.

0
Entering edit mode
reference    TAACCCAACCCTAACC


In this situation, you mean that the DNA fragment should be "ACCCAACCCTAA". But adapters has already been removed, therefore I think the DNA fragment should be "TAACCCAACCCTAACC".

1
Entering edit mode
10 weeks ago
h.mon 32k

The reference genome has only one strand just for convenience, the actual DNA molecule has two complementary strands. During library preparation, the forward and reverse adapter can link to any of the two DNA strands, hence, you will see all combinations of (read 1 / read 2) and (forward strand / reverse strand).

The decision on which strand is used for genome representation is somewhat arbitrary, but, for organisms with good cytological and genetic maps, the reference genome chromosomes are positioned to make it colinear and with the same orientation with these maps.

0
Entering edit mode

Thank you very much!