Question: What does paired-end sequencing really mean?
Sus wrote, 13 months ago:

Hi! I'm a student and I'm having a hard time understanding some basics of bioinformatics. I'm currently learning about alignment, filtering, variant calling and so on, so my question might look silly, but here it is anyway.

I have trouble understanding how to work with paired-end sequencing files and what it means for data to be paired-end.

After looking around on the Internet, I found an explanation of what paired-end sequencing is (tell me if I got it right):

From what I understood, paired-end sequencing is done by sequencing from A to Z and then from Z to A, which provides two distinct datasets, one for each direction.

My question is: when you run an alignment with tools like BWA, TopHat or whatever, do you have to reverse one of the two datasets or not? For instance, if I wanted to find a consensus sequence (or the position-specific scoring matrices), wouldn't the result be completely wrong if half of the data were in the wrong direction?

Completely unrelated: I've also heard that TopHat should be used over BWA for aligning RNA. Do you know why?

andrew.j.skelton73 (London) wrote, 13 months ago:

It's always easier to illustrate with an image, from here. The grey represents the fragment, and each end of the fragment is sequenced. This allows more accurate mapping, particularly in repetitive regions. There's also a great animation here that illustrates the concept of Illumina paired-end sequencing. As @h.mon stated, most programs have parameters to deal with paired-end data. And seriously, stay away from TopHat; STAR and HISAT2 are current alternatives.

[Image: diagram of a fragment (grey) with one read sequenced from each end]
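
To make the "do I have to reverse one dataset?" part concrete, here is a toy Python sketch of what the two reads of a pair look like for a standard Illumina paired-end library. The fragment sequence, the read length and the function names are made up for illustration; the only thing it is meant to show is that read 2 comes off the opposite strand, so it is the reverse complement of the far end of the fragment.

```python
from typing import Tuple

# Toy illustration of paired-end read orientation (not a real read simulator).
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def paired_end_reads(fragment: str, read_length: int) -> Tuple[str, str]:
    """Return (read1, read2) for one fragment, forward-reverse orientation."""
    # Read 1: the first bases of the fragment, as written.
    read1 = fragment[:read_length]
    # Read 2: sequenced from the other end, on the opposite strand.
    read2 = reverse_complement(fragment[-read_length:])
    return read1, read2


# Hypothetical 30-base fragment, just for the example.
fragment = "ACGTTGCAAGGCTTAACCGGATCGATTACA"
read1, read2 = paired_end_reads(fragment, 8)

print("read 1:             ", read1)
print("read 2 (as in file):", read2)
print("read 2 (rev. comp.):", reverse_complement(read2))
```

Reverse-complementing read 2 gives back the last bases of the fragment. Paired-end aware aligners do this step themselves when you hand them both files, so you normally should not flip anything by hand.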

h.mon (Brazil) wrote, 13 months ago:

Most programs already take paired-end read orientation into account, but you have to read the documentation carefully, program by program.
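
As one way to see this in practice: aligners that write SAM/BAM output record the strand and mate information of each read in the FLAG field. The sketch below decodes a few of those bits in Python. The bit values come from the SAM format specification; the dictionary and the function name are just for illustration, and only a subset of the bits is listed.

```python
# Subset of the FLAG bits defined in the SAM format specification.
SAM_FLAGS = {
    0x1: "paired",
    0x2: "proper_pair",
    0x10: "read_reverse_strand",
    0x20: "mate_reverse_strand",
    0x40: "first_in_pair",
    0x80: "second_in_pair",
}


def decode_flag(flag: int) -> list:
    """Return the names of the listed FLAG bits that are set."""
    return [name for bit, name in SAM_FLAGS.items() if flag & bit]


# 99 and 147 are a typical FLAG combination for a properly aligned pair.
for flag in (99, 147):
    print(flag, decode_flag(flag))
```

In that typical pair, the first read is on the forward strand and its mate is flagged as reverse strand, so the aligner has already worked out the orientation and written it down for downstream tools.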

Completely unrelated: I've also heard that TopHat should be used over BWA for aligning RNA, do you know why?

Don't use TopHat: there are several better programs, and it has been superseded by HISAT2 (from the same group of developers). BWA is not splice-aware and TopHat is, hence TopHat is better than BWA for aligning RNA-seq reads to a reference genome. But again, don't use TopHat.
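
To show what "splice-aware" means, here is a toy Python sketch. The exon and intron sequences are invented, and find_spliced is a hypothetical helper that only does exact string matching; real splice-aware aligners are far more sophisticated. A read taken from the mature mRNA across an exon-exon junction does not occur contiguously in the genome, but it can be placed if the aligner is allowed to split it and leave a gap for the intron.

```python
def find_spliced(read: str, genome: str, min_anchor: int = 4):
    """Try to place a read as two exact pieces separated by a gap.

    Returns (split, left_pos, right_pos) for the first split that works,
    or None. Purely illustrative: exact matches only, first hit only.
    """
    for split in range(min_anchor, len(read) - min_anchor + 1):
        left, right = read[:split], read[split:]
        left_pos = genome.find(left)
        if left_pos == -1:
            continue
        # The right piece must start downstream of the left piece.
        right_pos = genome.find(right, left_pos + split)
        if right_pos > left_pos + split:  # the gap is the candidate intron
            return split, left_pos, right_pos
    return None


# Made-up gene: two exons separated by an intron.
exon1 = "ATGGCCAAGTTC"
intron = "GTAAGTCTCTCTTTTCAG"
exon2 = "GACCTGAAGTGA"
genome = exon1 + intron + exon2
transcript = exon1 + exon2  # the intron is spliced out of the mRNA

read = transcript[6:18]  # spans the exon1/exon2 junction

print("contiguous match in genome:", read in genome)
print("split placement:", find_spliced(read, genome))
```

An aligner that only looks for contiguous matches would have to clip or mismatch the part of the read on the other side of the junction, which is why a splice-aware program is the better fit for RNA-seq reads aligned to a genome.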
