Question: Reading Bam Files With Rsamtools: Understanding Results For Negative Strand
0
gravatar for cristianregep
6.9 years ago by
cristianregep0 wrote:

Hello everyone,

I'm sort of an "advanced beginner" when it comes to R, so this might seem like a trivial question. Suppose I'm using RsamTools to query through a Bam File. If I say I want everything between bp 3000000 and 3000500 in a certain chromosome I'll get results both on the negative and positive strand.

Suppose I get 3000010 on the negative strand with DNA string "CTCTTTTTT" ( the string is bigger but for the purpose of this questions I think it's ok ) . What does this mean from the following:

  1. Is that DNA string returned actually the positive strand complement of the matched sequence on the negative strand? or
  2. Is CTC the 5' end of the read on the negative strand? or
  3. Is CTC the 3'end of the read on the negative strand? or
  4. Did it actually match it from bp 3000010 on the negative strand? or
  5. Something else?
R bam strand • 2.7k views
ADD COMMENTlink modified 6.9 years ago by Devon Ryan94k • written 6.9 years ago by cristianregep0
2
gravatar for Devon Ryan
6.9 years ago by
Devon Ryan94k
Freiburg, Germany
Devon Ryan94k wrote:

This isn't actually specific to Rsamtools, but is, rather, a function of the SAM spec. Regardless of the strand, the sequence in SAM files is always 5'->3' starting at the indicated position. So, CTCTTTTTT... is the sequence of the + strand. The original read, then, would have ended in AAAAAAGAG (i.e., the reverse complement). It's often easiest to get a handle on this sort of thing by loading the sorted and indexed BAM file in IGV or another viewer.

ADD COMMENTlink written 6.9 years ago by Devon Ryan94k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1752 users visited in the last hour