Question: PAR-clip T to C mutations becoming T to G in minus strand?
0
gravatar for t.candelli
4.6 years ago by
t.candelli60
France
t.candelli60 wrote:

I am looking at some PAR-clip data. trimming and mapping went perfectly, however, when i look at the data on IGV, in the plus strand i have a vast majority of T to C mutations (as expected), but in the minus strand i see a lot of T to G mutations, like a vast predominance.

is there any way to explain this? how is it possible that only the minus mapping reads are affected? there must be a technical issue somewhere!

igv mapping transversion par-clip • 1.5k views
ADD COMMENTlink modified 4.6 years ago by Istvan Albert ♦♦ 80k • written 4.6 years ago by t.candelli60
0
gravatar for Istvan Albert
4.6 years ago by
Istvan Albert ♦♦ 80k
University Park, USA
Istvan Albert ♦♦ 80k wrote:

Weird things do happen all the time. Since all alignments are (should be) represented on the forward strand the mutation should be shown relative to that strand as well, here it just happens to look like complement but it is most likely caused by more subtle errors.

there are papers on this subject, seach for strand bias in snp calling

The effect of strand bias in Illumina short-read sequencing data BMC Genomics 2012, 13:666

(whoops I may have jumped ahead of myself,  I don't know what PAR-Clip is, I assume it still applies ;-) )

ADD COMMENTlink modified 4.6 years ago • written 4.6 years ago by Istvan Albert ♦♦ 80k

yeah i thought a long while about this and realized that in SAM files the sequence is always given on the forward strand, irrespective of whether it maps to the reverse. this flipping makes it so the T->C in the reverse become A->G in the forward.

i am relatively convinced by this but i would like confirmation. i really thought IGV would be smart and report the correct mismatch depending on which strand the read maps to, but it seems it's not the case, unless i misunderstood something.

ADD REPLYlink written 4.6 years ago by t.candelli60

IGV is a visualizer and it shows alignments as they are produced.  I

Showing a different mismatch depending on the strand would be actually counterproductive as the DNA is complementary so there is nothing to be gained from showing one event as being two different ones - it would only add to the information overload.

ADD REPLYlink written 4.6 years ago by Istvan Albert ♦♦ 80k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1703 users visited in the last hour