Question: Effects of applying BQSR twice?
gravatar for Chris Miller
4 months ago by
Chris Miller21k
Washington University in St. Louis, MO
Chris Miller21k wrote:

Does anyone have insight into what happens if you apply BQSR twice to a sample?

The case here is that we might run a pipeline that aligns and runs BQSR, then we throw away the original fastqs. If we later want to realign, we can regenerate fastqs, but the original quality scores are gone. What would the effect be of running them through a pipeline that applies BQSR a second time?

Obviously we could modify the pipeline to omit BQSR on a subsequent run, but I wonder if that's even necessary. Anyone with more insight into the underlying algorithm know what the effect would be? In general terms, is it going to be worse, or largely inconsequential?

ADD COMMENTlink modified 4 months ago by h.mon31k • written 4 months ago by Chris Miller21k
gravatar for h.mon
4 months ago by
h.mon31k wrote:

I suspect if same reference genome and known variants are used for both rounds of BQSR, the second BQSR would produce little change over the first round. If different versions of reference genome and known variants are used, there may be additional changes, but honestly it is just a guess.

This sounds like a cool little project for a quarantine, no? One could either use simulated reads, or "gold standard" data sets, to investigate the issue.

ADD COMMENTlink written 4 months ago by h.mon31k

Sure - straightforward enough to check, but i was hoping someone else had already done the comparison :)

ADD REPLYlink written 3 months ago by Chris Miller21k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 699 users visited in the last hour