Need Clarification On The Formula For Base Alignment Quality (BAQ) When Computed Via Mpileup
10.4 years ago

Dear all,

I am trying to compute the offset to base alignment quality (BAQ) myself from mpileup files (using these parameters: -l $bed_file -BQ 0 -A -d10000000).

After a bit of research I found this formula:

At the i-th read base, BAQi = Qi - (BQi - 64) where Qi is the i-th base quality.

I have the following questions:

  • Is Qi the Phred score of the base, i.e. the Q in P = 10^(-Q/10), where Q = ascii_val(char) - 33 (if Sanger format)?
  • Is BQi the same as Qi?
  • Do you always have to subtract 64 (BQi - 64), or does it depend on the sequencer used?
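
For reference, the Sanger relationship mentioned in the first question can be sketched as below. This is only an illustration of the Phred+33 decoding and of P = 10^(-Q/10); the function names are hypothetical and are not part of samtools.

```python
def phred33_to_quality(char):
    """Decode one Sanger (Phred+33) quality character into its Phred score Q."""
    return ord(char) - 33


def quality_to_error_prob(q):
    """Convert a Phred score Q into an error probability P = 10^(-Q/10)."""
    return 10 ** (-q / 10)


# Example: the character 'I' has ASCII value 73, so Q = 73 - 33 = 40.
q = phred33_to_quality("I")
p = quality_to_error_prob(q)
print(q, p)
```

Note that Q is the Phred score itself, while P is the error probability derived from it.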

I would really appreciate some help.

Thanks.

sam samtools • 2.8k views
10.4 years ago

The computation of BAQ is described in Improving SNP discovery by base alignment quality (Bioinformatics, 2010).

To be honest I don't understand it beyond noting that it does not quite look like your formula.


Thanks, that is actually useful. But I agree: if they use an HMM to calculate BAQ, then this is not at all what the formula says, which looks much simpler. The formula is described on page 6 of http://samtools.sourceforge.net/SAMv1.pdf
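
Taken literally, the formula BAQi = Qi - (BQi - 64) could be applied as in the rough sketch below. It assumes that Qi comes from a Phred+33 quality string and that BQi is the ASCII value of the i-th character of a per-read BQ string of the same length; both are assumptions on my part. The function name `baq_from_strings` is hypothetical, and this does not reproduce the HMM that the paper uses to derive the offsets in the first place.

```python
def baq_from_strings(qual, bq, qual_offset=33, bq_offset=64):
    """Apply BAQ_i = Q_i - (BQ_i - 64) position by position.

    qual: base quality string (assumed Phred+33 / Sanger encoding)
    bq:   BQ offset string, same length as qual (assumed ASCII-encoded,
          with 64 meaning "no change", as the formula implies)
    """
    if len(qual) != len(bq):
        raise ValueError("qual and bq must have the same length")
    return [
        (ord(q) - qual_offset) - (ord(b) - bq_offset)
        for q, b in zip(qual, bq)
    ]


# Example: four bases with Q = 40 ('I'); BQ characters '@', '@', 'A', 'B'
# have ASCII values 64, 64, 65, 66, i.e. offsets 0, 0, 1, 2.
print(baq_from_strings("IIII", "@@AB"))  # [40, 40, 39, 38]
```

Under this reading, the 64 is a fixed part of how the offset is encoded rather than something tied to the sequencer, whereas the 33 applies only to Sanger-encoded qualities.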
