In order to calculate coverage per bait is the calculation: bait reads / (bait length 100)? That seems a bit off when I put real #'s to it (158 / (153100) = 0.01. Thank you :).
EDIT:
( read count * read length ) / length of area in question
so if the average reads for a bait were 158 at a read length of 150 and the bait length was 153
(158 * 150) / 153 = 155x
so what's the problem now?
I use an awk script to calculate the average number of reads and the length of each bait. Below are 14 baits that all map to the PTPN11 gene. So, since I am new to exome analysis, is it safe to assume that for bait one
(158 * 150) / 153 = 155x
and bait 2(220 * 150) / 225 =147x
bait 3(228 * 150) / 223 = 153x
. Would it be more useful to calculate the average coverage per bait as well as the average coverage per gene? So is the example lets say PTPN11 had 3 baits in it and the coverage (155, 147, and 153) mean that PTPN11 has 152x coverage. Thanks :).I am not quite sure where this is going.
More useful, for what?