Entering edit mode

7 weeks ago

andemexoax
•
0

Previously I've seen that other publications have used PowerMarker software to calculate Polymorphic Informative Content (PIC) for loci SNPs in a VCF file. However, this is outdated software (circa 2006), running only on an old version of Windows. Is there a function from a package in R that can calculate it... or a hint at how to manually perform it? Other software?

The formula from the original publication returns a scalar or a vector? I am expecting a vector or in other words a PIC value per loci.

my hack at doing it manually, where a2 is the frequency of allele 2.

```
var_freq$PIC_a2 <- (1-(var_freq$a2^2)-(1-var_freq$a2)^2)-(2*(var_freq$a2^2)*(1-(var_freq$a2^2))))
```

Is that right?