Hello, in a whole-exome dataset of healthy individuals I am working with, I noticed some high frequency protein truncating variants. I find this strange and confusing, since I generally expect PTV to be deleterious and thus low frequency. For instance, rs72559129 is a frameshift variant with allelic frequency that is almost 1, both in my dataset and in gnomAD. How do I interpret this? Which allele is functional, the rare one, the common one, both? This is just one example but there are dozens. Is this due to a problem with the reference (hg19) or the transcript annotation? Am I missing something?
Any pointer is welcome.
Thanks