I am hard filtering my first VCF, and currently exploring the scores to set my thresholds. I'm basing this mostly on the GATK best practises workflow on their website as that's how I've generated all of my VCFs.
When I extracted QD (Quality by Depth) scores, I noticed 32573 of the 323529 entries were NAs. Upon closer inspection of half a dozen or so, the score was simply not present. in the list. Below are single entries  without and  with the score;
To generate the VCFs I used GenotypeGVCFs in GATK v22.214.171.124 on a population with 6 individuals, and requested standard annotations. There appears to be a huge discrepancy between the amount of information generated. Is this correct? If not, what would you suggest to do next? It just seems like an awful lot of data to lose before I've even set thresholds.
Thanks in advance.