I'm using plink for QC. I have two questions re MAF:
1. My understanding is that plink (plink --file data --freq) will estimate the MAF based on the data and accordingly in the QC it will remove SNPs based on its own MAF estimates. Is this correct?
2. What is the standard for removing SNPs based on MAF threshold? Is it MAF estimates based on the data (as above)? or is it based on a general population e.g using 1000Genome data or similar?