Hi,
I'm aware of ways to merge duplicates in a vcf, like this one. With my vcf, however, there appear to be duplicate variants with conflicting calls. These are not multiallelic sites; It's simply a case where a variant might be listed twice, but the calls are different. Example:
chr7 chr7_12345_AT_A_b38 AT A 0/0 0/0 0/1 0/1...
chr7 some_rsID_identifier AT A 0/0 0/0 0/0 0/0...
Because of the conflicting calls, I don't trust either of these entries, and I want to remove them both. I don't know how frequently this occurs, so I just want to remove any variants that have duplicates in the VCF. Unfortunately, BCFtools --rm-dup flag just keeps the first record.
I can obviously write a script to just remove both entries. Is there a tool with this functionality? Is there a flag in BCFtools that I'm missing?
Thanks!
This worked out perfectly. I don't think this option is in PLINK 1.9, because I wasn't aware of it.