How to handle duplicate SNPs in PLINK?
2
0
Entering edit mode
9.4 years ago

Hello,

I'm aligning my dataset to a reference panel, and I am finding errors about duplicate SNPs.

Error: Duplicate variant ID 'rs17800615' in --flip file.

What is the standard way of dealing with these issues? The plink documentation doesn't mention an automatic way of doing this.

Is there at least a way for me to generate a list of duplicate SNPs to exclude them later?

Thanks

SNP plink • 6.1k views
ADD COMMENT
0
Entering edit mode
9.4 years ago

You can use Unix sort + uniq to deduplicate the --flip file:

sort flipfile.txt | uniq > flipfile_no_duplicates.txt

(You should also double-check how your --flip file was generated.)

ADD COMMENT
0
Entering edit mode
7.9 years ago
yulin1988320 ▴ 60

better do it in plink

ADD COMMENT

Login before adding your answer.

Traffic: 2010 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6