Problem to remove subset of patients with plink
0
0
Entering edit mode
6.2 years ago
Ginevra ▴ 10

Hi all, I am trying to remove ID patients from my data and I am using the original PED file for doing that. I create a .txt file with the number of ID family and ID patients that I want to remove put in two columns, but it still doesn't work. The analysis seems to go until the end of the process (creating temporary files) when appears the message saying: Error: duplicates ID.

My command is: $ ./plink --file name --remove IDlist.txt --out subset2 --make-bed

And my IDlist.txt is:

1 2204 
2 1146

So I know I have few duplicates but I don't understand why the presence of duplicates does not allow the removing process.

plink PED • 2.5k views
ADD COMMENT
0
Entering edit mode

You have encoded the IDlist.txt correctly; however, you should follow-up on the error message. It implies that either your IDlist.txt file or your PED file contains duplicate entries, i.e., the same individual duplicated.

Check the FID and IID of your PED file.

ADD REPLY
0
Entering edit mode

Just wanted to check if this issue has been resolved. I tried to remove duplicate samples by providing PLINK with an ID list that includes all the duplicates I want to remove. However, it still gives me an error message (Error: duplicate IDs) during this process.

ADD REPLY
1
Entering edit mode

The first step has to be taken outside of PLINK here. E.g. append "_1" and "_2" to the duplicated IDs in the .ped or .fam file, then mention both in the --remove ID list.

ADD REPLY

Login before adding your answer.

Traffic: 2229 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6