How does Plink choose which SNP is Allele1 and Allele2?
1
0
Entering edit mode
7.7 years ago
devenvyas ▴ 740

I am was going back and double checking some old work, namely to confirm some Plink files had been merged properly (as when I was new to Plink I was doing some unorthodox stuff to get things to merge since my datasets had different IDs for the same SNP in thousands of cases).

Anyways, to my point, I've noticed at difference stages of my project across several BIM files, for any given chromosome coordinate, sometimes there are certain cases where Plink made one allele allele1 and then later made the alternate allele1

For one line once read

1 rs10915428 0 4158955 G T

but then after adding more samples to the dataset it became

1 rs10915428 0 4158955 T G

This made me want to know, how does Plink determine, which alleles becomes 1 and which becomes 2?

plink R • 5.0k views
ADD COMMENT
1
Entering edit mode
7.7 years ago

Plink 1.x normally sets the major allele to A2, and the minor allele to A1. It isn't really possible to do better with .ped + .map filesets which don't specify reference alleles.

Plink 1.9's --a1-allele and --a2-allele commands give you more control over this. --keep-allele-order also comes in handy often, though it's really annoying to have to remember to use it.

Plink 2.0 keeps track of REF/ALT alleles and flags "provisional" REF alleles (e.g. major alleles from a .ped + .map fileset).

ADD COMMENT
0
Entering edit mode

So basically, whenever I add or subtract enough sequences for certain SNPs, the major and minor alleles could switch depending on which one is p > .5?

ADD REPLY
0
Entering edit mode

Also, is there anyway that one could set it that a given homozygous individual's allele is always allele1

ADD REPLY

Login before adding your answer.

Traffic: 2628 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6