11.2 years ago
kindlychung
I just received some pre-phased genotype data, said to be the output of MaCH, which looks like this:
RS1->2001 HAPLO1 AAACAAGGAGGAGAAGGAAA ...
RS1->2001 HAPLO2 CAACAAAGAGGAGAAGGAAA ...
RS1->2002 HAPLO1 AAAAAAGGAGGAAAAGGAAA ...
RS1->2002 HAPLO2 CAACAAGGAGGAAGCAGAGC ...
Note that there are no T's in it. I understand that A pairs with T, so every T could be denoted by A, but in that case, why do both C and G appear here?
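To check which allele letters actually occur, here is a minimal Python sketch. It assumes each line has at least three whitespace-separated fields (sample ID, haplotype label, allele string), which matches the excerpt above but is not confirmed against the MaCH documentation. The function name `count_alleles` is made up for illustration, and the sample contains only the truncated prefixes shown in the post, not the full file.

```python
from collections import Counter

def count_alleles(lines):
    """Tally every character in the third whitespace-separated field.

    Assumed layout (taken from the excerpt, not from MaCH docs):
    <sample id> <haplotype label> <allele string> [...]
    """
    counts = Counter()
    for line in lines:
        fields = line.split()
        if len(fields) < 3:
            continue  # skip blank or malformed lines
        counts.update(fields[2])  # count each allele letter
    return counts

# Truncated haplotype prefixes copied from the post (the real rows are longer).
sample = [
    "RS1->2001 HAPLO1 AAACAAGGAGGAGAAGGAAA",
    "RS1->2001 HAPLO2 CAACAAAGAGGAGAAGGAAA",
    "RS1->2002 HAPLO1 AAAAAAGGAGGAAAAGGAAA",
    "RS1->2002 HAPLO2 CAACAAGGAGGAAGCAGAGC",
]

print(sorted(count_alleles(sample)))  # ['A', 'C', 'G']
```

Running the same function over every line of the full file would show whether T is absent everywhere or only in the part shown here.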