Question: Simple Question About .Ped Format For Gatk And Plink
6.2 years ago, el2622 wrote:

The documentation states that in a pedigree file, the first column is the family ID, the second is the proband ID, the third is the father ID, and the fourth is the mother ID. But what actually separates the ID numbers?

For example, let's say that I have these samples: proband = 1000, father = 1000-01, mother = 1000-02.

In this case, would the family ID be 1000? Would that mean the proband ID is 0? And would the father ID be 01, or would it be -01?

Thanks so much for your help.

exome gatk plink ped • 2.3k views
6.2 years ago, jxchong (Postdoc at the University of Washington) wrote:

The delimiter is whitespace (spaces or tabs). This link will help: http://pngu.mgh.harvard.edu/~purcell/plink/data.shtml#ped

In particular:

The PED file is a white-space (space or tab) delimited file: the first six columns are mandatory:

 Family ID
 Individual ID
 Paternal ID
 Maternal ID
 Sex (1=male; 2=female; other=unknown)
 Phenotype
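As a rough illustration of what "white-space delimited" means in practice, here is a minimal Python sketch that splits one PED line into the six mandatory columns. The names `PedRecord` and `parse_ped_line` are made up for this example and are not part of PLINK or GATK; anything after the sixth column is simply kept as a list of genotype fields.

```python
from typing import List, NamedTuple


class PedRecord(NamedTuple):
    """The six mandatory PED columns, plus any trailing genotype fields."""
    family_id: str
    individual_id: str
    paternal_id: str
    maternal_id: str
    sex: str
    phenotype: str
    genotypes: List[str]


def parse_ped_line(line: str) -> PedRecord:
    """Split one PED line on whitespace (hypothetical helper)."""
    # str.split() with no argument splits on any run of spaces or tabs,
    # so mixed delimiters are handled the same way.
    fields = line.split()
    if len(fields) < 6:
        raise ValueError(f"expected at least 6 columns, got {len(fields)}")
    return PedRecord(*fields[:6], fields[6:])
```

For instance, `parse_ped_line("1\t1000 1001 1002 -9 -9")` gives a record whose `individual_id` is `"1000"` and whose `paternal_id` is `"1001"`, regardless of whether tabs or spaces separate the columns.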

You have to convert your sample IDs to ped format. What I would do is:

1000 0 01 02 -9 -9
1000 01 0 0 1 -9
1000 02 0 0 2 -9

You will want each of these on its own line, I think, like so (assuming we don't know the sex of the proband):

1 1000 1001 1002 -9 -9
1 1001 0 0 1 -9
1 1002 0 0 2 -9
Reply by Matt Shirley, written 6.2 years ago
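If you have many trios, the rows above could be generated rather than typed by hand. The following is only a sketch: `trio_to_ped_rows` is a hypothetical helper, and it assumes you have already chosen the family and individual IDs yourself. It follows the conventions used in this thread: 0 for an unknown parent, 1 for male, 2 for female, and -9 for an unknown sex or phenotype. Giving both parents the same phenotype value as the proband is a simplification for the example.

```python
def trio_to_ped_rows(family_id, proband, father, mother,
                     proband_sex="-9", phenotype="-9"):
    """Return three space-delimited PED rows for one trio (hypothetical helper)."""
    for value in (family_id, proband, father, mother):
        # IDs cannot contain whitespace, because whitespace is the delimiter.
        if not value or any(ch.isspace() for ch in value):
            raise ValueError(f"invalid ID: {value!r}")
    rows = [
        (family_id, proband, father, mother, proband_sex, phenotype),
        (family_id, father, "0", "0", "1", phenotype),  # founder, male
        (family_id, mother, "0", "0", "2", phenotype),  # founder, female
    ]
    return [" ".join(row) for row in rows]


if __name__ == "__main__":
    # One row per individual, each on its own line.
    print("\n".join(trio_to_ped_rows("1", "1000", "1001", "1002")))
```

Called with `("1", "1000", "1001", "1002")`, this reproduces the three rows shown in the reply above.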