Entering edit mode
19 months ago
rogerdavis2609 ▴ 20
I have started working on UK Biobank and I am completely clueless about the data structure. Is there is a tutorial (apart from the one provided by UK Biobank) where I can unsderstand the schema better?
How long have you spent on it? Which UK Biobank data have you retrieved?
Its been more than 2 weeks now, but I am very confused with the data structure. I have a few SNPs and all I want to know for the first instance is how many patients (suppose with lupus) are there that have that SNP in the Biobank. I am very new to GWAS.
I see, which supervisor threw you into this deep water? Do you have the PLINK dataset, or the VCF dataset?
:) ....I have VCF files.
I may suggest that you convert the VCF to PLINK format, and then do everything in PLINK. The UK Biobank data should already be available for you in PLINK format, though (?)
PLINK's documentation is excellent, but there will obviously be a learning curve if you are starting from scratch.