Question: Merging WGS and SNP array data
sankar200410 wrote:


I combined SNP array and WGS data and plotted a PCA. I found that individuals from the same population did not cluster together, because the SNVs were obtained from the two different genotyping methods mentioned above. How do you remove this bias/discrepancy from these datasets?

Thanks a lot, Kevin. I will try that.

Kevin Blighe wrote:

With your array data, you will have to filter out variants that were not called on the coding (+ / plus) strand, and then filter the same variants out of the WGS data so that both datasets contain an identical set of sites. Take a look at what I have written for Step 6, here: Produce PCA bi-plot for 1000 Genomes Phase III - Version 2

You can download the library files from the array manufacturer to determine which strand each probe genotypes.
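As a rough illustration of the filtering described above, here is a minimal Python sketch. It assumes the manufacturer's library file has been exported to CSV with one row per probe; the column names `snp_id` and `strand`, the example IDs, and both function names are hypothetical, so adjust them to whatever your array's annotation file actually uses.

```python
import csv
import io

# Hypothetical manifest excerpt. Real library files differ by manufacturer,
# so the column names and IDs here are placeholders only.
MANIFEST_CSV = """snp_id,strand
rs0001,+
rs0002,-
rs0003,+
rs0004,-
"""


def plus_strand_ids(manifest_text, id_col="snp_id", strand_col="strand"):
    """Return the set of variant IDs whose probe is on the plus strand."""
    reader = csv.DictReader(io.StringIO(manifest_text))
    return {row[id_col] for row in reader if row[strand_col].strip() == "+"}


def shared_plus_strand_variants(manifest_text, array_ids, wgs_ids):
    """Keep variants that are plus-strand probes AND present in both datasets."""
    keep = plus_strand_ids(manifest_text)
    return sorted(keep & set(array_ids) & set(wgs_ids))


# Variant IDs as they might appear in each dataset (made-up values).
array_ids = ["rs0001", "rs0002", "rs0003", "rs0004"]
wgs_ids = ["rs0001", "rs0002", "rs0003", "rs0005"]

keep_list = shared_plus_strand_variants(MANIFEST_CSV, array_ids, wgs_ids)
print("\n".join(keep_list))
```

The resulting list can be written to a text file, one ID per line, and used to subset both datasets before merging, for example with PLINK's `--extract` option. This assumes the variant IDs match between the array and WGS files; if they do not, you would need to match on chromosome and position instead.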


