Difference between SNParray genotype dataset and WES genotype dataset
1
0
Entering edit mode
6.4 years ago
boymin2020 ▴ 80

Hi all,

I have handled many genotype datasets from SNP-array dataset or WES dataset. I found the standard PLINk QC methods for GWAS cannot be applied to genotype dataset from WES simply. For example, relatedness check step will screen more outliers. The heterozygous rate scale is smaller. It is kind of hard for me to understand the reason. Many people told me if you have WES or WGS data, no need to have SNP array data. I think there may be some critical difference. Any explanation will be appreciated.

Thanks.

RNAarray WES genotype GWAS • 2.4k views
ADD COMMENT
1
Entering edit mode
6.4 years ago

A genotyping array will typically look at SNP positions spaced roughly evenly across the entire genome, irrespective of intergenic or coding region. WES, obviously, then, just looks at the coding regions but is far superior for screening coding variants than an array. Also, for things like burden testing and novel variants discovery, WES is far superior.

Specifically for relatedness, though, the key difference would be that intergenic regions (and thus intergenic SNPs) are less conserved than coding regions, whereas exons are highly conserved. Thus, you could argue that performing relatedness on just coding regions using WES is biasing the results. Thus, you will notice differences in relatedness results between WES and a genome-wide genotyping array.

Hope that this helps

Kevin

ADD COMMENT

Login before adding your answer.

Traffic: 2326 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6