Question: The effect of sample size and relatedness in target sample in polygenic risk score
gravatar for javadjamshidi
15 months ago by
javadjamshidi40 wrote:

Hi every one

I'm using PRSice v1.25 to calculate the polygenic risk scores in my sample ( 1400 twins). My phenotype is wellbeing ( quantitative) and I'm using the summary stats of a very large GWAS as my base data ( The problem is that my PRS seems to be a little inflated. I have two questions 1. Does the relatedness between individuals in the target sample influence the PRS? 2. Does the target sample size influence the PRS? ( how?)

Thank you!

ADD COMMENTlink modified 14 months ago by Sam2.7k • written 15 months ago by javadjamshidi40

In general, i saw people remove related samples from the analysis.

Sample size of GWAS or PRS bins ?

ADD REPLYlink written 15 months ago by geek_y11k

Yes, If you want to run a GWAS or an association study you need to remove the related samples, but I am not sure about the PRS, if it needs to remove the related samples or not. The sample size of the PRS ( the target sample).

ADD REPLYlink written 14 months ago by javadjamshidi40
gravatar for Sam
14 months ago by
New York
Sam2.7k wrote:

You will definitely want to remove the related samples from your dataset, and I will suggest you move onto PRSice-2 instead of PRSice 1.25. We have made some major improvement of the software between the two iterations.

As for the effect of sample size, you might want to read Dudbridge's paper

ADD COMMENTlink written 14 months ago by Sam2.7k

Thanks Sam! I divided my twins into two groups, each group includes independent samples and I calculated the PRS again for each group. The R-squared from the whole sample (with related individuals_twins) PRS seems to be the average of the two groups. AND. Do we need to include covariates such as age, sex, and principal components while calculating PRS using PRSice?

ADD REPLYlink modified 14 months ago • written 14 months ago by javadjamshidi40

The inclusion of covaraites will have no effect on the PRS calculated. However, it will affect the R2 and p-value, therefore might influence the "best" p-value threshold. It is generally a good idea to include the relevant covariates when performing PRS just so that the R2 and p-value are more relevant

ADD REPLYlink written 13 months ago by Sam2.7k

Hi~Sam, I also want to know whether it is feasible to use 446 subjects as target data. After I removed related samples from my data, I only have 446 subjects.

ADD REPLYlink modified 5 months ago • written 5 months ago by hexiaoxi.2312320

Yes, you can definitely try. You can maybe also look at PRS power calculation tools such as AVENGEME

ADD REPLYlink written 5 months ago by Sam2.7k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1931 users visited in the last hour