Sample based on allele frequency matching between two files
11 months ago
selplat21 ▴ 20

I have two files, each of which has the SNP_ID and its allele frequency.

File 1 has a set of 200 SNPs (a subset of my genome) with their allele frequencies, whereas File 2 contains every SNP in the genome with its allele frequency.

I need to sample 1000 SNPs from File 2 so that their allele frequencies follow the distribution seen in File 1. In essence, I need random SNPs from the larger file (File 2) that are roughly matched in allele frequency distribution to File 1.

Any help is appreciated!
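
To make the goal concrete, here is a minimal sketch of one possible approach (binned, frequency-matched sampling), not a definitive method. It assumes both files are plain text with two whitespace-separated columns (SNP_ID, allele frequency) and no header line; the function names, the choice of 10 equal-width bins, and the decision to exclude File 1 SNPs from the draw are all illustrative assumptions, not something given in the files themselves.

```python
import random
from collections import defaultdict


def read_freqs(path):
    """Read SNP_ID -> allele frequency (assumed layout: two columns, no header)."""
    freqs = {}
    with open(path) as handle:
        for line in handle:
            fields = line.split()
            if len(fields) >= 2:
                freqs[fields[0]] = float(fields[1])
    return freqs


def bin_index(freq, n_bins):
    """Map a frequency in [0, 1] to a bin; a frequency of 1.0 goes in the last bin."""
    return min(int(freq * n_bins), n_bins - 1)


def matched_sample(target, pool, n_draw, n_bins=10, seed=0):
    """Draw SNPs from `pool` whose binned frequencies mirror those of `target`.

    target, pool: dicts mapping SNP_ID -> allele frequency (target must be non-empty).
    Returns a list of SNP IDs; it is shorter than n_draw if a bin runs out of SNPs.
    """
    rng = random.Random(seed)

    # Count how many target SNPs fall in each frequency bin.
    counts = [0] * n_bins
    for freq in target.values():
        counts[bin_index(freq, n_bins)] += 1
    total = sum(counts)

    # Group candidate SNPs by bin, skipping SNPs already in the target set.
    by_bin = defaultdict(list)
    for snp, freq in pool.items():
        if snp not in target:
            by_bin[bin_index(freq, n_bins)].append(snp)

    # Split n_draw across bins in proportion to the target counts
    # (largest-remainder rounding so the quotas sum to n_draw).
    exact = [n_draw * c / total for c in counts]
    quota = [int(x) for x in exact]
    leftover = n_draw - sum(quota)
    order = sorted(range(n_bins), key=lambda b: exact[b] - quota[b], reverse=True)
    for b in order[:leftover]:
        quota[b] += 1

    # Sample without replacement inside each bin.
    picked = []
    for b in range(n_bins):
        candidates = sorted(by_bin[b])  # sorted so a fixed seed is reproducible
        k = min(quota[b], len(candidates))
        picked.extend(rng.sample(candidates, k))
    return picked
```

With hypothetical file names, usage would look like `matched_sample(read_freqs("file1.txt"), read_freqs("file2.txt"), 1000)`. Narrower bins give a closer match to the File 1 distribution, but with only 200 SNPs in File 1 many narrow bins would be empty or noisy, so the bin count is a trade-off worth checking.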

linkage • 338 views