Hello,
I would like to obtain allele frequency by subpopulation for 102 SNPs.
The particular subpopulations I am interested in are ASW (African Americans from the South West), YRI, and CEU.
I would like to be able to batch submit them, if possible; I know I can do this through dbsnp, but sometimes the number of individuals used to assign the MAF gives me pause. The list of SNPs is here...
rs187786174
rs227163
rs2301888
rs28411352
rs12140275
rs2476601
rs624988
rs2228145
rs2317230
rs4656942
rs72717009
rs75409195
rs2105325
rs17668708
rs10175798
rs34695944
rs13385025
rs1858037
rs9653442
I'd suggest not pasting the entire list here, because it seems to convey the idea that you want us to do the work for you. If you wish to point out specific cases, please do that instead.
Also, please elaborate what you mean by "number of individuals gives you pause"
At this URL you can see an example of what I mean.
This SNP has been around a while, so the "population diversity" table is quite large. For other SNPs, it is much smaller. In that case, the sample chromosome count is often only 2 or 8 or 10... I'd rather be pulling from somewhere that has at least 100 chromosomes for a given ethnicity, if that is not possible of course I will take what I can get.
Well then you're left with two options: dbSNP batch query or a really REALLY long procedure that will give you a table of variant, pop[1..N]_alleleFreq, pop[1..N]_alleleCount
Unless you're looking at frequent queries of this kind, the second approach is not worth the effort.
EDIT: VEP is definitely a viable option. It is your best option, I think.