Question: autosomal ancestry inference
gravatar for J.F.Jiang
18 months ago by
J.F.Jiang800 wrote:

Hi all,

I am very interested in ancestry inference.

  1. autosomal ancestry I've followed the tutorial from

However, decision on cluster information K is quite confusing.

For example, there are already 12 populations among my data. How can I train these data so that I can fully separate the populations.

  1. Y haplogroup inference yhaplo offered by 23andme can do this, however, the ISOGG in this repo is too old. And the author did not update them more. So I am wondering if there is any other similar tool that can predict the haplogroup as well as building the ISOGG tree myself.


ancestry gwas • 748 views
ADD COMMENTlink modified 18 months ago by Kevin Blighe54k • written 18 months ago by J.F.Jiang800

Since you are already using admixture software, check out its manual:

I am not sure why the link is not opening, probably some updation is going in the server where it is stored, but, in the document, an explanation has been provided on how to select cluster information K through estimation of cross-validation error.

ADD REPLYlink modified 18 months ago • written 18 months ago by prasundutta87360
gravatar for Kevin Blighe
18 months ago by
Kevin Blighe54k
Kevin Blighe54k wrote:

There are many ways to do it. You can identify haplotype-tagging SNPs via linkage disequilibrium (LD) or you can find another way, e.g., via principal components analysis. Here, I relate to 3 such methods:

In the first case, the sensitivity/specificity of the model trained on 1000 Genomes data is >98%. I use this model to predict the ethnicity of unknown samples that I receive.

ADD COMMENTlink modified 18 months ago • written 18 months ago by Kevin Blighe54k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1286 users visited in the last hour