autosomal ancestry inference
Entering edit mode
4.0 years ago
J.F.Jiang ▴ 880

Hi all,

I am very interested in ancestry inference.

  1. autosomal ancestry I've followed the tutorial from

However, decision on cluster information K is quite confusing.

For example, there are already 12 populations among my data. How can I train these data so that I can fully separate the populations.

  1. Y haplogroup inference yhaplo offered by 23andme can do this, however, the ISOGG in this repo is too old. And the author did not update them more. So I am wondering if there is any other similar tool that can predict the haplogroup as well as building the ISOGG tree myself.


GWAS ancestry • 1.5k views
Entering edit mode

Since you are already using admixture software, check out its manual:

I am not sure why the link is not opening, probably some updation is going in the server where it is stored, but, in the document, an explanation has been provided on how to select cluster information K through estimation of cross-validation error.

Entering edit mode
4.0 years ago

There are many ways to do it. You can identify haplotype-tagging SNPs via linkage disequilibrium (LD) or you can find another way, e.g., via principal components analysis. Here, I relate to 3 such methods:

In the first case, the sensitivity/specificity of the model trained on 1000 Genomes data is >98%. I use this model to predict the ethnicity of unknown samples that I receive.


Login before adding your answer.

Traffic: 1604 users visited in the last hour
Help About
Access RSS

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6