Question: retrospective trio analysis from WGS SNPs
gravatar for harold.smith.tarheel
2.8 years ago by
United States
harold.smith.tarheel4.5k wrote:

We'd like to perform trio analysis of mice to detect de novo mutations (via WGS and variant calling). The complication: our mouse facility performs breeding in bulk (~15 each males and females). Mom-pup pairs are obviously straightforward. To identify dads, our strategy is to sequence all of the males and infer paternal parentage by comparing dad-pup SNPs. One method is to manually identify SNPs at heterozygous loci for discrimination, but I assume there are tools that can perform this analysis from bulk VCFs. Any suggestions?

snp variant pedigree lineage trio • 1.1k views
ADD COMMENTlink modified 2.8 years ago by andrew.j.skelton736.0k • written 2.8 years ago by harold.smith.tarheel4.5k
gravatar for andrew.j.skelton73
2.8 years ago by
andrew.j.skelton736.0k wrote:

I recently came across Peddy that I'll be trying out soon. VCF tools also has a couple of implementations in --relatedness and --relatedness2, which are based on Yang et al, and Manichaikul et al respectively. There's also KING which I think was used in ExAC.

ADD COMMENTlink modified 2.8 years ago • written 2.8 years ago by andrew.j.skelton736.0k

Thanks, Andrew. I'm using VCFtools for other metrics, so I'll probably test it first. Will report back with results.

ADD REPLYlink written 2.8 years ago by harold.smith.tarheel4.5k

Hi Andrew, I'm trying to figure out what panel of SNPs KING interrogates to calculate kinship, but this doesn't seem to be explicitly stated anywhere. Do you know what this panel is?

ADD REPLYlink modified 2.2 years ago • written 2.2 years ago by gaelgarcia180

From the paper:

The framework underlying the KING approach to relationship inference centres on modelling genetic distance between a pair of individuals as a function of their allele frequencies and kinship coefficient

I don't believe that a panel is used, rather a random selection of SNPs are used to infer the coefficient.

ADD REPLYlink written 2.2 years ago by andrew.j.skelton736.0k

Thanks, Andrew. Yes, actually it looks like it uses all of them. (See

ADD REPLYlink modified 2.1 years ago • written 2.1 years ago by gaelgarcia180
gravatar for Fabio Marroni
2.8 years ago by
Fabio Marroni2.5k
Fabio Marroni2.5k wrote:

There are several options: I list you a couple that just came to my mind.

The most rigorous one would be to use Mendel. Method 9 is pedigree selection. Basically you specify all the possible pedigrees and it tells you which one is the most likely. Cons of this approach: 1) I am not sure it is easy to specify you data structure (one trio plus 14 unrelated males) to the software. You might have to do several comparisons between trios where mom-pup are the same and dad is changing. 2) I do not think mendel takes vcf as input.

Another approach would be to use the number of Mendelian inconsistencies to find the most probable father, using for example the mendelian plugin (I never tried it. For sure GATK has something similar). Basically, you run it on all the possible trios, i.e. for each mum-pup couple you rotate all the possible dads. If the dads are not related to the mom, then one of them (the real one) should show a sensibly lower number of mendelian inconsistencies.

Be advised that either analysis you perform, you should perform it on a very reliable set of SNPs.

ADD COMMENTlink written 2.8 years ago by Fabio Marroni2.5k

Thanks, Fabio. I'll check out Mendel and report back with results if I use it.

ADD REPLYlink written 2.8 years ago by harold.smith.tarheel4.5k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1612 users visited in the last hour