Question: Identifying De Novo Mutations from vcf files
0
gravatar for jiangqi_1996
8 months ago by
China
jiangqi_19960 wrote:

Hi,

I'm new here,now I have three vcf files(father,mother and child),how can I identify de novo mutations from these three files? I am trying GATK PhaseByTransmission,however I don't know how to convert three vcf files to one ped file. Can I make a try to program for the aim?

Thanks.

snp • 672 views
ADD COMMENTlink modified 8 months ago by Len Trigg1.2k • written 8 months ago by jiangqi_19960
1

Check out plink and plink/seq, especially pseq denovo. plink can create a plink ped file from a VCF file.

ADD REPLYlink written 8 months ago by RamRS21k

Thank you for your reply, i will try it.

ADD REPLYlink written 8 months ago by jiangqi_19960
3
gravatar for Pierre Lindenbaum
8 months ago by
France/Nantes/Institut du Thorax - INSERM UMR1087
Pierre Lindenbaum119k wrote:

I have three vcf files(father,mother and child)

merge those files with GATK combine variants https://software.broadinstitute.org/gatk/documentation/tooldocs/3.8-0/org_broadinstitute_gatk_tools_walkers_variantutils_CombineVariants.php

Identifying De Novo Mutations from vcf files

use gatk https://software.broadinstitute.org/gatk/documentation/tooldocs/3.8-0/org_broadinstitute_gatk_tools_walkers_annotator_VariantAnnotator.php with -A PossibleDeNovo ( https://software.broadinstitute.org/gatk/documentation/tooldocs/3.8-0/org_broadinstitute_gatk_tools_walkers_annotator_PossibleDeNovo.php)

and --pedigree your.ped https://software.broadinstitute.org/gatk/documentation/tooldocs/3.8-0/org_broadinstitute_gatk_engine_CommandLineGATK.php#--pedigree

I don't know how to convert three vcf files to one ped file.

you don't need the VCF, only 3 lines in a text file. See https://software.broadinstitute.org/gatk/documentation/article.php?id=7696

ADD COMMENTlink written 8 months ago by Pierre Lindenbaum119k

Thanks a lot! Now i am running PossibleDeNovo.

ADD REPLYlink written 8 months ago by jiangqi_19960
0
gravatar for Len Trigg
8 months ago by
Len Trigg1.2k
New Zealand
Len Trigg1.2k wrote:

Here's how I would do it with RTG Tools. This assumes your samples are named "father", "mother", "son" with their calls contained in block-compressed, tabixed VCFs named father.vcf.gz, mother.vcf.gz, son.vcf.gz respectively:

rtg vcfmerge father.vcf.gz mother.vcf.gz child.vcf.gz \
  --add-header "##PEDIGREE=<Child=son,Mother=mother,Father=father>" \
  --add-header "##SAMPLE<ID=son,Sex-MALE>" \
  --output trio.vcf.gz
rtg mendelian -t /path/to/referencegenome.sdf --input trio.vcf.gz \
  --lenient --output-inconsistent trio-non-mendelian.vcf.gz

Adjust appropriately if your child is female or for what your particular sample names are. The reference genome is used to adjust the Mendelian inheritance rules appropriately for the sex chromosomes, and is created as a one-off process via:

rtg format -o /path/to/referencegenome.sdf /path/to/referencegenome.fasta

(For typical human reference genomes the sex chromosomes will be automatically recognized by the format command)

You should also be aware of the fact that when your samples have been called separately, you can end up with some variation in the representation of variants (particularly complex variants involving indels or several variants in close proximity) in the VCF that may make it look like there is mendelian inconsistency when there actually is not.

The ideal solution to this is to jointly call all the members of your trio at once (preferrably with a pedigree aware caller like those in RTG Core) to ensure the variants are consistently represented across the trio. The next best solution to deal with this is to use a Mendelian comparison tool that is aware of the representation issue, such as VBT. The next best solution is to apply external decomposition and normalization tools (and there are many of these, included in tools such as vt, vcflib, bcftools) to the input VCFs prior to comparison.

ADD COMMENTlink written 8 months ago by Len Trigg1.2k

Thank you for your answer, maybe this is another way I can make a try.

ADD REPLYlink written 8 months ago by jiangqi_19960
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 931 users visited in the last hour