Question: GATK hg19 bundle question
0
gravatar for genetic
18 months ago by
genetic40
United States
genetic40 wrote:

I've downloaded GATK hg19 bundle datasets. Can I use 1000G_phase1.indels.hg19.sites.vcf instead of 1000G_phase1.indels.hg19.vcf? What is difference between these 2 files?

1000G_phase1.indels.hg19.vcf 1000G_phase1.indels.hg19.sites.vcf

Mills_and_1000G_gold_standard.indels.hg19.vcf Mills_and_1000G_gold_standard.indels.hg19.sites.vcf

Thank you in advance. MH

gatk • 819 views
ADD COMMENTlink modified 18 months ago by Biostar ♦♦ 20 • written 18 months ago by genetic40

copy/pasted from https://gatkforums.broadinstitute.org/gatk/discussion/1826/indelrealigner-realignertargetcreator-known-site-bundle-files:

the difference between the .vcf and the .sites.vcf files: the .vcf files contain the full callset info including genotypes, while the .sites.vcf files don't contain the genotypes, only the variant sites info. The point of having sites-only files is that they're smaller files

FAQs (https://software.broadinstitute.org/gatk/documentation/article.php?id=1247) mentioned in the same post will be of great help.

ADD REPLYlink modified 18 months ago by RamRS21k • written 18 months ago by cpad011211k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2018 users visited in the last hour