Question: How to merge different strain SNP vcf files into one combined vcf file with strain specificity.
gravatar for AlicePsyche
2.9 years ago by
United Kingdom
AlicePsyche20 wrote:


I want to do allele specific binding analysis using ChIP-seq data but I have some problems with the first step: individualized genome construction.

In mouse, I found the vcf file with strain specificity like this format:

CHROM  POS     ID      REF     ALT     QUAL    FILTER  INFO    FORMAT  129P2   129S1   129S5   AKR     A_J     BALB    C3H     C57BL   CAST    CBA

Each strain have one annotation column but I failed to find this vcf file of zebrafish(danRer7). I only got three separate SNP vcf files each regarding one specific strain(AB_strain.vcf, Tu_strain.vcf, WIK_strain.vcf). I am wondering if I could just combine them using bcftools(merge). I am not so familiar with the SNP data so any advice would be helpful.

Thanks in advance.

snp • 1.0k views
ADD COMMENTlink modified 2.9 years ago by Vivek2.2k • written 2.9 years ago by AlicePsyche20
gravatar for Vivek
2.9 years ago by
Vivek2.2k wrote:

You can merge them using GATK or VCF-merge from VCFtools as long as the strain names are specified as sample names within the VCF header. Be sure to specify the ploidy correctly if not diploid as most tools assume diploid by default.

ADD COMMENTlink written 2.9 years ago by Vivek2.2k

Thanks for your help, now I have solved this problem using vcftools :)

ADD REPLYlink written 2.9 years ago by AlicePsyche20
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 763 users visited in the last hour