Question: VCF merge containing CNV
1
gravatar for Andrea Spinelli
3 months ago by
Italy/Pisa
Andrea Spinelli10 wrote:

Hello everybody, like the title: which is the way to merge VCFs files containing CNV?

I use vcf-merge (https://vcftools.github.io/perl_module.html#vcf-merge), a VCFtools function, and after bgzip and tabix (http://www.htslib.org/doc/tabix.html), SAMtools, to indexing and tab separating variants, but I don't know if it is the right way. Thanks.

cnv sequence vcf • 215 views
ADD COMMENTlink modified 3 months ago by Dan Gaston6.9k • written 3 months ago by Andrea Spinelli10

Why would a VCF contain CNV? / Perché un VCF contenesse CNV? /

A copy number variant is generally defined as a region > 1 kilobase. VCF files should only contain single nucleotide variants and short insertions/deletions (InDels). If you have a customised VCF format, then the standard tools used to manipulate VCF files may not understand how to interpret your custom format. In that case, you could use Python scripts to manipulate your VCF files. I have done this recently to include copy number variants in my custom VCF format.

Kevin

ADD REPLYlink written 3 months ago by Kevin Blighe9.0k
1

Structural variants, including CNVs, have been allowed in VCFs according to the specification for quite some time. You can get into adding custom tags for ease of use (like SVEND is often used to give easy reference to stop coordinates). While it may well be that a caller may be adding custom fields to the INFO line it is incorrect to say that only short Indels and SNVs should be in a VCF file.

ADD REPLYlink written 3 months ago by Dan Gaston6.9k
2
gravatar for Dan Gaston
3 months ago by
Dan Gaston6.9k
Canada
Dan Gaston6.9k wrote:

Whether vcf-merge works for what you are doing or not I'm not actually sure off hand. However, since you have a solution you have tried the answer is to test that solution. Take some sample VCFs with the type of variants you want to merge, run vcf-merge, and look at the results. Do you get what you would expect?

If vcf-merge doesn't do what you expect you may want to try the ensemble method from bcbio.variation.recall

ADD COMMENTlink written 3 months ago by Dan Gaston6.9k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1426 users visited in the last hour