Convert gVCF to VCF with ALT allele instead of the <NON_REF> symbolic allele
1
0
Entering edit mode
3.3 years ago

Hi all,

I generated gVCF files with the intention of creating a multi-sample cohort VCF. However, some os these samples need to be analysed individually and, thus, need to be converted to VCF format. gVCF files generated with the GATK pipeline have <non_ref> as symbolic allele. This is not handled by BCFTools convert --gvcf2vcf command, since the <non_ref> still appears in the converted VCF (which, thus, is not handled by other tools in upstream analyses).

GATK GenotypeGVCFs command can handle this convertion but is very slow. Is there an efficient way to generate a VCF file from gVCF with the ALT allele instead of the <non_ref> symbolic allele?

Thank you in advance!

Maira

genome SNP snp sequencing • 2.8k views
ADD COMMENT
0
Entering edit mode
3.3 years ago

GATK GenotypeGVCFs command can handle this convertion but is very slow.

GenotypeGVCFs is the only way to convert gvcf to vcf.

Is there an efficient way to generate a VCF file from gVCF with the ALT allele instead of the <non_ref> symbolic allele?

parallelize by chromosome

ADD COMMENT
0
Entering edit mode

I am new to GATK practices. Can you please elaborate?

ADD REPLY
0
Entering edit mode

Thank you! I was wondering if it is a norm to separate individual samples from combined Genotypegvcf after vqsr? Or does that somehow hinder the quality of the data?

ADD REPLY

Login before adding your answer.

Traffic: 1277 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6