Question: 012 genotype matrix using VCFtools
2.3 years ago, Ana wrote:

Hello everyone,

I have a VCF file containing nearly 11 million SNPs. I want to convert it into a 012 genotype matrix for LD pruning. I am using this command:

/data/programs/vcftools_0.1.13/bin/vcftools --vcf my.file.vcf \
    --012 --out output_geno.vcf

I get the output, but I am confused. According to the manual, the rows of the 012 genotype matrix are individuals and the columns are genotypes. I have 11 million SNPs, so shouldn't I get 11 million columns (one column per SNP)? When I count the columns, there are only about one million! Is something wrong, or am I making a silly mistake? Thanks for any help figuring this out ...

modified 2.3 years ago • written 2.3 years ago by Ana

Did you check the *.indv and *.pos files that are also output with the --012 parameter? The *.indv file should contain the same number of samples as the input VCF, and the *.pos file should contain one line per site written to the matrix.
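To make that check concrete, here is a rough sketch that compares the three outputs. It assumes the prefix from the question (output_geno.vcf), so the files would be output_geno.vcf.012, output_geno.vcf.012.indv and output_geno.vcf.012.pos. It also assumes the matrix is tab-separated and that the first field of each row is the individual's index rather than a genotype. The helper names count_lines and count_genotype_cols are made up for this example.

```shell
# Prefix passed to --out in the question (assumption: outputs are in the current directory)
prefix="output_geno.vcf"

# Count lines in a file: one individual per line in .012.indv, one site per line in .012.pos
count_lines() {
  wc -l < "$1" | tr -d ' '
}

# Count genotype columns in the first row of the .012 matrix.
# Fields are counted by turning tabs into newlines, which avoids any
# per-line field limit in older awk builds. One is subtracted for the
# leading individual-index field (assumption stated above).
count_genotype_cols() {
  echo $(( $(head -n 1 "$1" | tr '\t' '\n' | wc -l) - 1 ))
}

if [ -f "$prefix.012" ] && [ -f "$prefix.012.indv" ] && [ -f "$prefix.012.pos" ]; then
  echo "individuals (.indv lines): $(count_lines "$prefix.012.indv")"
  echo "sites (.pos lines):        $(count_lines "$prefix.012.pos")"
  echo "genotype columns (.012):   $(count_genotype_cols "$prefix.012")"
fi
```

If the number of genotype columns does not match the number of lines in the *.pos file, the column-counting method is the first thing to suspect, since some tools truncate or mishandle very long lines.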

Also, check the log file that's produced, particularly the line:

"After filtering, kept X out of a possible Y Sites"
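As a small sketch of that log check: the snippet below pulls the two numbers out of the "Sites" line. It assumes the log was written next to the other outputs as output_geno.vcf.log (the --out prefix plus .log) and that the line has exactly the wording quoted above. The function name sites_kept is made up for this example.

```shell
# Assumed log location: the --out prefix from the question plus ".log"
log="output_geno.vcf.log"

# Print "<kept> <possible>" from the site summary line of a vcftools log.
# Only the line ending in "Sites" is matched; other summary lines are ignored.
sites_kept() {
  sed -n 's/^After filtering, kept \([0-9][0-9]*\) out of a possible \([0-9][0-9]*\) Sites.*$/\1 \2/p' "$1"
}

if [ -f "$log" ]; then
  sites_kept "$log"
fi
```

The first number is how many sites vcftools actually kept, so it is the number of SNP columns to expect in the matrix; the second is how many sites it read from the VCF.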


modified 2.3 years ago • written 2.3 years ago by Kevin Blighe