Question: 012 genotype matrix using vcftools
21 months ago, Ana wrote:

Hello everyone,

I have a VCF file containing nearly 11 million SNPs. I want to convert it into a 012 genotype matrix for LD pruning. I am using this command:

/data/programs/vcftools_0.1.13/bin/vcftools --vcf my.file.vcf \
    --012 --out output_geno.vcf

So, I get the output, but I am confused. According to the manual, the rows of the 012 genotype matrix are individuals and the columns are genotypes. I have 11 million SNPs, so shouldn't I get 11 million columns (one column per SNP)? When I count the columns, there are only about one million! Is something wrong, or am I making a silly mistake? Thanks for any help figuring out my mistake.


Did you check the *.012.indv and *.012.pos files that are also output with the --012 parameter? The *.012.indv file should contain the expected number of samples that were in the input VCF, and the *.012.pos file lists the sites that were written to the matrix.

Also, check the log file that's produced, particularly the line:

"After filtering, kept X out of a possible Y Sites"

Kevin

written 21 months ago by Kevin Blighe
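
The checks suggested above could be scripted roughly as follows. This is only a sketch: the function names are made up for illustration, and it assumes the usual vcftools naming, where the value given to --out is a prefix and the outputs are PREFIX.012, PREFIX.012.indv, PREFIX.012.pos and PREFIX.log. It also assumes that the first field on each row of the .012 file is the individual's index rather than a genotype, so that field is subtracted from the column count.

```shell
# Hypothetical helper functions; "$1" is the prefix that was passed to --out.

# Number of sites written to the matrix (one line per site in PREFIX.012.pos).
count_sites() {
    wc -l < "$1.012.pos" | tr -d ' '
}

# Number of individuals written to the matrix (one line each in PREFIX.012.indv).
count_individuals() {
    wc -l < "$1.012.indv" | tr -d ' '
}

# Number of genotype columns on the first row of PREFIX.012.
# Assumption: the first tab-separated field is the individual's index,
# so it is not counted as a genotype.
count_genotype_columns() {
    head -n 1 "$1.012" | awk -F'\t' '{ print NF - 1 }'
}

# Print the "After filtering, kept X out of a possible Y Sites" line
# from the log, assuming the log was written to PREFIX.log.
show_filter_summary() {
    grep 'After filtering' "$1.log"
}
```

For the command in the question the prefix would be output_geno.vcf, so count_sites output_geno.vcf and count_genotype_columns output_geno.vcf should report the same number if the matrix is intact. If both are close to one million, the sites were dropped before the matrix was written, and the log line shows how many were kept.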