Question: How is an unphased VCF converted into a PED file?
24 months ago by
jingjin2203 (40) wrote:

Hi All,

I have some ddRADseq data from a diploid organism I'm working on.

I've generated an unphased VCF file using freebayes that I want to convert into a PED file. I was wondering how VCF-to-PED conversion deals with unphased VCF data. When I further converted the PED to FASTA, each sample had two reads, and the two reads for each sample were different. So how does the conversion program decide which allele at a heterozygous site goes into which read?

Hope my question makes sense. Any answers or comments will be appreciated!


ped freebayes phaseing ddrad vcf • 631 views
modified 23 months ago by Kevin Blighe (53k) • written 24 months ago by jingjin2203 (40)
23 months ago by
Kevin Blighe (53k) wrote:

I presume that you mean running plink --vcf on your file in order to convert it to PLINK PED format?

The latest implementation of PLINK ignores phasing information. At a heterozygous site, the variant allele becomes A1 whilst the reference allele becomes A2. At a homozygous variant site, both alleles are set to the variant allele.
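To make that rule concrete, here is a minimal Python sketch of the mapping as described above. It is not PLINK's actual code: the function name `gt_to_ped_alleles` is hypothetical, it assumes biallelic diploid sites with a bare GT field, and it uses `0` as the PED missing-allele code. PLINK itself may also reorder alleles (for example by allele frequency), so check its real output rather than relying on this.

```python
def gt_to_ped_alleles(gt, ref, alt):
    """Map one biallelic VCF GT string to a (first, second) PED allele pair.

    Illustrative only: phase is ignored, the ALT allele is written first
    at heterozygous sites, and missing calls become ("0", "0").
    """
    # "/" (unphased) and "|" (phased) separators are treated identically.
    calls = gt.replace("|", "/").split("/")
    if len(calls) != 2 or "." in calls:
        return ("0", "0")  # assumed PED missing-genotype code
    if any(c not in ("0", "1") for c in calls):
        raise ValueError("only biallelic genotypes are handled: " + gt)
    n_alt = calls.count("1")
    if n_alt == 0:
        return (ref, ref)  # homozygous reference
    if n_alt == 2:
        return (alt, alt)  # homozygous variant
    return (alt, ref)      # heterozygous, fixed order regardless of phase


print(gt_to_ped_alleles("0/1", "A", "G"))
print(gt_to_ped_alleles("1|0", "A", "G"))
```

Both calls give the same pair, so the order of the two allele columns at a heterozygous site says nothing about which chromosome copy carries which allele. Two "reads" per sample built from those columns in a FASTA are therefore not real haplotypes.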


written 23 months ago by Kevin Blighe (53k)

Thanks, Kevin! Yes, that's exactly what I was asking. I really appreciate your kind reply! Just a follow-up question: do you know how I can convert a phased VCF file to PLINK PED format with the phasing information incorporated?
Thank you!

written 23 months ago by jingjin2203 (40)

I'm not sure that phasing information is ever taken into account in PLINK. The person who will know is chrchang523.

If you take a look at the PLINK documentation for --vcf:

--vcf loads a (possibly gzipped) VCF file, extracting information which can be represented by the PLINK 1 binary format and ignoring everything else (after applying the load filters described below). For example, phase and dosage information are currently discarded. (This situation will improve in the future, but we do not have plans to try to handle everything in the file.)

It says that phasing information is "discarded" ...
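As a rough illustration of what "discarded" costs you, here is a small Python sketch. The helper `drop_phase` and both samples are made up for the example, and this is not how PLINK is implemented. It only shows that two samples with different haplotypes become indistinguishable once the phase separator and allele order are thrown away.

```python
def drop_phase(gt):
    """Return an order-free genotype: sorted allele indexes joined by '/'."""
    return "/".join(sorted(gt.replace("|", "/").split("/")))


# Two hypothetical samples, heterozygous at the same two sites.
sample_a = ["0|1", "0|1"]  # haplotypes 0-0 and 1-1
sample_b = ["0|1", "1|0"]  # haplotypes 0-1 and 1-0

unphased_a = [drop_phase(g) for g in sample_a]
unphased_b = [drop_phase(g) for g in sample_b]

# After dropping phase, both samples carry identical genotype calls.
print(unphased_a == unphased_b)
```

So if the downstream analysis needs haplotypes, the phased VCF has to be kept and used alongside (or instead of) the PED file.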

modified 23 months ago • written 23 months ago by Kevin Blighe (53k)

Thank you, Kevin! I really appreciate it!

written 23 months ago by jingjin2203 (40)