Question: Formatting for fineSTRUCTURE input
3.7 years ago by
natasha100 wrote:


I would like to use fineSTRUCTURE to assess the population structure of a bacterial species, so I will be inputting SNP data.

However, I don't understand how to create the 'phased' data format that fineSTRUCTURE requires. The fineSTRUCTURE manual lists multiple programmes to help with this phasing process, such as PHASE, Beagle, ShapeIT and IMPUTE2. However, I don't know where to even start with these.

For example PHASE requires me to input my data in the following format...



P Position(1) Position(2) ... Position(NumberOfLoci)
LocusType(1) LocusType(2) ... LocusType(NumberOfLoci)
ID(1)

But how do I get this?

As it stands I have the core genome alignment, the SNP alignment and a VCF of my data. How do I use these formats to phase my data? Can anyone help to point me in the right direction?

Many many thanks!!!

11 months ago by
nataliagru150 wrote:

Hello. You need to use a program such as ShapeIT or IMPUTE2 to phase your VCF file. ShapeIT takes VCF files as input and outputs a phased file, which you then need to convert to ChromoPainter format (scripts and tools for this are provided on the fineSTRUCTURE website). Everything I mention here is described in great detail in the fineSTRUCTURE manual. You can use any other phasing software you like, but the ones I mentioned are recommended by the fineSTRUCTURE authors.

Link to ShapeIT: Link to IMPUTE2:
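
To make the conversion step more concrete, here is a minimal Python sketch that turns a SNP alignment into a phase-style text file. It rests on several assumptions that are not stated in this thread and should be checked against the fineSTRUCTURE manual: that the data are haploid, so each aligned sequence can be treated as one haplotype; that the target layout is a haplotype count, a SNP count, a "P" line of positions, and then one haplotype per line; and that 0/1 allele coding is acceptable. The function names read_fasta and snp_alignment_to_phase and the example positions are hypothetical, and this is not one of the official conversion scripts.

```python
def read_fasta(text):
    """Parse FASTA text into an ordered list of (name, sequence) pairs."""
    records = []
    name, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                records.append((name, "".join(chunks)))
            # Keep only the first word of the header as the sample name.
            name, chunks = line[1:].split()[0], []
        else:
            chunks.append(line.upper())
    if name is not None:
        records.append((name, "".join(chunks)))
    return records


def snp_alignment_to_phase(records, positions):
    """Build phase-style text from a haploid SNP alignment.

    Only columns with exactly two alleles, both in A/C/G/T, are kept.
    The first sample's allele is coded 0 and the other allele 1.
    The layout written here is an assumption, not a confirmed spec.
    """
    seqs = [seq for _, seq in records]
    if not seqs:
        raise ValueError("no sequences supplied")
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("sequences are not aligned to the same length")
    if len(positions) != length:
        raise ValueError("need exactly one position per alignment column")

    kept = []
    for i in range(length):
        alleles = {s[i] for s in seqs}
        if len(alleles) == 2 and alleles <= set("ACGT"):
            kept.append(i)

    lines = [
        str(len(seqs)),                                  # number of haplotypes
        str(len(kept)),                                  # number of SNPs kept
        "P " + " ".join(str(positions[i]) for i in kept),
    ]
    for s in seqs:
        lines.append("".join("0" if s[i] == seqs[0][i] else "1" for i in kept))
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Made-up toy alignment: four samples, four alignment columns.
    toy = ">s1\nACGT\n>s2\nATGT\n>s3\nATGA\n>s4\nACNA\n"
    print(snp_alignment_to_phase(read_fasta(toy), [10, 25, 40, 77]), end="")
```

In the toy input, the first column is dropped because it does not vary and the third because it contains an N, so only two sites are written. Real data would also need the sample ID file and any recombination map that the manual asks for, which this sketch does not produce.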
