Sequence Haplotypes From 1Kg Project Variants
2
2
Entering edit mode
11.2 years ago

Greetings,

I need to create haplotype sequences (fastas) from the 1000 genomes data. I was wondering if anyone knows of a tool before I start writing my own?

I want to be able to pass in hg19 coordinates & 1kg.vcf & hg19.fasta and reconstruct all the haplotypes for that region.

1000genomes haplotype vcf • 2.6k views
ADD COMMENT
6
Entering edit mode
11.2 years ago
Adam ★ 1.0k

VCFtools should be able to do this:

cat ref.fa | vcf-consensus file.vcf.gz > out.fa

ADD COMMENT
0
Entering edit mode

Do you know if this supports multiple individuals? I don't want it to collapse the individuals into a consensus .

ADD REPLY
0
Entering edit mode

I haven't used it for this, but it does have a --sample argument that sounds like it will do what you want.

ADD REPLY
2
Entering edit mode
11.2 years ago
Peixe ▴ 660

Take a look at Plink options for working with haplotypes here. I have been recently using it quite a lot for some haplotype calculations, and sounds like it might work for what you want, if you specify the haplotypes. But I don't know if sequences as large as fasta are gonna be accepted. Anyway looking is free, and the plink tutorial is really good! ;)

Cheers!

ADD COMMENT

Login before adding your answer.

Traffic: 2553 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6