VCF Database Table Structure
9.7 years ago
win ▴ 930

Hi all, I am trying to store VCF files in a database table. While the VCF file structure is quite straightforward, the issue I am thinking about is what to do when there is more than one sample in the VCF file.

Does anyone have any recommendations for a relational table structure to store VCF files?

Thanks in advance.

vcf • 7.9k views
9.7 years ago

I use that kind of schema: (http://code.google.com/p/variationtoolkit/wiki/Vcf2Sqlite)


But I've never really used it. Why? New ideas, new tools, new sources of annotations: I often re-annotate my VCF files. It's easier to parse tabix-indexed files than to store them in a database.
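For the multi-sample case raised in the question, here is a minimal sketch of one possible layout, written with Python's built-in `sqlite3` module. It is not the schema from the wiki page linked above. The table names (`variant`, `sample`, `genotype`), the column names, and the helpers `load_vcf` and `genotypes_at` are all hypothetical, chosen only for illustration. The idea is one row per variant, one row per sample, and one row per (variant, sample) call, so adding samples adds rows rather than columns.

```python
import sqlite3

# Hypothetical schema (an assumption, not taken from any existing tool):
# the fixed VCF columns go in `variant`, sample names in `sample`, and
# each per-sample column becomes one row in `genotype`.
SCHEMA = """
CREATE TABLE variant (
    id            INTEGER PRIMARY KEY,
    chrom         TEXT NOT NULL,
    pos           INTEGER NOT NULL,
    vcf_id        TEXT,
    ref           TEXT NOT NULL,
    alt           TEXT NOT NULL,   -- comma-separated if multi-allelic
    qual          REAL,
    filter_status TEXT,
    info          TEXT             -- raw INFO string, left unparsed
);
CREATE TABLE sample (
    id   INTEGER PRIMARY KEY,
    name TEXT NOT NULL UNIQUE
);
CREATE TABLE genotype (
    variant_id INTEGER NOT NULL REFERENCES variant(id),
    sample_id  INTEGER NOT NULL REFERENCES sample(id),
    gt         TEXT,               -- GT value, e.g. 0/1
    raw        TEXT,               -- full per-sample string, e.g. 0/1:12
    PRIMARY KEY (variant_id, sample_id)
);
CREATE INDEX variant_pos ON variant(chrom, pos);
"""

EXAMPLE_VCF = [
    "##fileformat=VCFv4.1",
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tS1\tS2",
    "chr1\t100\t.\tA\tG\t50\tPASS\tDP=20\tGT:DP\t0/1:12\t1/1:8",
]


def load_vcf(conn, lines):
    """Load an iterable of VCF text lines into the tables above."""
    sample_ids = []
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("##"):
            continue  # skip blank lines and meta-information lines
        fields = line.split("\t")
        if line.startswith("#CHROM"):
            # Columns 10 onward of the header line are the sample names.
            for name in fields[9:]:
                cur = conn.execute(
                    "INSERT INTO sample(name) VALUES (?)", (name,))
                sample_ids.append(cur.lastrowid)
            continue
        chrom, pos, vcf_id, ref, alt, qual, flt, info = fields[:8]
        cur = conn.execute(
            "INSERT INTO variant(chrom, pos, vcf_id, ref, alt, qual,"
            " filter_status, info) VALUES (?, ?, ?, ?, ?, ?, ?, ?)",
            (chrom, int(pos), vcf_id, ref, alt,
             None if qual == "." else float(qual), flt, info))
        variant_id = cur.lastrowid
        # FORMAT (column 9) names the colon-separated per-sample values.
        keys = fields[8].split(":") if len(fields) > 8 else []
        for sample_id, raw in zip(sample_ids, fields[9:]):
            values = dict(zip(keys, raw.split(":")))
            conn.execute(
                "INSERT INTO genotype VALUES (?, ?, ?, ?)",
                (variant_id, sample_id, values.get("GT"), raw))


def genotypes_at(conn, chrom, pos):
    """Return (sample name, GT) pairs for one position, sorted by name."""
    return conn.execute(
        "SELECT s.name, g.gt FROM genotype g"
        " JOIN sample s ON s.id = g.sample_id"
        " JOIN variant v ON v.id = g.variant_id"
        " WHERE v.chrom = ? AND v.pos = ? ORDER BY s.name",
        (chrom, pos)).fetchall()


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    load_vcf(conn, EXAMPLE_VCF)
    print(genotypes_at(conn, "chr1", 100))  # [('S1', '0/1'), ('S2', '1/1')]
```

The INFO string and the non-GT FORMAT values are kept as raw text here, which keeps the sketch short but means any re-annotation still requires reloading those columns.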


Thanks, I will probably try SQLite.

9.7 years ago
User 59 13k

Have a look at Gemini:

Code: http://github.com/arq5x/gemini

Docs: https://gemini.readthedocs.org/en/latest/index.html

Blurb:

Gemini is designed to be a flexible framework for exploring genetic variation in the context of the wealth of genome annotations available for the human genome. By placing genetic variants, sample genotypes, and useful genome annotations into an integrated database framework, gemini provides a simple, flexible, yet very powerful system for exploring genetic variation for disease and population genetics.
