Question: combining sequencing variants
0
gravatar for hmk123
4.2 years ago by
hmk1230
hmk1230 wrote:

I have targeted sequencing from 2 different projects.  One project was sequenced on SOLiD and the other on Illumina.  I have independently called variants on both projects using GATK.  Is is possible to combine the called variants (.vcf files) to allow for association analysis with a larger sample size?  Do you have a recommendation on a tool that can combine files?

Thanks!

sequencing next-gen • 1.5k views
ADD COMMENTlink modified 4.2 years ago by Katie D'Aco990 • written 4.2 years ago by hmk1230
3
gravatar for Emily_Ensembl
4.2 years ago by
Emily_Ensembl16k
EMBL-EBI
Emily_Ensembl16k wrote:

VCF tools?

ADD COMMENTlink written 4.2 years ago by Emily_Ensembl16k
3
gravatar for Katie D'Aco
4.2 years ago by
Katie D'Aco990
Massachusetts
Katie D'Aco990 wrote:

You should first normalize the variants if they aren't already. Then use a combination of bedtools intersect (to get the regions where the targets from the two projects overlap) and vcftools merge (to combine the vcf's from the two projects into a single vcf).  Before doing any association tests, do LD-pruning and PCA to check for batch effects. Since they were sequenced with different technologies combining them might not be valid.

ADD COMMENTlink written 4.2 years ago by Katie D'Aco990

Can you describe more what you mean by LD-pruning and PCA checks?

ADD REPLYlink written 4.2 years ago by hmk1230

I've used EIGENSTRAT to do PCA, but there are other tools that do this. If you plot your subjects along the first 2 PCs and they are clustered by sequencing technology then you don't want to use the combined data set to do association tests.  Before you do PCA you want to do LD-pruning (can be done with EIGENSTRAT).

ADD REPLYlink written 4.2 years ago by Katie D'Aco990
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 800 users visited in the last hour