Question

Dealing with a large sample for variant calling

0

Entering edit mode

5.4 years ago

Mehulsharma.253 ▴ 30

I have a large number of samples. Like a 100 samples at least (plus these are paired end reads). I aim to call variants on these samples and then predict their effects on protein structure dynamics.

The only way that seems possible for now is to align each sample individually, pre-process them individually, call on them individually and then combine them into a gvcf for analysis.

This however, seems very time intensive and computationally cumbersome. What would be the alternatives to this ?

I'm currently using standard bash script commands and plan to use various tools, viz. GATK, freebayes, varscan 2, pindel, etc.

NGS alignment variant-calling joint-calling • 1.4k views

ADD COMMENT • link updated 4 weeks ago by Ram 43k • written 5.4 years ago by Mehulsharma.253 ▴ 30

score 0 · Answer 1 · 2018-12-12

0

Entering edit mode

5.4 years ago

Pierre Lindenbaum 161k

This however, seems very time intensive and computationally cumbersome. What would be the alternatives to this ?

GATK hapcaller in gvcf mode: https://software.broadinstitute.org/gatk/documentation/article.php?id=3893

enter image description here

ADD COMMENT • link 5.4 years ago by Pierre Lindenbaum 161k