Question: Running Pindel On Multiple Samples Produces More Calls Per Sample Than Running Pindel On A Single Sample?
Asked 3.7 years ago by Dalia (Rochester, NY):

I am using Pindel on human whole-exome sequencing BAM files. I have been looking only at short insertions (_SI) and deletions (_D), and I have found that when I run Pindel on a single sample, the number of variants called in that exome is much lower than the number called for the same exome when I run Pindel on multiple exomes together.

For example, I ran Pindel on one sample alone and found 415 deletions and short insertions combined. I also ran Pindel on ~12 samples together and found 756 deletions and short insertions combined for that same sample (counting only the calls in that specific sample). The rest of my samples show a similar pattern.

I have not yet had a chance to compare the calls themselves, but I plan to. Still, this large difference concerns me, and my questions are:

  1. Is this typical? Has anyone else experienced this type of behavior when using Pindel on a single sample -vs- multiple samples?
  2. Is it safe to assume that more samples = more statistical power = more accurate calls?
  3. Is there something obvious I am missing?
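
One way to approach the call-level comparison mentioned above is to reduce each run to a set of (chromosome, position, REF, ALT) keys for the sample of interest and intersect the two sets. The sketch below is only an illustration under stated assumptions: it assumes the Pindel output of both runs has already been converted to VCF (for example with pindel2vcf), that the sample's column name is known, and that a call "belongs" to a sample when its GT field contains a non-reference allele. The function names `sample_calls` and `compare_runs` are hypothetical.

```python
def sample_calls(vcf_text, sample):
    """Collect (chrom, pos, ref, alt) keys where `sample` carries a non-reference allele.

    Assumes standard tab-separated VCF text with a #CHROM header line.
    """
    calls = set()
    sample_idx = None
    for line in vcf_text.splitlines():
        if line.startswith("##") or not line.strip():
            continue  # skip meta-information and blank lines
        fields = line.split("\t")
        if line.startswith("#CHROM"):
            sample_idx = fields.index(sample)  # column holding this sample
            continue
        if sample_idx is None:
            raise ValueError("no #CHROM header found before the first record")
        fmt_keys = fields[8].split(":")
        values = fields[sample_idx].split(":")
        gt = values[fmt_keys.index("GT")] if "GT" in fmt_keys else "."
        alleles = gt.replace("|", "/").split("/")
        # Assumption: any allele other than reference (0) or missing (.) counts as a call.
        if any(a not in ("0", ".") for a in alleles):
            calls.add((fields[0], int(fields[1]), fields[3], fields[4]))
    return calls


def compare_runs(single_vcf, multi_vcf, sample):
    """Split one sample's calls into shared, single-run-only, and multi-run-only sets."""
    single = sample_calls(single_vcf, sample)
    multi = sample_calls(multi_vcf, sample)
    return {
        "shared": single & multi,
        "single_only": single - multi,
        "multi_only": multi - single,
    }
```

The `multi_only` set would hold the calls that appear for the sample only in the joint run, which are the ones to inspect first. Exact-key matching is a simplification: the same indel can be reported at slightly different positions in repetitive sequence, so this comparison may understate the overlap.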

Thank you!

Tags: exome, pindel