Entering edit mode
20 months ago
chrisgr
▴
20
I was playing around with somatic data from Novogene. For one tumor-normal sample they provided a VCF file that contained about 100 somatic mutations. In their manual they state that they used GATK mutect2.
In order for me to understand what is going on I wanted to reproduce these somatic mutations using the tumor- and normal bam files. I howerver discovered ~13.000. I can't find anywhere what filtering methods they used. I've looked in their provided documentation but it only says it's been 'filtered'. What are common ways to filter somatic VCF files?
Thanks in advance!
Did you look in the header of the provided VCF to see if any information has been encoded there (e.g. command line/options used).