I have multiple whole-genome FASTQ (.fq) files prepared for variant calling. However, repeating the whole pipeline is quite expensive, so I wonder which data types are must-haves to extract.
Right now I am planning to extract the following:
- Variants (indels, SNPs), with HaplotypeCaller
- Structural variants (<1000 bp long), with Manta
What other "must-haves" could I extract?