Tutorial:Germline variant calling pipeline using Snakemake
Entering edit mode
10 months ago
nhaus ▴ 210

Hello everybody,

as part of a project, I had to write an in-house pipeline to call germline mutations for ~100 patients.

For that I used Snakemake and GATKs best practice guidelines. Steps that take a long time (HaplotypeCaller or BaseQualityScoreRecalibration) are automatically parallelized over genomic intervals.

Furthermore, I tried to document the requirements to run the pipeline on your own as extensively as possible, and also included links, where to download gold standard reference material, so it is easy to use for people without a lot of experience.

I hope this is useful for anyone who is also trying to perform germline variant calling.

If you have any questions or improvements for the pipeline, please let me know.

You can find the project here:



snakemake GATK germline_variant_calling • 1.0k views
Entering edit mode
10 months ago

Nice! A small suggestion... Instead of listing programs in the Installation section you could provide a requirements.txt file listing those programs, like:

snakemake >=6
samtools =1.10

then tell the user to set up the conda environment with:

conda create -n GermlineMutationCalling
conda activate GermlineMutationCalling
mamba install --file requirements.txt # Use mamba, much better than conda

if someone doesn't want to use conda, the requirements.txt is still useful.

I think this is a lot easier for both users and developers.


Login before adding your answer.

Traffic: 1887 users visited in the last hour
Help About
Access RSS

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6