I am building a pipeline for Illumina NGS sequence alignment, SNP/indel calling, and association testing across multiple samples.
Can anyone recommend a data set for testing a pipeline like this, right through to the stage of calling associated mutations?
Alternatively, is there an existing tool for generating such a data set?
Any tips would be greatly appreciated.
Thanks in advance!