Entering edit mode
4.6 years ago
mendel.roth
▴
10
Does anyone know how to engineer a FASTQ, SAM or BAM or related NGS data file so to test a bioinformatics pipeline for novel copy number variation calling? I am hoping to tweak sequence data to simulate CNV NGS data and run those files through several pipelines for validation studies. Thank you
See this prior thread for some resources: Gold standard CNV Calls for NA12878
There is: http://dgv.tcag.ca/dgv/app/home
These are real samples. Not engineered ones.
I would suggest googling for "simulating fastq" files, or similar terms.