Entering edit mode
9.9 years ago
Chirag Nepal
★
2.4k
Hi all,
I am using varscan2 to identify SNPs in our data (exome study comparing normal/tumor from 5 patients).
For testing I used only two pairs,
samtools mpileup -f assembly.fa ST1_normal.bam ST1_tumor.bam ST2_normal.bam ST2_tumor.bam > myTestData.mpileup
It takes long time, is there a way how can I run in parrallel. Or if I run it in single pair (tumor/normal), can I simply concatenate all mpileup files into single output file, which I can input to varscan2 for SNP calling.
Thanks for your help !
Cheers
Chirag
Dear Pierre,
Thanks for the answer. So just to be clear i need to run separately for each chromosome
So to make these run in parallel, i need to submit the jobs separately in different bash scripts ? I guess if I put all these commands in a single bash script they will run sequentially. Right ? Though it might be faster when chr is separated.
Next, when we have this multiple mpileup results, can we simple concatenate in single file :
like
cat mplie_N1.mpileup mplie_N2.mpileup > mplie_total.mpileup
Or does samtools have some functions to concat them.
thanks !
Cheers
Chirag
Use GNU parallel or GNU make with option
-j
How To Run Muscle In Batch?