Hi all,
Does anybody have any recommendation regarding breakPointThreshold argument in control-FREEC?
I know that values should be between 0.1-0.8 and the closest to 0.1 the more CNV's will be detected. But how can I specify what is the appropriate value to use in my analysis?Because I tried 0.8 and 0.6 and I noticed a big difference in my results.
I am running a WES analysis using normal tumor comparison.
Thank you in advance.
Dear Chris,
So, as I told you I have some sequenced samples with higher coverage than others. These samples were double sequenced. So, when I split it in two different files. So instead of having a file with 400M reads, I have two of 200M reads. So, I run the same analysis for all these three files (400M, 200M, 200M) with the same arguments and I noticed that at the end the results are totally different for the 400M file. I am getting too many CNVs which all of them are gains. That means the algorithm didn t perform normalization. That means the results is totally biased by the coverage. How therefore did you perform an unbiased analysis from samples with different coverage? Maybe I missed an argument.Thank you in advance. Here is my new configuration file
Actually, I realized that if you use window and step argument in exome sequence analysis, then capture regions file is totaly useless. For instance, the read depth is calculated every 500 bases in my case.
Yes you are correct, probably this tool is not very much suited for your sample. So I would suggest you to check VarScan2, AdTex, ExomeCNV and such. Yes the RC normalization is on the binning of the 500 bases window. In that case you can remove the step and window size. I have never worked with such high depth data. Is it 40X or 400X? If it is 400X then probably you have to make a coverage plot to understand the median level of coverage for your sample either for each bases or for the baits and based on that you can select the
readCountThreshold
. But definitely do not stick to one tool. I was not able to replicate my results with CBS method so I went with Control-FREEC and ADtex, but since you have a very high depth in order of 400X I would actually ask to use some other tools which has great performance with higher depth, VarScan2 should be better I believe here.Actually, my coverage is something like x100 and x200. I tried to use ADTEx as well, but something went really bad. When I am trying to obtain the Depth of Coverage in ADTEx respective step, the temporary files created for coverage of normal and tumor are endless;what i mean is that ADTEx keeps writing in those files like for ever and the size was more than 500GB each and of course my server crashed due to shortage of free space. Any idea what is happening? Same thing when I tried to get coverage with bedtools
A late comment but I have been using Control FREEC recently and encountered the situation where my normal had 3x coverage over the tumour. I was worried that the program would not adjust for library size and just call deletions everywhere; however, based on how it has been developed (i.e. modelling coverage), it appeared to tolerate the differences in overall library size and calls appeared as expected.
My data was whole genome and the configurations: