Question

What is the right bin size for CNV detection in low-pass single-cell WGS?

0

4.5 years ago

am ▴ 60

Hello! I'm performing single-cell genome-wide copy-number profiling by low-pass sequencing (coverage: 1X, 0.5X, 0.1X). Could anybody suggest how to determine the optimal bin size for the genome segmentation step? It is usually set to 0.5 Mb or 1 Mb when genome coverage is as low as 1X. I would like to know which is best for my coverage levels. Any advice? Thank you!

CNV low pass WGS single cell genome binning • 2.7k views

ADD COMMENT • link updated 4.5 years ago by Tomás Di Domenico ▴ 30 • written 4.5 years ago by am ▴ 60

0

Which tool are you using for the analysis?

ADD REPLY • link 4.5 years ago by igor 13k

0

I'm testing different tools such as CNVkit, Ginkgo, and Control-FREEC. In the CNVkit manual, for example, I read: "Increase the “target” average bin size (--target-avg-size), e.g. to at least 1000 bases for 30x coverage, or proportionally more for lower-coverage sequencing". I would like to expand on this concept.
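
To make the proportional rule in that quote concrete, here is a minimal sketch of the arithmetic. It assumes that "proportionally more" means the bin size scales inversely with coverage from the quoted baseline of 1000 bases at 30x; the function name `scaled_bin_size` is hypothetical and not part of CNVkit.

```python
def scaled_bin_size(coverage, ref_bin_size=1000, ref_coverage=30.0):
    """Scale the bin size inversely with coverage.

    Assumption: the expected number of reads per bin should stay the same
    as at the reference point (1000 bases at 30x, taken from the quote).
    """
    if coverage <= 0:
        raise ValueError("coverage must be positive")
    return round(ref_bin_size * ref_coverage / coverage)


# Coverage levels mentioned in the question
for cov in (1.0, 0.5, 0.1):
    print(f"{cov}X -> at least {scaled_bin_size(cov):,} bases")
```

Under this reading, 1X, 0.5X and 0.1X correspond to bins of at least 30,000, 60,000 and 300,000 bases. These are lower bounds only; noisier single-cell data may well justify larger bins.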

ADD REPLY • link 4.5 years ago by am ▴ 60

1

I am not sure about the others, but Control-FREEC can automatically determine the optimal bin size.

ADD REPLY • link 4.5 years ago by igor 13k

2

I second the use of Control-FREEC. It is a truly great program suite that can determine copy number from WGS, WES, or targeted sequencing.

ADD REPLY • link 4.5 years ago by Kevin Blighe 88k

0

Thank you, Igor and Kevin. Yes, the coefficientOfVariation parameter in Control-FREEC lets it determine the window size automatically.
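
The idea of deriving a window size from a target coefficient of variation can be sketched with a simple Poisson model: if read counts per window are Poisson-distributed, the CV is 1/sqrt(mean reads per window), so a target CV fixes the number of reads each window needs. This is an illustration under that assumption, not necessarily Control-FREEC's exact internal calculation. The function name, the 100 bp read length, and the example CV of 0.05 are all assumed values.

```python
def window_for_target_cv(coverage, target_cv, read_length=100):
    """Window size (bases) whose expected read count gives the target CV.

    Assumptions (not taken from Control-FREEC's source):
      - read counts per window follow a Poisson distribution,
        so CV = 1 / sqrt(mean reads per window)
      - mean reads per window = coverage * window / read_length
    """
    if coverage <= 0 or target_cv <= 0:
        raise ValueError("coverage and target_cv must be positive")
    reads_needed = 1.0 / target_cv ** 2
    return round(reads_needed * read_length / coverage)


# Coverage levels from the question, with an example target CV of 0.05
for cov in (1.0, 0.5, 0.1):
    print(f"{cov}X -> window of about {window_for_target_cv(cov, 0.05):,} bases")
```

With these assumed values, a CV of 0.05 needs about 400 reads per window, which works out to roughly 40 kb at 1X, 80 kb at 0.5X and 400 kb at 0.1X. Real single-cell data is typically more dispersed than Poisson, so treat these as optimistic minimums.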

ADD REPLY • link 4.5 years ago by am ▴ 60

score 1 · Answer 1 · 2020-02-03

1

4.5 years ago

Tomás Di Domenico ▴ 30

The first author of this paper has developed an interactive app that lets you play around with your sample characteristics and see how that would affect the required read depth to be able to perform CNV calling: https://gmacintyre.shinyapps.io/sWGS_power/

ADD COMMENT • link 4.5 years ago by Tomás Di Domenico ▴ 30

0

Bizarrely, they do not seem to have done any adjustment for GC content, which affects coverage across genomic regions. As such, at face value, this appears to be yet one more copy number program that does not address all of the issues.

ADD REPLY • link 4.5 years ago by Kevin Blighe 88k

0

This is not a CNV program, just a tool to help you estimate the depth you need based on sample characteristics before you do your sequencing.

ADD REPLY • link 4.5 years ago by Tomás Di Domenico ▴ 30

0

Gracias por la clarificación / Thanks for confirming.

ADD REPLY • link 4.5 years ago by Kevin Blighe 88k

0

The answer was not really for you, but rather so that other people don't just dismiss the tool because of your comment. I think it can indeed be useful.

Fantastic Spanish, by the way.

ADD REPLY • link 4.5 years ago by Tomás Di Domenico ▴ 30

0

De nada

ADD REPLY • link 4.5 years ago by Kevin Blighe 88k