Question

Target deletions with adaptive sampling

0

Entering edit mode

7 weeks ago

njornet ▴ 20

I want to perform adaptive sampling to detect a panel of diseases. Some of them are caused by deletions such as Prader-Willi syndrome and Angelman syndrome, both are cause by the deletion of 15q11.2-q13. How should I design the bed file to target this region? Do I include the whole region?

Edit: For more context, we want to detect these diseases in fetal cells from a mother's blood sample, so we will have a small amount of fetal DNA. At the current state we are performing whole genome amplification, so we won't be able to detect the diseases if they are cause by imprinting defects. From my research, PW is cause by a deletion of the 15q11.2-q13 region and Angelman by 15q11.2-q13 deletion in about 70% of cases.

So in general, what I'm asking for are tips on how to detect deletions using AS.

microdeletions nanopore AdaptiveSampling • 590 views

ADD COMMENT • link 6 weeks ago by njornet ▴ 20

0

Entering edit mode

7 weeks ago

Brian Bushnell 20k

You are posing a question that assumes a lot of prior knowledge, because it is very domain-specific. If you want a useful answer, I suggest you rephrase the question in a way that does not require prior work in your lab to answer it. For example, mentioning the target organism, the chromosome, the region of interest (in bp), and what software you are trying to use, would all be useful.

ADD COMMENT • link 7 weeks ago by Brian Bushnell 20k

score 2 · Accepted Answer · 2024-03-09

I want to perform adaptive sampling to detect a panel of diseases.

this is the most problematic statment of your post. how many loci. what are they? how big is each? the feasibility of your study may well depend on what you mean by this nebulous statement...

Some of them are caused by deletions such as Prader-Willi syndrome and Angelman syndrome, both are cause by the deletion of 15q11.2-q13.

.... this is not entirely accurate.

while deletion followed by restoration of normal copy number through uniparental disomy or another mechanism is certainly one possible pathogenetic mechanism of PW or Angelman, it isnt really accurate to say that they are caused by deletions without further qualification... they may be also caused by issues with imprinting without evoking other mechanisms, or may be simple deletions i beleive... in a minority of cases....

but the main reason why nanopore could make sense is that these are disorders of imprinting. the use of nanopore is not needed to detect a mere deletion - this can be accomplished with far more ease through other technologies... rather, the utility of nanopore in this context flows from its ability to detect epigenetic modification primarily, and its ability to capture SVs secondarily.

How should I design the bed file to target this region?

This question is difficult to answer without greater clarification as to your goals. generally speaking, for AS and PW much of the epigenetic modification is concentrated into so-called "imprinting control regions" (ICRs) or differentially methylated regions (DMRs). you would probably want to target these regions first and foremost... but in practice the amount and patterning and methylation varies by tissue type, environmental factors, genetic background and everyone's favorite, "stochastic events" ... so its pretty difficult to give you meaningful advice on what you would want to do without further clarification.

Do I include the whole region?

you need to make a detailed study of this region, including genetic variation in the region in your population. the ICRs are thought to be separated by a few hundred kb, and the whole region is - i if understand correctly, 3Mb or so... so if that is all you need, its honestly a pretty good size for a AS approach. other labs (e.g. Mayo laboratories) has reported pretty nice enrichment of a 3Mb region (up to 150x) at AMP last year. generally 3Mb is 0.1% of 3Gb which is on the low end of what nanopore recommends as an "optimal" target region size for AS.

so overall the approach sounds like like it could be feasible, and including the whole region might be easier that trying to just specify the regions containing the ICRs only - that seems likely to result in loss of relevant reads that begin outside the region, but extend into the regions you might care about. in general, yes is probably the answer to this question but again. you need to do a lot of legwork to know this is true for you....

the bottom line here is you are going to need to make a serious study of this locus and any other loci on your panel before doing this, or find published reports of others doing the exact same things to use as an initial template.

at any rate, i hope that my post gives you enough vocabulary and what have you to get you started...