Question: CNV Analysis After Segmentation
7.0 years ago, Irsan (7.0k) wrote:

Hi there everyone,

We have tumor-normal paired whole-genome sequencing data and have successfully estimated copy numbers and segmented them, which basically leaves us with a list of coordinates. Now my question is: what are the best practices after segmentation? We are planning to annotate the segments by retrieving the transcripts located in the CNVs with the GenomicFeatures package from Bioconductor and then do a gene set enrichment analysis. Should I be doing some extra quality filtering, such as filtering my segments for known CNVs, low-mappability regions, ...?

Does anyone want to share code or best practices?
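
To make the two steps above concrete (filtering segments against problematic regions, then listing the genes each remaining segment overlaps), here is a minimal sketch in plain Python. It is not the GenomicFeatures workflow; the `Interval` class, the function names, the toy coordinates, and the 50% masking threshold are all assumptions made up for illustration, and the mask intervals are assumed not to overlap each other.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Interval:
    chrom: str
    start: int  # 0-based, inclusive
    end: int    # exclusive
    name: str = ""


def overlap_bp(a, b):
    """Number of bases shared by two intervals (0 if on different chromosomes)."""
    if a.chrom != b.chrom:
        return 0
    return max(0, min(a.end, b.end) - max(a.start, b.start))


def masked_fraction(segment, masks):
    """Fraction of a segment covered by mask regions (masks assumed non-overlapping)."""
    covered = sum(overlap_bp(segment, m) for m in masks)
    return covered / (segment.end - segment.start)


def filter_segments(segments, masks, max_masked=0.5):
    """Keep segments whose masked fraction does not exceed max_masked (hypothetical cutoff)."""
    return [s for s in segments if masked_fraction(s, masks) <= max_masked]


def annotate(segments, genes):
    """Map each segment name to the names of the genes it overlaps by at least 1 bp."""
    return {s.name: [g.name for g in genes if overlap_bp(s, g) > 0] for s in segments}


# Toy data, invented for this example.
segments = [
    Interval("chr1", 1000, 5000, "seg1"),
    Interval("chr1", 8000, 9000, "seg2"),
    Interval("chr2", 100, 600, "seg3"),
]
genes = [
    Interval("chr1", 4500, 6000, "GENE_A"),
    Interval("chr1", 8200, 8400, "GENE_B"),
    Interval("chr2", 700, 900, "GENE_C"),
]
masks = [Interval("chr1", 7900, 8800, "low_mappability")]

kept = filter_segments(segments, masks, max_masked=0.5)
annotation = annotate(kept, genes)
print([s.name for s in kept])
print(annotation)
```

The same mask list could hold known germline CNVs as well as low-mappability regions. For whole-genome gene sets, an interval tree or a sorted sweep would scale better than this all-against-all comparison.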

cnv • 2.3k views
modified 5 months ago by ginabussard0 • written 7.0 years ago by Irsan (7.0k)

How many samples? Whole genome? What depth?

modified 7.0 years ago • written 7.0 years ago by Sean Davis (25k)

Only 1 tumor-normal pair (pilot study), whole genome, 40x, 100 bp paired-end reads (insert size 100 bp).

written 7.0 years ago by Irsan (7.0k)

Hi Irsan,

Could you please provide a sample (working tutorial) showing how to get the final list of amplified/deleted genes across n T/N samples? I used VarScan2 for copy number calling and then CBS for segmentation. I am stuck here at the moment.

Thanks !

written 4.1 years ago by Chirag Nepal (2.2k)

Open this as a new question or do a Google/Biostar search.

written 4.1 years ago by Irsan (7.0k)
7.0 years ago, Sean Davis (25k), National Institutes of Health, Bethesda, MD, wrote:

I'd suggest taking a look at a paper like "Characterizing complex structural variation in germline and somatic genomes". It sounds like you have done a basic read-count-based analysis. You may want to look at paired-end mapping (PEM) methods as well as split-read methods to be complete. Since you have a single tumor/normal pair, you'll want to identify genes that appear to be activated or inactivated by somatic SNVs, small indels, and structural rearrangements, not just those with copy number changes. I'd suggest a focus on those techniques since they are likely to be more actionable in the setting of a single sample. As you increase your sample size, you can then apply techniques such as GISTIC to better define regions of shared copy number states. Note that GSEA on copy number data from a single sample is not likely to be fruitful, since the vast majority of genes in such copy-number-variable regions are likely passengers and perhaps only half of them are even expressed.

written 7.0 years ago by Sean Davis (25k)

Hi Sean, do you know how to apply GISTIC to NGS data? I have the segmented file, but how do I get the number of markers and the marker file?
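
One workaround sometimes used when segments come from sequencing rather than an array is to treat every segment boundary as a pseudo-marker and count how many of those fall inside each segment. The sketch below shows that idea in plain Python. It is an assumption, not something confirmed in this thread: it presumes a marker file with three columns (marker name, chromosome, position) and a segmentation file with six columns (sample, chromosome, start, end, number of markers, segment value), so check the layout against the GISTIC documentation before relying on it. The function names and toy segments are invented for illustration.

```python
from bisect import bisect_left, bisect_right


def build_markers(segments):
    """Create one pseudo-marker per unique segment boundary.

    segments: list of (sample, chrom, start, end, seg_value) tuples.
    Returns a list of (marker_name, chrom, position) tuples.
    """
    positions = sorted(
        {(chrom, pos) for _, chrom, start, end, _ in segments for pos in (start, end)}
    )
    return [(f"m{i + 1}", chrom, pos) for i, (chrom, pos) in enumerate(positions)]


def add_marker_counts(segments, markers):
    """Return segments with a marker count column inserted before the segment value."""
    by_chrom = {}
    for _, chrom, pos in markers:
        by_chrom.setdefault(chrom, []).append(pos)
    for pos_list in by_chrom.values():
        pos_list.sort()

    rows = []
    for sample, chrom, start, end, value in segments:
        pos_list = by_chrom.get(chrom, [])
        # Count pseudo-markers with start <= position <= end.
        n = bisect_right(pos_list, end) - bisect_left(pos_list, start)
        rows.append((sample, chrom, start, end, n, value))
    return rows


# Toy segments for two hypothetical samples.
segments = [
    ("T1", "1", 1000, 5000, 0.8),
    ("T1", "1", 5001, 9000, -0.6),
    ("T2", "1", 1000, 7000, 0.3),
]

markers = build_markers(segments)
rows = add_marker_counts(segments, markers)

for row in rows:
    print("\t".join(str(field) for field in row))
```

Because boundaries are pooled across all samples, a segment in one sample picks up extra markers wherever another sample has a breakpoint inside it, which is why the third toy segment ends up with more markers than its own two endpoints.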

written 3.3 years ago by rse90