Question: CNV analysis using the ExomeDepth R Package
0
gravatar for cafelumiere12
4.1 years ago by
United States
cafelumiere1270 wrote:

Hi all,

I'm looking into using exomeDepth R package for CNV analysis; however, I have a couple questions from reading the manual  (https://cran.r-project.org/web/packages/ExomeDepth/vignettes/ExomeDepth-vignette.pdf) regarding building the reference set (page.5) 

Here is what the manual suggested:

my.matrix <- as.matrix( ExomeCount.dafr[, my.choice$reference.choice, drop = FALSE]) my.reference.selected <- apply(X = my.matrix, MAR = 1, FUN = sum) 

Question #1:

In this test case there are 3 samples selected as the appropriate reference set. Why did they use FUN = sum instead of FUN = mean? Wouldn't that make the reference set counts a lot higher than the test set counts?

Question #2: 

The other question is regarding the reference set selection - it looks like from the manual all the samples other than the one being tested gone through the function select.reference.set . However, if I have 14 cancer samples and 2 normal samples - shall I only input 2 normal samples to select.reference.set ? Or shall I use all the 15 samples that is not the current tested one?

 

Thanks!!

exomedepth cnv R • 2.4k views
ADD COMMENTlink modified 4.1 years ago by Fabio Marroni2.5k • written 4.1 years ago by cafelumiere1270
0
gravatar for Fabio Marroni
4.1 years ago by
Fabio Marroni2.5k
Italy
Fabio Marroni2.5k wrote:

Question #1: Fair observation, but I guess that they normalize by the total number of reads so summing or averaging is not a big difference. You ALWAYS need to normalize for the different total throughput of different experiments.

Question #2: It depends. At page 5 of the manual they say: "A key idea behing ExomeDepth is that each exome should not be compared to all other exomes but rather to an optimized set of exomes that are well correlated with that exome". My first choice would be to use the healthy tissues as reference. The best idea is always to compare cancer tissue of a patient with his blood (or at least with healthy tissue from the same patient.). The second choice is to compare cancer samples with healthy tissue (even if they originate from different patients). I wouldn't use another cancer sample (even if it's a different kind of cancer) as a reference, since it might have the same CNVs of your sample, thus making you lose results and in addition cancerous tissues might have very messy CNV situations, thus biasing results.

ADD COMMENTlink modified 10 weeks ago by RamRS25k • written 4.1 years ago by Fabio Marroni2.5k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 684 users visited in the last hour