How can I determine whether a sample has undergone a whole-genome duplication event?
8 days ago
Hyper_Odin ▴ 280

I have copy number data, and for each chromosome I take the mean major-allele copy number across its segments.

The method is published here: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6072608/

My script is below. Is taking the mean for each chromosome the correct approach?

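library(dplyr)

# Mean major-allele copy number per autosome.
# Note: this is an unweighted mean, so every segment counts equally regardless of length.
major_cn <- diffa %>%
  filter(chromosome %in% 1:22) %>%   # matches labels like 1 or "1", but not "chr1"
  group_by(chromosome) %>%
  summarize(major_cn = mean(nMajor))

# Proportion of autosomes whose mean major copy number is at least 2
prop_major_cn_two_or_more <- mean(major_cn$major_cn >= 2)

if (prop_major_cn_two_or_more >= 0.5) {
  cat("whole genome duplication.\n")
} else {
  cat("no evidence\n")
}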
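
For comparison, if the criterion is read as a fraction of the autosomal genome rather than a fraction of chromosomes, a length-weighted version might look like the sketch below. It assumes each row of the table is one segment with startpos and endpos columns (hypothetical names, not in my script above); wgd_fraction, the strict > 0.5 cutoff, and the toy data are made up for illustration only.

```r
# Sketch: fraction of the autosomal genome with major copy number >= 2.
# ASSUMPTION: startpos/endpos columns exist; chromosome and nMajor follow the script above.
wgd_fraction <- function(segments, min_major_cn = 2) {
  # Keep autosomes only (labels 1..22, numeric or character)
  autosomal <- segments[segments$chromosome %in% 1:22, ]
  # Segment length in base pairs, used as the weight
  seg_len <- autosomal$endpos - autosomal$startpos + 1
  # Bases covered by segments at or above the threshold, over all autosomal bases
  sum(seg_len[autosomal$nMajor >= min_major_cn]) / sum(seg_len)
}

# Toy segments (made-up values, only to show the call)
toy <- data.frame(
  chromosome = c("1", "1", "2", "X"),
  startpos   = c(1, 101, 1, 1),
  endpos     = c(100, 400, 500, 1000),
  nMajor     = c(1, 2, 2, 3)
)

frac <- wgd_fraction(toy)
cat(if (frac > 0.5) "whole genome duplication\n" else "no evidence\n")
```

Weighting by segment length means a chromosome with many short segments does not count more than one with a few long segments, which is the main way this differs from the per-chromosome mean.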

Thanks

sequencing wgs • 153 views