Question: TCGA DNA methylation data pipeline
gravatar for yuabrahamliu
9 days ago by
yuabrahamliu40 wrote:

Hello everyone,

Maybe my questions are really easy to some experts, but as a new to TCGA data, I indeed feel confused. So if anyone could give me any ideas, I will be appreciated.

I want to do some analysis on the TCGA level-3 DNA methylation data from various cancer types. However, my question is that, did all the cancer types use the same preprocessing pipeline? And which normalization method did TCGA use? If different cancer types used different preprocessing pipelines, is there any good method to let the comparison among them feasible? Thank you so much.

Best, Yu

ADD COMMENTlink modified 7 days ago by Charles Warden6.1k • written 9 days ago by yuabrahamliu40
gravatar for Charles Warden
7 days ago by
Charles Warden6.1k
Duarte, CA
Charles Warden6.1k wrote:

That is a good question.

You may sometimes encounter the need to use different normalizations (starting from the raw data), if you get some sort of strange beta distribution (for example, something that is clearly not bimodal). However, unless they processed the chips separately (instead of all together and/or processed with each sample as a group in GenomeStudio), I think a very clear issue with the beta distributions probably would have been identified by now.

So, I know there are some things where the provided processed data gives good results, but I imagine it is possible that subsets of probes may benefit from an alternative normalization (and/or filtering) strategy.

I think there is supposed to be some more information on this page, but I believe there is currently an issue with the link for more details:

ADD COMMENTlink modified 7 days ago • written 7 days ago by Charles Warden6.1k

I see. Thank you for the help!

ADD REPLYlink written 7 days ago by yuabrahamliu40
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2035 users visited in the last hour