I was wondering whether it is feasible to work with multiple types of data from same patient in TCGA to reveal mechanism of a cancer in that particular patient using tumor and adjacent healthy tissue controls. I am planning to analyse each data type individually and merge findings in a higher-level (e.g. pathway most probably).
I have been reading on it a lot but could be able to find satisfactory answers to following:
- On TCGA data quality, what do you think on the reliability of data for individual analysis? I have received different opinions on this matter, so I'm a bit confused right now.
- For all types of data, since tumor and adjacent tissue controls are in the same batch in TCGA, how should I check for other possible biases? I guess batch effect correction becomes irrelevant in this case. This question particularly becomes relevant in array based methods, such as HumanMethylation450k. If tumor and control scanned in different arrays, how can I be sure that there is no additional bias coming from array scanning (except probably expecting low DMR count)?
I really appreciate your valuable input in these matters.
Thanks a lot!