Tcga Illumina Methylation Combining 27K And 450K
1
1
Entering edit mode
10.1 years ago
devolver1 ▴ 20

I'd like to use a TCGA methylation dataset but some of the data is from the 27k Illumina platform and some from the 450k. Is there a simple script to take only the common features? I read about the type I and type II probes in the 450k array and how to correct the type II probes but that doesn't address if it's even possible to take the common probes on the 27k and 450k platforms. Thank you.

tcga methylation illumina • 7.9k views
ADD COMMENT
1
Entering edit mode

I'd be tempted to use only the 450k since it includes 90% of the 27k probes and has far greater coverage. Unless there is some reason for using both, e.g. to compare the platforms?

ADD REPLY
0
Entering edit mode

There are only so many subjects that TCGA has for any particular tumor type. So if I stick to only the 450 array then I get to use only half of the subject data and for machine learning purposes that is a big loss. If I take the common probes for all subjects, 90 % common doesn't sound too bad. Thank you for the response.

ADD REPLY
0
Entering edit mode

What correction did you performed on type II probes? I'm using the same data but I assumed that being them level 3 data, no other correction was needed. I wrote a simple script to extract common features but after reading your post I started wondering if I'm doing something wrong.

ADD REPLY
0
Entering edit mode
10.1 years ago

The methylumi package has a method for MethyLumiSet objects, combine27k450k, that may do what you want.

ADD COMMENT

Login before adding your answer.

Traffic: 2708 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6