What methylation values are usually supplied within ncbi geo 450k series matrices?
I am going to automatically download a lot of GSE (~80) matrices and to use them in my analysis. Before this I want to double check here if I understand correctly what data is within those matrices.
I know that they contain sample, contact information, etc. But what about methylation data? Are those values already normalised, batch corrected? Or does this vary between each study?
Sean, thank you for the answer!:)
One more quick question: in this fresh draft by Kasper D. Hansen they have analysed 450k data. For 2 datasets they have downloaded IDAT files and then normalised them. For one dataset they didn't apply normalisation, because only methylated, unmethylated intensities were available (Material and Methods - Processing of the methylation data).
I guess that K. Hansen knows what he does, but will this work for every dataset? For example: If I download only methylated/unmethylated values can I skip normalisation and go on further?
You'll need to evaluate the need for normalization on a case-by-case basis. Typically, values deposited in GEO have been normalized, but the effectiveness of that normalization for your needs may vary.