normalizing methods for microarray
2
0
Entering edit mode
8.6 years ago

Hi

If I want to merge some GSEs from GEO, what types of normalizing methods do you recommend me, FRMA or SCAN?

I think, I should notice that my datasets are from different platform,and I'm looking for DEGs

R bioconductor statistics • 1.6k views
ADD COMMENT
1
Entering edit mode
8.6 years ago

Combining data is not an easy task, and it should only be undertaken if it provides a substantial benefit. Often approaches where one dataset is analysed (pilot), and a separate dataset is analysed (followup), are more beneficial than artificially combining datasets.

If you still insist on merging datasets from GEO, you'll need to try and get the raw data. Raw data is preferable as you can apply your own normalisation technique, and keep things consistent (normalised GEO datasets could be using different methods between experiments). Alternatively you can use the two normalised expression sets and use an additive model to try and account for the variation.

Using additive models requires that you have the same sample types across experiments to accurately estimate the between dataset variance. Additionally you need to map up probes in some way, nuIDs are probably the best method as they're then using the same nucleotide sequence.

As for normalisation method, that'll depend on what approach you want to take, but it also depends on the platform the dataset is derived from.

ADD COMMENT
1
Entering edit mode
8.6 years ago

There is not a general normalization procedure that will make your data comparable across platforms. You'll need to determine whether the experimental designs of the various GSEs are compatible with the questions you want to ask. To be concrete, comparing sample type A in GSE1 with sample type B in GSE2 is probably not justifiable without great care and thought (though people have been known to try it). Comparing sample types A and B, both represented in both platforms is much easier to justify, though it is still difficult to interpret results in many cases; in this case, using the single "best" dataset may be simpler and give meaningful results.

ADD COMMENT

Login before adding your answer.

Traffic: 2720 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6