Question: Methods For Comparing Microarrays From Different Datasets
7.7 years ago, Adam Cornwell (United States) wrote:

I often run into situations that fall outside the realm of most existing microarray meta-analysis solutions: I have two sets of arrays to compare (say, RNA from a particular cell type vs. whole tissue), but the two sets come from different datasets and sometimes different platforms. Most of the time a direct comparison is not appropriate, because the variability due to the batch effect is greater than that due to the biology. Batch-effect compensation methods such as ComBat aren't appropriate either, as the batch effect and the target variable of interest are confounded.

So far, I've been normalizing the datasets separately and then comparing them using RankProd. I'd like to try a different method because I've had some complaints about unexpected results in my output gene lists, and I'd like to make sure that the output from multiple methods correlates reasonably well so that I can have some confidence in presenting the results. After a decent search, I haven't come up with much aside from RankProd and METRADISC that is actually semi-advertised as being able to handle this sort of scenario, and the latter is also rank-based.
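To make the rank-based idea concrete, here is a minimal sketch of a rank-product statistic in plain Python. It is not the RankProd package's implementation: it assumes both sets are already on a log2 scale over the same genes in the same order, ranks genes within every cross-set pair of samples, and omits the permutation step that a real analysis would need to judge significance. The function names `rank_descending` and `rank_product` are made up for illustration.

```python
import math


def rank_descending(values):
    """Return 1-based ranks; rank 1 goes to the largest value.

    Ties are broken by original position, which is a simplification.
    """
    order = sorted(range(len(values)), key=lambda i: -values[i])
    ranks = [0] * len(values)
    for position, index in enumerate(order, start=1):
        ranks[index] = position
    return ranks


def rank_product(set_a, set_b):
    """Geometric mean of per-gene ranks over all (a, b) sample pairs.

    set_a, set_b: lists of samples, each a list of log2 expression values
    for the same genes in the same order. A small result means the gene is
    consistently higher in set_a than in set_b.
    """
    n_genes = len(set_a[0])
    log_rank_sums = [0.0] * n_genes
    n_pairs = 0
    for a in set_a:
        for b in set_b:
            # Difference of log values = log ratio for this pair of arrays.
            diffs = [x - y for x, y in zip(a, b)]
            for gene, rank in enumerate(rank_descending(diffs)):
                log_rank_sums[gene] += math.log(rank)
            n_pairs += 1
    return [math.exp(total / n_pairs) for total in log_rank_sums]
```

Because only the within-pair ordering of genes is used, a constant shift between the two datasets does not change the result, which is the appeal of rank-based methods here.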

I'm starting to get to the point of just wanting to try something that sounds crazy, like normalizing separately, combining, median-centering/scaling (when there's more than one dataset involved), and then transforming to POE (MetaArray package) and following with differential expression testing. Would that
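As a rough illustration of the median-centering/scaling step only, a sketch in plain Python might look like the following. It assumes centering is done per array (one sample at a time) and that scaling uses the median absolute deviation; the post does not say which variant is meant, so both choices are assumptions, and the POE transform from MetaArray is not reproduced. The function names are made up for illustration.

```python
from statistics import median


def median_center_scale(sample):
    """Center one array's log-expression values on their median and
    divide by the median absolute deviation (MAD)."""
    center = median(sample)
    mad = median(abs(value - center) for value in sample)
    if mad == 0:
        raise ValueError("MAD is zero; cannot scale this sample")
    return [(value - center) / mad for value in sample]


def center_scale_dataset(dataset):
    """Apply the per-array transform to every sample in one dataset."""
    return [median_center_scale(sample) for sample in dataset]
```

After this step every array has median 0 and MAD 1, so arrays from separately normalized datasets sit on a comparable scale before they are combined.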

Is it appropriate to use an effect size-based method as implemented in GeneMeta for this sort of thing?
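For reference, the general shape of an effect-size approach can be sketched as below: a bias-corrected standardized mean difference (Hedges' g) per gene per study, combined across studies with inverse-variance weights. This is a generic fixed-effect sketch, not GeneMeta's code, and the function names are made up. Note that it assumes each study contains samples from both groups, since the effect size is computed within a study.

```python
import math
from statistics import mean, variance


def hedges_g(group1, group2):
    """Return (g, var_g) for one gene in one study."""
    n1, n2 = len(group1), len(group2)
    pooled_sd = math.sqrt(
        ((n1 - 1) * variance(group1) + (n2 - 1) * variance(group2))
        / (n1 + n2 - 2)
    )
    d = (mean(group1) - mean(group2)) / pooled_sd
    # Small-sample bias correction factor.
    correction = 1 - 3 / (4 * (n1 + n2) - 9)
    g = correction * d
    var_g = (n1 + n2) / (n1 * n2) + g ** 2 / (2 * (n1 + n2))
    return g, var_g


def fixed_effect_combine(effects):
    """Inverse-variance weighted mean of (g, var_g) pairs across studies."""
    weights = [1 / var for _, var in effects]
    total = sum(weights)
    combined = sum(w * g for w, (g, _) in zip(weights, effects)) / total
    return combined, 1 / total
```

A random-effects version would add a between-study variance term to each weight; that is left out to keep the sketch short.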

I've been holding off on experimenting with this until finding a more solid answer, but it seems like I really need to make some progress on this soon.


I highlighted the question; it was a little bit lost in the text :)

written 7.7 years ago by Neilfws

Thanks. Ultimately it's about soliciting suggestions for dealing with such a scenario. It really seems like I need to practice writing effective questions!

written 7.7 years ago by Adam Cornwell
7.7 years ago, Houkto wrote:

Hi Adam,

There are approaches such as meta-analysis across different experiments published online; however, they seem sophisticated to me. RankProd was suggested to me by the author, but I did not have the time or the experience to customise it to my needs. What I usually do is normalize all the datasets separately and then run a cluster analysis in BioLayout, giving every microarray a common unique ID (for example RefSeq, if the arrays are not from the same Affymetrix version or are from a different platform). I then look for a cluster whose expression pattern is of interest across the different expression datasets. Note that using a unique ID such as RefSeq will reduce the number of genes that you can test.
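The ID-matching step might be sketched like this in plain Python. The probe names, ID strings, and the choice of the median to collapse several probes onto one ID are all assumptions for illustration; in practice the probe-to-ID mapping would come from each platform's annotation.

```python
from statistics import median


def collapse_to_ids(probe_values, probe_to_id):
    """Collapse probe-level values to one value per shared ID.

    probe_values: dict of probe name -> expression value (one sample).
    probe_to_id: dict of probe name -> common ID (e.g. a RefSeq accession).
    Probes without a mapping are dropped; multiple probes for the same ID
    are summarized by their median.
    """
    grouped = {}
    for probe, value in probe_values.items():
        common_id = probe_to_id.get(probe)
        if common_id is None:
            continue  # unmapped probe, cannot be compared across platforms
        grouped.setdefault(common_id, []).append(value)
    return {common_id: median(vals) for common_id, vals in grouped.items()}


def shared_ids(*datasets):
    """Return the sorted IDs present in every collapsed dataset."""
    common = set(datasets[0])
    for dataset in datasets[1:]:
        common &= set(dataset)
    return sorted(common)
```

Only IDs returned by `shared_ids` can be compared across platforms, which is where the drop in the number of testable genes comes from.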

I hope this will be handy.

7.7 years ago, ewre (United States) wrote:

Data heterogeneity is a central problem with microarray data from different sources. There are methods that try to scale datasets from different labs using control probes or housekeeping-gene probes on the same platform, but they have little effect.
