Question: RNA-Seq and Microarray Classification
7.2 years ago by
University of Southampton
michaelsbreen180 wrote:

Hi All,

I am interested in developing a prognostic classifier in the following way:

  1. Use a large RNA-Seq cohort with SVM and LOOCV with RFE to develop a prognostic classifier that is able to distinguish between two phenotypes.
  2. I want to take this classifier and test it on a separate microarray cohort to see how well these genes discriminate between the same two phenotypes as in the RNA-Seq data.

Since the distributions of the two data types differ, we simply apply a variance-stabilizing transformation to the RNA-Seq counts, which makes them closer to normally distributed rather than negative binomial.

The first point above is straightforward. My question concerns the second point. How can one take a classifier that was developed and validated on one cohort (RNA-Seq) to distinguish between two phenotypes, and then test the accuracy of that multi-gene classifier at distinguishing the same two phenotypes in another cohort (microarray)?

Yours,

M.B.

ADD COMMENTlink modified 7.2 years ago by Charles Warden8.0k • written 7.2 years ago by michaelsbreen180

This is probably not a solution to your problem, but have a look at this paper, where they also have to address the multiplatform issue. Maybe that'll at least provide a springboard for a solution.

ADD REPLYlink modified 7.2 years ago • written 7.2 years ago by Devon Ryan98k

Classification with binary gene expression was one idea. I see it was put out there in 2009, but I was completely unaware of this paper. Cheers, this may prove useful for us.

ADD REPLYlink written 7.2 years ago by michaelsbreen180

If you switch classification methods to work with a score for relative expression between two groups, this might also help.

For example, I was able to apply a classifier from a microarray dataset to an RNA-Seq dataset using BD-Func:

https://peerj.com/articles/159/
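
To illustrate the general idea of a relative-expression score (this is a rough sketch, not the BD-Func implementation), one can rank genes within each sample and compare the mean rank of genes expected to be up in a phenotype against those expected to be down. Because only within-sample ordering is used, the score does not depend on the platform's scale. The function names and the up/down gene lists here are hypothetical.

```python
def rank_fractions(values):
    """Within-sample ranks scaled to [0, 1]; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2.0
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    denom = max(len(values) - 1, 1)
    return [r / denom for r in ranks]


def relative_score(sample, up_genes, down_genes):
    """sample: dict of gene -> expression for ONE sample (any platform).

    Returns mean scaled rank of up_genes minus mean scaled rank of down_genes.
    """
    genes = sorted(sample)
    fracs = dict(zip(genes, rank_fractions([sample[g] for g in genes])))
    up = [fracs[g] for g in up_genes if g in fracs]
    down = [fracs[g] for g in down_genes if g in fracs]
    if not up or not down:
        raise ValueError("need at least one up and one down gene in the sample")
    return sum(up) / len(up) - sum(down) / len(down)


# Toy data (made up): counts on one platform, log intensities on the other.
rnaseq_sample = {"A": 500, "B": 20, "C": 300, "D": 5}
array_sample = {"A": 11.2, "B": 6.1, "C": 9.8, "D": 4.0}
score_seq = relative_score(rnaseq_sample, ["A", "C"], ["B", "D"])
score_array = relative_score(array_sample, ["A", "C"], ["B", "D"])
```

In this toy case both samples get the same score because the gene ordering is identical, even though the raw values are on very different scales. A threshold on the score would still need to be chosen on the training cohort.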

ADD REPLYlink written 7.2 years ago by Charles Warden8.0k

Cheers guys. I'll meditate a bit on these approaches and test them out. I still find it quite surprising, though, that you cannot simply take an expression-based classifier from one study and apply it to another to see how it performs.

ADD REPLYlink written 7.2 years ago by michaelsbreen180
7.2 years ago by
Charles Warden8.0k
Duarte, CA
Charles Warden8.0k wrote:

I think the answer is that this is unfortunately very hard to do.

The classifier must have the same sort of input for training and validation. You can try quantile normalizing the RNA-Seq and microarray data, but I have found this typically doesn't work very well (however, it is still the best suggestion that I can come up with).
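
For reference, a minimal sketch of one way to quantile normalize across platforms: build a reference distribution from the training samples, then force each new sample onto it by rank. It assumes every sample has the same genes in the same order, and the helper names are made up for illustration. As noted above, this often does not work very well in practice.

```python
def reference_distribution(train_samples):
    """Mean of the sorted values across training samples (equal-length lists)."""
    sorted_cols = [sorted(s) for s in train_samples]
    n = len(sorted_cols[0])
    if any(len(col) != n for col in sorted_cols):
        raise ValueError("all samples must cover the same genes")
    return [sum(col[i] for col in sorted_cols) / len(sorted_cols) for i in range(n)]


def quantile_map(sample, reference):
    """Replace each value with the reference value at the same rank.

    Ties are broken by original position (sorted() is stable), which is a
    simplification; averaging tied ranks is a common alternative.
    """
    if len(sample) != len(reference):
        raise ValueError("sample and reference must have the same length")
    order = sorted(range(len(sample)), key=lambda i: sample[i])
    out = [0.0] * len(sample)
    for rank, idx in enumerate(order):
        out[idx] = reference[rank]
    return out


# Toy data (made up): two transformed RNA-Seq training samples, one array sample.
train = [[5.0, 1.0, 3.0], [4.0, 2.0, 6.0]]
reference = reference_distribution(train)
array_mapped = quantile_map([9.1, 7.0, 12.3], reference)
```

After mapping, the microarray sample has exactly the same marginal distribution as the reference, but nothing guarantees that individual genes behave the same way on both platforms.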

ADD COMMENTlink written 7.2 years ago by Charles Warden8.0k

Thanks. I wouldn't want to normalize both of them, though. My impression is that, to make the two distributions reasonably similar, the RNA-Seq data need a variance-stabilizing transformation followed by a covariate shift correction. These methods are still being worked out, however.

ADD REPLYlink written 7.2 years ago by michaelsbreen180

Hi Michael, I'm actually addressing the exact same question, going from training on RNA-Seq to testing on microarray. I can confirm that this looks like an open question. I've had some success applying voom to the RNA-Seq data (from the limma package; it log-transforms the data while accounting for different library sizes). Did you try this as well?
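
As a rough illustration of the log-transformation step only (voom itself is an R function and also estimates precision weights, which are not reproduced here), the log-counts-per-million for one sample could be sketched as below. The offsets of 0.5 and 1 follow the log-CPM definition as I understand it from the voom description; treat them as an assumption and check the limma documentation.

```python
import math


def log_cpm(counts):
    """log2 counts-per-million for ONE sample's raw counts.

    The 0.5 offset avoids log(0); the +1 on the library size keeps the
    ratio strictly below 1. Both offsets are assumptions, see the text.
    """
    lib_size = sum(counts)
    return [math.log2((c + 0.5) / (lib_size + 1.0) * 1e6) for c in counts]


# Toy counts (made up) for three genes in one library of 999 reads.
values = log_cpm([0, 10, 989])
```

The result is on a log2 scale, which is at least the same kind of scale as typical normalized microarray intensities, although the two are still not directly comparable gene by gene.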

ADD REPLYlink written 7.2 years ago by Hayssam280