Question

Rna-Seq And Microarray Classification

2

Entering edit mode

10.5 years ago

michaelsbreen1 ▴ 90

Hi All,

I am interested in developing a prognostic classifier in the following way:

Use a large RNA-Seq cohort with SVM and LOOCV with RFE to develop a prognostic classifier that is able to distinguish between two phenotypes.
I want take this classifier and test it on a different Microarray cohort to see how well these genes can discriminate between the same two phenotypes as described within the RNAseq data.

Since the distributions of these two data-types vary, we simply variance stabilize transform the counts which makes it more normal instead of neg. binomial.

The first point above is straightforward. My question is within the second point above. How can one take a classifier that was already independently created and validated to distinguish between two phenotypes using one cohort (RNAseq) but then test the accuracy of that multi-gene classifier to distinguish between the same two phenotypes using another cohort (microarray)?

Yours,

M.B.

rnaseq microarray classification • 3.6k views

ADD COMMENT • link updated 10.5 years ago by Charles Warden 8.2k • written 10.5 years ago by michaelsbreen1 ▴ 90

0

Entering edit mode

This is probably not a solution to your problem, but have a look at this paper, where they also have to address the multiplatform issue. Maybe that'll at least provide a springboard for a solution.

ADD REPLY • link 10.5 years ago by Devon Ryan 104k

0

Entering edit mode

Classification with binary gene-expression was an idea. I see it was put out there in 2009 but I was completely unaware of this paper. Cheers, this may provide useful for us.

ADD REPLY • link 10.5 years ago by michaelsbreen1 ▴ 90

0

Entering edit mode

If you switch classification methods to work with a score for relative expression between two groups, this might also help.

For example, I was able to apply a classifier from a microarray dataset to an RNA-Seq dataset using BD-Func:

https://peerj.com/articles/159/

ADD REPLY • link 10.5 years ago by Charles Warden 8.2k

0

Entering edit mode

Cheers guys. I'll meditate a bit on these approaches and test them out. I still think it is quite surprising though that still you can not take a classifier from one study and put it on to another, using expression, to see how it performs.

ADD REPLY • link 10.5 years ago by michaelsbreen1 ▴ 90

score 2 · Answer 1 · 2013-10-30

2

Entering edit mode

10.5 years ago

Charles Warden 8.2k

I think the answer is that this is unfortunately very hard to do.

The classifier must have the same sort of input for training and validation. You can try quantile normalizing the RNA-Seq and microarray data, but I have found this typically doesn't work very well (however, it is still the best suggestion that I can come up with).

ADD COMMENT • link 10.5 years ago by Charles Warden 8.2k

0

Entering edit mode

Thanks. Although, I wouldn't want to normalize the both them. It's my impression that in order to make the two distributions somewhat similar, RNASeq needs to be variance stabilise transformed and a covariate shift needs to be used in order to make the two distributions more similar. Although these methods are being working out.

ADD REPLY • link 10.5 years ago by michaelsbreen1 ▴ 90

0

Entering edit mode

Hi Michael, I'm actually addressing the exact same question, going from training on RNA-Seq to testing on micro-array. I confirm that this look like an open question. I've had some success by applying voom on the RNA-Seq data (from package limma, perform a log-transformation of the data by accounting for different library sizes). Did you tried this also?

ADD REPLY • link 10.5 years ago by Hayssam ▴ 280