Question

Simulated microarray expression data

1

8.0 years ago

Jan-Niklas ▴ 30

Dear all,

I want to compare the performance of several gene set analysis methods. Because there is no gold standard for this, I want to simulate my own expression data. The simulated data should be a good approximation of real biological data, with its complex characteristics and distributions. Genes should be modeled as known correlated blocks, which can then be identified by the gene set analysis methods so that detection rates can be estimated.

I found the Umpire R package, which looks promising, but an annotation of which gene sets are up- and down-regulated seems to be missing. Does anybody have experience working with Umpire, know a different tool for this purpose, or know of a paper that describes a workflow for simulating expression data?

With best regards,

Jan-Niklas

R microarray simulation expression data • 2.6k views

ADD COMMENT • link updated 5.9 years ago by cpad0112 21k • written 8.0 years ago by Jan-Niklas ▴ 30

0

Hello Jan,

Did you manage to figure this out? I have just installed the Umpire package, but I'm not sure how to go about it. Any help would be appreciated.

ADD REPLY • link 7.8 years ago by lchaba • 0

zx8754 · Answer 1 · 2016-05-20

2

8.0 years ago

Benn 8.3k

"Ain't nothing but the real thing..." - Marvin Gaye

There is a huge database called GEO, full of the real stuff. Why not try it with real data?

ADD COMMENT • link updated 5.9 years ago by zx8754 11k • written 8.0 years ago by Benn 8.3k

1

Heard about that. :) But for all these datasets the ground truth is unknown. Sure, you can pick datasets that study a specific phenotype and assume that pathways associated with this phenotype show a significant correlation. But ultimately I would like to simulate data with a known ground truth.

ADD REPLY • link 8.0 years ago by Jan-Niklas ▴ 30

zx8754 · Answer 2 · 2018-06-15

1

5.9 years ago

cpad0112 21k

For future reference, try:

RUVcorr package and the corresponding simulation page: https://rdrr.io/bioc/RUVcorr/man/simulateGEdata.html
sgnesR package: download from GitHub (it is not in the CRAN or Bioconductor repositories). There were no installation instructions on GitHub as of 15 June 2018: https://github.com/shaileshtripathi/sgnesR
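
For readers who only want to see the idea from the question (correlated gene blocks plus a recorded up/down annotation), here is a minimal sketch in plain Python. It is not the API of Umpire, RUVcorr, or sgnesR. The function name `simulate_expression`, its parameters, and the one-latent-factor Gaussian model are all assumptions made for illustration, and the model is far simpler than real microarray data.

```python
import random


def simulate_expression(n_per_group=20, block_size=10,
                        effects=(1.5, -1.5, 0.0, 0.0), rho=0.6, seed=0):
    """Hypothetical helper: simulate log-scale expression with known truth.

    Returns (data, labels, gene_sets, truth) where
      data[g][s]    expression of gene g in sample s
      labels[s]     0 = control, 1 = case
      gene_sets[b]  list of gene indices belonging to block b
      truth[b]      'up', 'down' or 'none' for block b
    """
    rng = random.Random(seed)
    labels = [0] * n_per_group + [1] * n_per_group
    data, gene_sets, truth = [], {}, {}
    for b, effect in enumerate(effects):
        truth[b] = "up" if effect > 0 else "down" if effect < 0 else "none"
        gene_sets[b] = []
        # One shared latent value per sample gives every pair of genes
        # in the block an expected correlation of rho (within a group).
        factor = [rng.gauss(0, 1) for _ in labels]
        for _ in range(block_size):
            row = []
            for s, lab in enumerate(labels):
                noise = rng.gauss(0, 1)
                value = rho ** 0.5 * factor[s] + (1 - rho) ** 0.5 * noise
                # Shift only the case samples by the block's effect size.
                row.append(value + effect * lab)
            gene_sets[b].append(len(data))
            data.append(row)
    return data, labels, gene_sets, truth


def block_shift(data, labels, genes):
    """Mean case-minus-control difference over the genes of one block."""
    case = [data[g][s] for g in genes for s, l in enumerate(labels) if l == 1]
    ctrl = [data[g][s] for g in genes for s, l in enumerate(labels) if l == 0]
    return sum(case) / len(case) - sum(ctrl) / len(ctrl)
```

The returned `truth` dictionary is the annotation the question asks for: a gene set method can be run on `data` and `labels`, and its calls compared against `truth` to estimate detection rates. Swapping the Gaussian noise for something heavier-tailed, or adding batch effects, would be needed before treating the output as realistic.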

ADD COMMENT • link updated 5.9 years ago by zx8754 11k • written 5.9 years ago by cpad0112 21k