How to find out the significant gene from GDS.soft file by using bioconductor
1
1
Entering edit mode
9.5 years ago
ches51311 ▴ 10

now I have the microarray data from ncbi website http://www.ncbi.nlm.nih.gov/sites/GDSbrowser?acc=GDS4974

and I want to use the bioconductor code to find the significant gene

and now I can use GEOquery libray to load GDS4974.soft data into R (http://www2.warwick.ac.uk/fac/sci/moac/people/students/peter_cock/r/geo/)

and now I want to normalized this microarray(GDS4974.soft) data, but don't know which bioconductor code I can use to normalize it

can anyone tell me which bioconductor code can I use to normalized it?

and if the next step of normalized is using two sample t test? because after normalized it , I want to using two sample t test

ChIP-Seq R • 3.6k views
ADD COMMENT
0
Entering edit mode

If it's already on GEO, why not just use GEO2R? You can even see the exact R code that it uses.

ADD REPLY
0
Entering edit mode

If you want to use t test, you can just use tools available at GEO Datasets. At the bottom of GEO Datasets page there is an overlap "Compare 2 sets of samples". Then, you have to assign samples to two groups, which you would like to compare. At the end, choose the type of t test and significance level. As a results you get significantly expressed genes.

ADD REPLY
0
Entering edit mode

thank you, I find the t test at GEO datasets

ADD REPLY
0
Entering edit mode

I have two posts related to this problem. Use GEOquery bioconductor package http://crazyhottommy.blogspot.com/search/label/GEO

ADD REPLY
0
Entering edit mode
9.5 years ago

GEO datasets are already normalized by the submitter. Roughly, the steps are:

library(GEOquery)
gds = getGEO('GDS4974')
eset = GDS2eSet(gds)

Now, you can follow the limma package user guide to do differential expression.

ADD COMMENT
0
Entering edit mode

I'm just use this code to load GDS4974

but it appear these message and not success

Error in validObject(.Object) :
  invalid class "ExpressionSet" object: featureNames differ between assayData and featureData
ADD REPLY
0
Entering edit mode

Sorry you are having problems. What is the output of sessionInfo()?

ADD REPLY
0
Entering edit mode

I think that if you need to perform just a simple t test for differential expression you should simply use tools available @GEO. Otherwise, DE analysis in Bioconductor is not so easy, especially for beginners (believe me, I've tried it not so long time ago). Firstly, you need to normalize raw data, know all about the normalization method, which you would like to use, and so on, then you are able to find your DEGs. Of course, there is limma package manual where everything provided should be clear. Perhaps, GEO tools are enough for your purpose?

ADD REPLY

Login before adding your answer.

Traffic: 1413 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6