How do I use quantile normalized data to do differential expression analysis in RStudio?
1
0
Entering edit mode
3.9 years ago

Good Afternoon,

I am very new at RStudio and differential expression analysis and I am a bit lost about what to do.

I received 2 tables of gene expression on dataframe One with the studied cells, other with control group. The data doesn't come on rawcounts, but in quantile normalized data (with RMA). I tried but couldn't create a DESeq2 object and looks like I can only use rawcounts for it. I can't have access to the rawcounts and those 2 tables were all I got.

The authors say:

The .CEL files of control group were downloaded and data were normalized using the GeneSpring GX software (version 11, Agilent Technologies) and the Robust Multi-array Average (RMA) algorithm for summarization (which uses Perfect Match (PM)-only based measures) and quantile normalization, and then exported.

The "value" output == RMA normalized intensity values (no baseline transformation). I have, however, a *.csv file.

What should I first do with it? Could you help me with which steps and which package I can use? Now I've been checking limma, but I am having problems to find out how I can create the object for the package.

Best regards, Mari

RNA-Seq R gene • 2.0k views
ADD COMMENT
0
Entering edit mode

If it's all normalized I would first make sure that the two sets are normalized together (most genes have the same values) and then just use lm to test each gene.

ADD REPLY
1
Entering edit mode
3.9 years ago
ATpoint 81k

CEL files are the raw data that microarrays produce (from the company Agilent afaik). The RMA preprocessing is standard, so this is fine. Still, it is expected that all data that go into the same analysis have been processed together as the quantile normalization step is not an independent procedure per array but depends on which samples have been included or not. See this video for details on QN.

So what you have to do nowis to find why two tables exist. have they been processed together and then splitted? How many replicates are there? You need replicates for differential analysis. In the best case you have replicates and files have been processed together, so you would be good to go with the standard software called limma which has been developed for microarray differential analysis using a linear model-based frakework.

ADD COMMENT
0
Entering edit mode

Just to follow ATpoint's suggestions up with a couple of hopefully helpful links about microarray data analysis with R:

In short, limma is a package that provides functions to perform slightly more sophisticated tests than simple t-tests to determine whether a given gene is differentially expressed

ADD REPLY

Login before adding your answer.

Traffic: 2533 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6