Question: Finding Differentially Expressed Genes Across Multiple Cell Types In A Datset
2
gravatar for Bioinfo_2006
6.3 years ago by
Bioinfo_200690
Bioinfo_200690 wrote:

I have a transcriptome dataset from affymetrix microarray and has multiple cell types which were probed. I would like to obtain the differentially expressed genes from each of the cell types (i.e significant transcripts) using packages in R. My data is in the form of a matrix with genes and samples as rows and columns respectively. I tried using the csSAM package but did not quite know how to do it. Is there any other way or package to do this ?

R differential-expression • 4.2k views
ADD COMMENTlink modified 6.3 years ago by Sean Davis25k • written 6.3 years ago by Bioinfo_200690

Nobody knows "quite know how to do it" the first time :) With what, precisely, are you having trouble? Have you read the documentation? Is there an example or tutorial that you are trying to follow? Are you seeing an error message that you don't understand? It's easier to help if your question is better defined.

There are lots of R packages for this task - limma is popular and well-documented - but in all cases you need to make some effort to figure out how they work.

ADD REPLYlink modified 6.3 years ago • written 6.3 years ago by Neilfws48k

True. I should have given a clearer example.

My dataset looks this way:

Probes  celltype1  celltype2 celltype3 celltype4

gene1   5.098      4.677      7.456      8.564  
gene2   8.906      6.653      6.754      7.546
gene3   7.409      5.432      6.724      5.345
gene4   4.876      5.981      4.290      4.267
gene4   3.567      3.425      8.564      6.345 
gene5   2.569      8.645      5.234      7.345

The dimension is 22810(genes) x 20(cell types).

I tried to use the csSAM package in R.

csSamWrapper(G, cc, y, nperms = 200,alternative = "two.sided", standardize = TRUE,
medianCenter = TRUE, logRm = FALSE, logBase = 2,nonNeg = TRUE, fileName = "csSAMout.pdf")

where G is my expression matrix where I have the genes as rows and cell types as columns. and cc stands for the Matrix of cell-frequency. (n by k, n samples, k cell-types).

The example from the package had several samples.

I am not clear as how to construct the cell frequency matrix because I have only one sample.

ADD REPLYlink modified 6.3 years ago by Neilfws48k • written 6.3 years ago by Bioinfo_200690

With only one sample per cell type, you are in uncharted territory. What types of arrays were used to generate these data?

ADD REPLYlink written 6.3 years ago by Sean Davis25k

Affymetrix arrays ( Arabidopsis ATH1 chip)

ADD REPLYlink written 6.3 years ago by Bioinfo_200690

Are you sure the data you have shown is one sample per celltype i,e, only one column for each cell type or is it some sought of average you are using??

ADD REPLYlink written 6.3 years ago by k.nirmalraman1000

Yes, it is after the average of three replicates

ADD REPLYlink written 6.3 years ago by Bioinfo_200690
2

You'll definitely want to get the data in the un-averaged form. With the averaged replicates, you will be unable to apply statistical methods to determine differential expression (since you have no replicates). After you have the data with three replicates per cell type, I will refer to my answer below for multi-group comparison using limma.

ADD REPLYlink written 6.3 years ago by Sean Davis25k

Thank you. Yes, I am trying out different contrasts that I could apply to get what I want. I think Limma would be the best option to do this.

ADD REPLYlink written 6.3 years ago by Bioinfo_200690

So if I try this particular contrast as follows: I have cell types A, B ,C,D I'd use limma and define a contrast like A - (B+C+D)/3", contrast.matrix<-makeContrasts("p1-(p2+p3+p4)/3",levels=design) I tried this and got some DEG's. So will that be those ones only in A and not in others ?

ADD REPLYlink written 6.3 years ago by Bioinfo_200690
2
gravatar for Sean Davis
6.3 years ago by
Sean Davis25k
National Institutes of Health, Bethesda, MD
Sean Davis25k wrote:

Check out section 8.3 in the limma user guide here:

http://www.bioconductor.org/packages/release/bioc/vignettes/limma/inst/doc/usersguide.pdf

You'll probably want (need) to read the user guide and help pages associated with limma functions, also.

ADD COMMENTlink written 6.3 years ago by Sean Davis25k
1
gravatar for k.nirmalraman
6.3 years ago by
k.nirmalraman1000
Germany
k.nirmalraman1000 wrote:

There is a little bit of discussion Rna Seq Cell Type Specificity that concerns this question though raised for RNA Seq.... I am sure these methods can be used for microarrays as well

ADD COMMENTlink written 6.3 years ago by k.nirmalraman1000
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1496 users visited in the last hour