Question: Group definition and logFC calculation in GEO2R
Kim wrote (12 months ago):

Hello everyone

I'm learning to analyze GEO data, and GEO2R is the most straightforward tool for me, but I'm having trouble understanding how GEO2R (or limma) calculates the logFC.

I read this page, https://wiki.bits.vib.be/index.php/Analyze_GEO_data_with_GEO2R, and noticed this:

"First define the treated group (it will be colored in blue), then define the control group (it will be colored in pink). The order is important for calculating log fold changes later in the analysis"

The result looks like this: https://ibb.co/CKdRkSz

It seems to me that GEO2R calculates logFC as log(control / treated) (or log(normal / tumor) in this case). However, I've read in several sources that logFC = log(treated / control).

This confuses me when I interpret gene expression levels. Could you please help explain this to me?

Thank you very much.

written 12 months ago by Kim • modified 4 months ago by cagdas
Kevin Blighe wrote (12 months ago):

The logFC (to be proper, it is log [base 2] fold change) calculation can have anything as the numerator and anything as the denominator. It is your role as the analyst to define these.

Consider the example for geneX:

  • Tumour geneX = 16
  • Normal geneX = 4

log2FC = log2(Tumour / Normal) = log2(16 / 4) = 2
log2FC = log2(Normal / Tumour) = log2(4 / 16) = -2

See the difference? Now, you choose what you want as the numerator and denominator.
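
To check the arithmetic yourself, here is a minimal sketch in base R. The variable names are made up for illustration, and the values are simply the geneX numbers from the example above. The last line shows the same result as a difference of log2 values, which is how it looks when the expression data are already on the log2 scale.

```r
# Hypothetical variable names; values taken from the geneX example above
tumour <- 16
normal <- 4

# Tumour as numerator, Normal as denominator
fc_tumour_over_normal <- log2(tumour / normal)

# Normal as numerator, Tumour as denominator: same magnitude, opposite sign
fc_normal_over_tumour <- log2(normal / tumour)

# On log2-scale data the ratio becomes a simple difference
fc_as_difference <- log2(tumour) - log2(normal)

print(c(fc_tumour_over_normal, fc_normal_over_tumour, fc_as_difference))
```

Swapping numerator and denominator only flips the sign, so the interpretation depends entirely on knowing which group is on top.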

Kevin

written 12 months ago by Kevin Blighe

Hi Kevin

Thank you for your comment. I agree that the choice of numerator and denominator is up to us. I was just confused because GEO2R always takes Pink / Blue (control / treatment), while some people say it is always treatment / control. But it's useful to know there's no fixed rule for the log2FC calculation.

written 12 months ago by Kim

I believe you can change the order in GEO2R, but I do not use it much.

written 12 months ago by Kevin Blighe

Hello,

I have two groups, pancreatic cancer and control, which I defined in GEO2R via the Define groups menu: 1. (blue): Pancreatic cancer; 2. (pink): Control.

According to your message, the result should be Pancreatic Cancer / Control. Is that right? I see the code below in the script generated by GEO2R. What does it mean, and are G1 and G0 related to the control or treated groups? Is G1 / G0 = Pancreatic Cancer / Control?

cont.matrix <- makeContrasts(G1-G0, levels=design)

gsms <- paste0("XXXXXXXXXXXXXXXXXXXXXX1111111111111111111111111110", "00000000000000000000000000000000000000000000000000", "00000000000000000000000000000000000000000000000000", "00000000000000000000000000000000000")

Thanks

written 4 months ago by cagdas

If you have skills in R, I would encourage you to avoid GEO2R and to write your own code instead.

I do not know what G1 and G0 relate to, but they could be cell cycle stages. I do know, however, that the code generated by GEO2R can be incorrect or misleading.

If you want further help, please at least post the GEO accession ID of this study.

Thank you!

written 4 months ago by Kevin Blighe

Hi Kevin,

Thank you for your reply. I think it is not related to cell cycle stage, because when I look at other IDs I see other numbers, such as G3-G4... The GEO ID is GSE24279. Thank you.

written 4 months ago by cagdas

I do not see anything related to G0-4 when I go to GSE24279. It is just a case-control study for pancreatic cancer.

I would just obtain the data in R, like this:

library(Biobase)
library(GEOquery)

# Load series and platform data from GEO
gset <- getGEO("GSE24279", GSEMatrix = TRUE, getGPL = FALSE)
# If several platforms are returned, pick the GPL10944 entry; otherwise take the only one
if (length(gset) > 1) idx <- grep("GPL10944", attr(gset, "names")) else idx <- 1
gset <- gset[[idx]]

After that, I would conduct my analyses in R using limma.
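
For what such a limma analysis might look like, here is a rough sketch with the contrast written out explicitly, so there is no doubt about which group is the numerator. It runs on simulated data, not on GSE24279, and the names expr, group and TumourVsNormal are assumptions made up for the illustration. With real data you would start from something like exprs(gset) and a group factor built from the sample annotation, and make sure the values are on the log2 scale first.

```r
library(limma)

set.seed(1)
# Simulated log2 expression matrix: 100 genes x 6 samples (hypothetical data)
expr <- matrix(rnorm(600, mean = 8, sd = 0.2), nrow = 100,
               dimnames = list(paste0("gene", 1:100), paste0("s", 1:6)))
group <- factor(c("Tumour", "Tumour", "Tumour", "Normal", "Normal", "Normal"),
                levels = c("Normal", "Tumour"))

# Make gene1 clearly higher in the Tumour samples
expr["gene1", group == "Tumour"] <- expr["gene1", group == "Tumour"] + 3

# One design column per group, named after the factor levels
design <- model.matrix(~ 0 + group)
colnames(design) <- levels(group)

fit <- lmFit(expr, design)

# Tumour - Normal on the log2 scale, i.e. log2(Tumour / Normal)
cont.matrix <- makeContrasts(TumourVsNormal = Tumour - Normal, levels = design)
fit2 <- eBayes(contrasts.fit(fit, cont.matrix))

res <- topTable(fit2, coef = "TumourVsNormal", number = Inf, sort.by = "none")
print(res["gene1", ])
```

Writing the contrast as Tumour - Normal means a positive logFC is higher in Tumour. Writing Normal - Tumour instead would flip every sign.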

written 4 months ago by Kevin Blighe • modified 4 months ago

Thank you Kevin,

I will try to write them. Best regards

written 4 months ago by cagdas
Kim wrote (12 months ago):

I also emailed the GEO2R team and they've just replied. For anyone who has the same question:

"The first group that is named in the 'Define groups' drop-down menu becomes the denominator in logFC, and the second group that is named becomes the numerator. The background pink/blue colors are not important.

Reversing the order in which you name groups will result in a reversed sign in logFC (e.g., 4 vs. -4)."
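
This also seems to explain the makeContrasts(G1-G0, levels=design) line quoted earlier in the thread. As far as I can tell (an assumption based on GEO2R scripts I have seen, not something the GEO2R team confirmed here), the script turns each digit of the gsms string into a group label by pasting "G" in front of it. So 0 becomes G0 (the first-named group), 1 becomes G1 (the second-named group), and X marks excluded samples. G1-G0 is then second group minus first group on the log2 scale, which matches the reply above. A minimal base R sketch with a shortened, made-up gsms string and made-up expression values:

```r
# Hypothetical, shortened gsms string:
# X = sample excluded, 0 = first-named group, 1 = second-named group
gsms <- "XX000111"

# Split into one character per sample and drop the excluded samples
sml <- strsplit(gsms, "")[[1]]
sml <- sml[sml != "X"]

# Build the group labels G0 / G1 from the digits
groups <- factor(paste0("G", sml))
print(table(groups))

# Hypothetical log2 expression values for one gene, in the same sample order
y <- c(2, 2, 2, 4, 4, 4)
means <- tapply(y, groups, mean)

# G1 - G0: second-named group as numerator, first-named group as denominator
logFC_G1_vs_G0 <- unname(means["G1"] - means["G0"])

# Naming the groups in the opposite order swaps the labels and flips the sign
logFC_G0_vs_G1 <- unname(means["G0"] - means["G1"])

print(c(logFC_G1_vs_G0, logFC_G0_vs_G1))
```

If you want treated / control from GEO2R, this suggests naming the control group first and the treated group second.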

written 12 months ago by Kim