Group definition and logFC calculation in GEO2R
4
0
Entering edit mode
4.7 years ago
Kim ▴ 20

Hello everyone

I'm learning to analyze GEO data and using GEO2R is the most straightforward to me, but I'm having problem understanding how GEO2R (or limma) calculate the logFC.

I read this link https://wiki.bits.vib.be/index.php/Analyze_GEO_data_with_GEO2R and noticed this:

"First define the treated group (it will be colored in blue), then define the control group (it will be colored in pink). The order is important for calculating log fold changes later in the analysis"

The result looks like this https://ibb.co/CKdRkSz

It seems to me that logFC is calculated in GEO2R as logFC = log(control/treated) (or log(normal/tumor) in this case) However, I've read from several sources that logFC = log(treated/ control)

This is confusing me in interpreting gene expression level. Could you please help explain this to me.

Thank you very much.

GEO2R logFC Group definition • 4.3k views
ADD COMMENT
1
Entering edit mode
4.7 years ago

The logFC (to be proper, it is log [base 2] fold change) calculation can have anything as the numerator and anything as the denominator. It is your role as the analyst to define these.

Consider the example for geneX:

  • Tumour geneX = 16
  • Normal geneX = 4

.

log2FC = log2(Tumour / Normal) = log2(16 / 4) = 2
log2FC = log2(Normal / Tumour) = log2(4 / 16) = -2

See the difference? Now, you choose what you want as the numerator and denominator.

Kevin

ADD COMMENT
0
Entering edit mode

Hi Kevin

Thank you for your comment. I agree that what we choose as numerator and denominator are up to our decision. I'm just confused as GEO2R always take Pink/ Blue (control/ treatment) while some people say it's always treatment/ control. But it's useful to know there's no fixed rule on log2FC calculation.

ADD REPLY
0
Entering edit mode

I believe you can change the order via GEO2R, but I do not use it too much.

ADD REPLY
0
Entering edit mode

Hello,

I have two groups: Pancreatic cancer and control, I defined it in GEO2R via Defined Groups menu. 1. (blue): Pancreatic cancer 2. (pink): Control

According to your message the result should be Pancreatic Cancer / Control. Is it true.? I see below code in GEO2R codes. What is the meaning of it and are G1-G0 related to control or treated groups? G1 / G0 = Pancreatic Cancer / Control ?

cont.matrix <- makeContrasts(G1-G0, levels=design)

gsms <- paste0("XXXXXXXXXXXXXXXXXXXXXX1111111111111111111111111110", "00000000000000000000000000000000000000000000000000", "00000000000000000000000000000000000000000000000000", "00000000000000000000000000000000000")

Thanks

ADD REPLY
0
Entering edit mode

If you have skills in R, I would encourage you to avoid using GEO2R, and to instead use your own coding.

I do not know to what G1 and G0 relate, but could be cell cycle stage. I do know, however, that the code that is generated by GEO2R can be incorrect / misleading.

If you want further help, please at least post the GEO accession ID of this stage.

Thank you!

ADD REPLY
0
Entering edit mode

Hi Kevin,

Thank you for your reply. I think, it is not related to cell cycle stage. Because when I review another IDs, I see the other numbers such as G3-G4... GEO ID is GSE24279 Thank you

ADD REPLY
2
Entering edit mode

I do not see anything related to G0-4 when I go to GSE24279. It is just a case-control study for pancreatic cancer.

I would just obtain the data in R, like this:

library(Biobase)
library(GEOquery)

# load series and platform data from GEO
gset <- getGEO("GSE24279", GSEMatrix =TRUE, getGPL=FALSE)
if (length(gset) > 1) idx <- grep("GPL10944", attr(gset, "names")) else idx <- 1
gset <- gset[[idx]]

After that, I would conduct my analyses in R using limma

ADD REPLY
0
Entering edit mode

Thank you Kevin,

I will try to write them Best regards

ADD REPLY
1
Entering edit mode
4.7 years ago
Kim ▴ 20

I also emailed GEO2R team and they've just replied. For anyone who has the same question:

"The first group that is named in the 'Define groups' drop-down menu becomes the denominator in logFC, and the second group that is named becomes the numerator. The background pink/blue colors are not important.

Reversing the order in which you name groups will result in a reversed sign in logFC (e.g., 4 vs. -4)."

ADD COMMENT

Login before adding your answer.

Traffic: 1990 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6