Question

transcriptomics gene expression per sample

1

Entering edit mode

4.0 years ago

mcorb ▴ 20

Is is fair (or correct) to separate subjects in high responders or low responders based on expression value of a gene in an seqRNA experiment? In other words can we use per sample gene expression for anything? I thought we are looking at the data as groups. Thank you.

RNA-Seq • 760 views

ADD COMMENT • link 4.0 years ago by mcorb ▴ 20

0

Entering edit mode

Not sure what you mean, but there are various disease sub-types where the heightened expression of a single gene is predictive of the sub-type, e.g., EGFR expression and 70% of lung cancers, ERBB2 / Her2 expression and Her2-positive breast cancer, etc.

ADD REPLY • link 4.0 years ago by Kevin Blighe 87k

0

Entering edit mode

I've been asked to use expression or fold change for gene x in patient A to compare with the patient B for the same gene. Both patients are part of a big study. I said that is possible but is not correct.

ADD REPLY • link 4.0 years ago by mcorb ▴ 20

0

Entering edit mode

So, just comparing 2 numbers? - all that you can do is simple arithmetic:

division
subtraction

ADD REPLY • link 4.0 years ago by Kevin Blighe 87k

0

Entering edit mode

I know the math. I don't believe it is correct to look at one patient Fold Change in a group. That was my original question/ doubt. It is meaningless.

ADD REPLY • link 4.0 years ago by mcorb ▴ 20

0

Entering edit mode

You can't say they are significantly different. Either or both of those samples could be outliers. That does not mean the values are meaningless. Outliers are a problem with any classification problem. Does it mean we can't classify individual samples?

ADD REPLY • link 4.0 years ago by igor 13k

0

Entering edit mode

Thank you Igor for agreeing that you cannot say that they are significantly different. Each of them is part of the group, its value makes up for the group value, if they are outlier we can eliminate them. I can see the number difference in expression for one gene but I cannot infer from that that this sample (patient) responds better than the other one. At least this is my understanding, I might be wrong, this is why I asked. I do not think we can classify samples based on expression gene levels or fold change. We can use other metrics (like library size, number of exons, etc) but not this one.

ADD REPLY • link 4.0 years ago by mcorb ▴ 20

1

Entering edit mode

You can certainly classify samples based on gene expression. Classic example is PAM50, which has been used clinically for over a decade: https://ascopubs.org/doi/10.1200/JCO.2008.18.1370

ADD REPLY • link 4.0 years ago by igor 13k

score 0 · Answer 1 · 2020-04-30

It depends. If you have normalised your dataset, transformed it, and standardised it (e.g., to Z scores), then it is very much possible to look at a single value and make inferences from that. For example, assuming a good study design and a two-tailed normal distribution, a Z-score value of > 1.96 is statistically significantly above the mean of the population at 5% alpha.

Edit: and the assumption in this scenario is also, of course, that sample outliers have been removed and that the analyst has perform other QC metrics, like inter-group correlation, low count filtering prior to normalisation, et cetera.