Question: How to know if high expression of a gene is really a good independent marker for poor prognosis in cancer?
gravatar for curious
20 months ago by
curious470 wrote:

This is kind of a general question that I have been curious about for a while. Please consider my example:

I have a gene that seems like it should be associated with pro-cancer effects based on the literature.

I go ahead and mine rna-seq samples from cancer patients and code them as "high expressers" or "low expressers" of my gene.

I do some survival analysis using Univariate Cox regression and find out high expression of my gene is associated with significantly reduced overall survival.

My question is how do I actually know that this gene is an independent prognosticator and that this is not a correlation does not equal causation trap?

How do I know that when selecting my population of "high expressers" of my gene of interest I am not inadvertently over-representing some other gene that is a much more powerful and independent prognosticator?

Thank you

ADD COMMENTlink modified 20 months ago by omg what am I doing70 • written 20 months ago by curious470
gravatar for omg what am I doing
20 months ago by
omg what am I doing70 wrote:

I would approach it by thinking about what is known about the function of the gene it a known regulator of an established tumor suppressor or oncogene? A KEGG search might help with that (to follow Kevin's example here is p53 If there's no clear answer, but some other supporting literature showing correlation with survival (and you can use cbio portal to look at a lot of cancer data. Here I looked at BRCA2 in breast cancer) Then maybe it's worth starting some functional experiment and taking RNA-seq data and seeing if it lines up with any protein expression, cell survival data, etc.

ADD COMMENTlink written 20 months ago by omg what am I doing70
gravatar for Kevin Blighe
20 months ago by
Kevin Blighe69k
Republic of Ireland
Kevin Blighe69k wrote:

'Most' genes exhibit altered expression in cancer, but most of these are likely unrelated [directly] to the primary mechanism that drives the cancer. As we know, certain genes, like TP53, are the main drivers of tumourigenesis. Heightened TP53 expression will trigger a cellular cascade that will alter the expression of many other genes, downstream, but the altered expression of these downstream genes, on its own, is not what is driving tumourigenesis. Even still, the altered expression of these downstream genes can be seen as a biomarker or 'proxy' of tumourigenesis.

Some independent validation of your finding would help. So, like, search GEO microarray datasets and try to replicate the finding in those.

Also check the expression of your gene in Genotype-Tissue Expression (GTEx) data, i.e., in order to ensure that the gene is not just normally highly expressed in the tissue in which this cancer occurs.

ADD COMMENTlink modified 20 months ago • written 20 months ago by Kevin Blighe69k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1826 users visited in the last hour