Question: What cutoff is used to define high or low expression of a gene for survival analysis?
5 months ago, mygamefun3 wrote:

Hi everyone

In RNA-seq analysis, we need to separate samples into two groups for survival analysis. How should I define high and low expression of a gene from counts or FPKM? Should I use the median, the mean, or quantiles?

How do TCGA and Oncomine define the cutoff for a gene?


rna-seq • 470 views

There's no definitive answer to your question. I would not advise splitting at the median or the mean. Quantiles are a reasonable idea, for example tertiles, with the upper third regarded as "high expression".
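
To make the tertile idea concrete, here is a minimal sketch in plain Python. The sample names, the expression values, and the function name `tertile_groups` are made up for illustration, and I am assuming the values are already normalized (e.g. log2 FPKM) for a single gene. It is one way to do the split, not a standard procedure.

```python
from statistics import quantiles

def tertile_groups(expression):
    """Label each sample 'low', 'mid', or 'high' by tertile of one gene's expression.

    `expression` maps sample ID -> normalized expression value (hypothetical input).
    """
    # n=3 returns the two cut points that divide the data into thirds
    lower_cut, upper_cut = quantiles(list(expression.values()), n=3, method="inclusive")
    groups = {}
    for sample, value in expression.items():
        if value > upper_cut:
            groups[sample] = "high"
        elif value <= lower_cut:
            groups[sample] = "low"
        else:
            groups[sample] = "mid"
    return groups

# Toy data: nine samples, one gene (values are invented)
expression = {f"S{i}": float(i) for i in range(1, 10)}
groups = tertile_groups(expression)
```

For a two-group survival comparison you could then compare "high" against "low" and drop "mid", or compare "high" against everything else. Which of the two you pick is a design choice and should be decided before looking at the survival curves.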

An even better idea would be to convert your data to the Z scale, i.e., standard deviations from the mean, and then choose an absolute Z-score of 3, 4, 5, or 6 (i.e., 3, 4, 5, or 6 standard deviations above or below the mean) as potential cut-offs. I trust that you have already QC'd your data and removed low-count transcripts.
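
A rough sketch of the Z-scale approach, again in plain Python with invented values. The function names `z_scores` and `flag_by_z` are hypothetical, and I am assuming the sample standard deviation is used and that the input is normalized expression for one gene across samples.

```python
from statistics import mean, stdev

def z_scores(values):
    """Convert values to Z-scores: (value - mean) / sample standard deviation."""
    mu = mean(values)
    sd = stdev(values)
    return [(v - mu) / sd for v in values]

def flag_by_z(values, cutoff=3.0):
    """Label each value 'high', 'low', or 'neither' using an absolute Z cut-off."""
    labels = []
    for z in z_scores(values):
        if z >= cutoff:
            labels.append("high")
        elif z <= -cutoff:
            labels.append("low")
        else:
            labels.append("neither")
    return labels

# Toy data: 19 samples at baseline and one strongly expressing sample
values = [0.0] * 19 + [10.0]
labels = flag_by_z(values, cutoff=3.0)
```

Note that with a cut-off this strict most samples will fall into "neither", so check how many samples end up in each group before running the survival analysis.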

written 5 months ago by Kevin Blighe

Thanks for your advice.

written 5 months ago by mygamefun3

