Question: RNA-seq gene expression normalization for independent samples
gravatar for Amos
2.9 years ago by
Amos0 wrote:


I have a large database of RNA-seqs of different cell lines. I'd like to get normalized gene expression levels for each of the cell lines. From what I've read so far it seems that normalizing gene expression is done compared to a reference or control (such as with DEseq). Since the cell lines are indepedent of each other I don't have one cell line that I would consider a control or a suitable refenrce point for gene expression levels.

Is there an accepted way for normlaizing gene expression levels for single samples without comparing to a control?

Thank you.

rna-seq deseq gene expression • 1.5k views
ADD COMMENTlink written 2.9 years ago by Amos0

I would say it depends on what is your biological question (e.g. you want to use all of them and compare them or pick some then compare?). The most obvious normalization that you can do as a first step is to normalize for the library size, i.e. do the cpm normalization.

ADD REPLYlink written 2.9 years ago by aka001190

I will want to pick some and compare them to each other later on but essentialy I want to compare the expression levels of different genes within each cell line to another metric that I calculated for each cell line.

ADD REPLYlink written 2.9 years ago by Amos0

What about Transcripts per million? (TPM), or RPKM? (Reads per kilobase per million)?

ADD REPLYlink written 2.9 years ago by Buffo1.8k

Well, normalization can be within a sample or across samples. In that case, I think you wanna do cpm, TPM, or RPKM/FPKM. Later when you want to compare some sets of cell lines, you probably want to do the across samples normalization (e.g. RLE, etc.).

ADD REPLYlink written 2.9 years ago by aka001190
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1582 users visited in the last hour