Question

kal's test on rpkm vs raw counts

0

Entering edit mode

6.5 years ago

from the mountains ▴ 230

For comparisons with 0 replicates, I am using Kal's Z-test on proportions. I am wondering, does it make more sense to run Kal's on RPKM or on raw counts, and why? I would think that RPKM is better because it is normalized by gene length. We are more interested in knowing the (transcripts_of_geneA/total_transcripts) rather than (reads_of_geneA/total_reads), aren't we? I can see however that they can yield slightly different results in some cases.

I see conflicting information on each alternative. On the one hand, the Kal's paper writes the equation as (n-specific mRNA reads/cell)/(N-total mRNA reads/cell), but on the other hand, when I try to run the function on RPKM in CLC it warns me that proportions tests are aimed at count data.

Which is better?

RNA-Seq • 1.5k views

ADD COMMENT • link updated 6.5 years ago by theobroma22 ★ 1.2k • written 6.5 years ago by from the mountains ▴ 230

0

Entering edit mode

Could you elaborate on what it is you actually want to do? Then we migth be better able to help you.

ADD REPLY • link 6.5 years ago by Kristoffer Vitting-Seerup ★ 4.0k

score 0 · Answer 1 · 2017-10-11

0

Entering edit mode

6.5 years ago

theobroma22 ★ 1.2k

CLC Bio is a licensed platform, so it may be best for you to contact CLC to get an answer to your question. However, Karl's test is for the count data and not RPKM.

ADD COMMENT • link 6.5 years ago by theobroma22 ★ 1.2k

0

Entering edit mode

good to know. but why? and doesn't that seem in conflict with the formula stated in the kal's test paper? : https://www.ncbi.nlm.nih.gov/pmc/articles/PMC25383/

ADD REPLY • link 6.5 years ago by from the mountains ▴ 230

0

Entering edit mode

Not at all. You have to consider the experimental design of SAGE vs. RNA-seq, which is what you used.

ADD REPLY • link 6.5 years ago by theobroma22 ★ 1.2k