Question: What Does "Length Per Kilobase" Mean?
7.2 years ago, Tarun Mohite wrote:

Let's say I have to find the "length per KB" of a read.

How should I go about it?

I can find the "length" by subtracting the START coordinate of the read from its END coordinate.

Let's say Read 1 spans from 300 to 330. So, its length is 30.

Does "length per KB" mean I divide 30 by 1000, i.e. 0.030?

7.2 years ago, JC (Mexico) wrote:

As you pointed out, "length per kilobase" is the sequence size expressed in kilobases. But it is applied to gene sizes, which span 1-2 kb on average in the human genome, not to reads, which are really short. I think you are trying to understand RPKM; see http://seqanswers.com/forums/showthread.php?t=586
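For illustration, the conversion discussed here can be sketched in a few lines of Python. This is only a minimal sketch: the function name `length_in_kb` is made up for this example, and it assumes the length is computed as END minus START, exactly as in the question (other coordinate conventions would add 1).

```python
def length_in_kb(start: int, end: int) -> float:
    """Return the length of a feature in kilobases.

    Assumption: length = end - start, as in the question's example
    (an inclusive, 1-based convention would use end - start + 1).
    """
    length_bp = end - start      # length in base pairs
    return length_bp / 1000      # 1 kilobase = 1000 base pairs


# The read from the question, spanning 300 to 330
read_kb = length_in_kb(300, 330)
print(read_kb)  # 0.03
```

The same function applies unchanged to a gene or an exon, which is where a length in kilobases is normally used.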


JC, thanks a lot! RPKM is exactly what I am trying to understand. :)

Reply written 7.2 years ago by Tarun Mohite

The original concept, as applied to RNA-seq, is described in http://www.nature.com/nmeth/journal/v5/n7/full/nmeth.1226.html

Reply written 7.2 years ago by JC
7.2 years ago, Tarun Mohite wrote:

Hello JC,

According to my understanding of the paper, the simplest way I can write RPKM as a formula in words is:

RPKM = (sum of all reads mapping to all exons of the gene) / ((sum of lengths of all exons of the gene / 1000) * (sum of all reads mapping to all genes in the sample / 1000000))

Is that correct?

However, if I have to find the RPKM for each exon of each gene, would I modify the formula to:

RPKM = (total #reads mapping to the exon) / ((length of exon / 1000) * (sum of all reads mapping to all genes in the sample / 1000000))

Please correct me if I am going wrong.

Thanks.


You are right in both formulas. Bioconductor now has a new package for handling exon-level RNA-seq data: DEXSeq http://www.bioconductor.org/packages/devel/bioc/html/DEXSeq.html Paper: http://precedings.nature.com/documents/6837/version/2/files/npre20126837-2.pdf
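As a rough illustration, the two wordy formulas from the post above can be written as one small Python function and applied either to a whole gene or to a single exon. This is a sketch only: the function name `rpkm`, the variable names, and all read counts and exon lengths are hypothetical and do not come from any real sample.

```python
def rpkm(reads_in_feature: int, feature_length_bp: int, total_mapped_reads: int) -> float:
    """Reads per kilobase of feature per million mapped reads."""
    length_kb = feature_length_bp / 1000             # feature length in kilobases
    millions_mapped = total_mapped_reads / 1_000_000 # library size in millions of reads
    return reads_in_feature / (length_kb * millions_mapped)


# Hypothetical gene with three exons (made-up numbers)
exon_lengths = [300, 500, 1200]   # exon lengths in bp, 2000 bp in total
exon_reads = [60, 50, 90]         # reads mapping to each exon, 200 in total
total_mapped_reads = 4_000_000    # reads mapping to all genes in the sample

# Gene-level RPKM: sum reads and lengths over all exons of the gene
gene_rpkm = rpkm(sum(exon_reads), sum(exon_lengths), total_mapped_reads)
print(gene_rpkm)  # 25.0

# Exon-level RPKM: same formula, applied to one exon at a time
exon_rpkms = [
    rpkm(reads, length, total_mapped_reads)
    for reads, length in zip(exon_reads, exon_lengths)
]
print([round(value, 2) for value in exon_rpkms])
```

Note that the denominator's library size is the same in both cases; only the read count and the length change between the gene-level and exon-level calculation.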

Reply written 7.2 years ago by JC