Question

Does the countToFPKM package work for single end RNA seq or its only for paired end?

0

Entering edit mode

3.2 years ago

salehm ▴ 10

Hi All, I have a featurecount output row counts and need to convert them to FPKM. I knew about the countToFPKM package, however, the description of the package says its for paired end RNA seq data. My data are single end. Could I still use this package? and if not could you please suggest an easy way to do it?

The reason I need to convert from row counts to FPKM is to perform ssGSEA analyses on gene pattern. Thanks.

Mohamed

RNA-Seq • 999 views

ADD COMMENT • link updated 3.2 years ago by seidel 11k • written 3.2 years ago by salehm ▴ 10

0

Entering edit mode

Does the FPKM metric apply to SE data? What would be the difference between RPKM and FPKM in the context of SE data?

ADD REPLY • link 3.2 years ago by Ram 43k

0

Entering edit mode

It sounds that yes the FPKM needs paired end. So I may need to convert my single end data to RPKM, any easy idea how I can do that? Thanks.

ADD REPLY • link 3.2 years ago by salehm ▴ 10

1

Entering edit mode

What have you tried? Have you searched online for "get RPKM from raw counts"?

ADD REPLY • link 3.2 years ago by Ram 43k

0

Entering edit mode

I was thinking of using countToFPKM package but it says it works for paired end reads, I am not sure if I could also use it for single end reads.

ADD REPLY • link 3.2 years ago by salehm ▴ 10

0

Entering edit mode

So you searched for "get RPKM from counts" and the only thing you see is the countToFPKM package?

ADD REPLY • link 3.2 years ago by Ram 43k

0

Entering edit mode

Is featurecounts going to do FPKM right? Don't you need something like Kallisto or RSEM that understands how the different transcripts are proportioned?

ADD REPLY • link 3.2 years ago by swbarnes2 14k

score 4 · Accepted Answer · 2021-02-24

Once you have a matrix of counts per feature, there is no way to tell if the counts for the feature came from paired-end data or single end data. The single or double endedness of your DNA Fragments is accounted for in the generation of an integer count representing a feature. Thus, your count matrix can be converted to FPKM whether it was generated from single or paired-end data. FPKM is used to represent abundance regardless of end type, and is identical to RPKM for this same reason. RPKM simply suffers from a historical prematurity in that "Reads" actually represent "Fragments", whether sequenced by one end or two. One caveat: the package, and the method it implements (from Trapnell et al., 2010), uses the average DNA fragment length for a given library in the calculation. This can be determined easily in paired end data, but for single end data you usually have to estimate the library insert size (usually an educated guess).