Question: Is it possible to analyze together rna seq outputs from cufflinks and rsem?
0
gravatar for SandraGarcia
21 months ago by
Barcelona, Spain
SandraGarcia0 wrote:

Hello everyone! I want to analyze together the gene expression profiles from two different datasets and compare a number of genes between the two. One of them is in rsem v2 output format, and the other in fpkm (cufflink output). I have triend calculating the fpkm from rsem raw counts, but the distributions do not correlate at all so I am not sure if it would be possible to compare the gene expression among them as the data do not seem to be in the same scale.

Do you have any idea if it is possible to make these two data type comparable?

Thank you in advance,

Sandra

rsem rna-seq cufflinks fpkm • 740 views
ADD COMMENTlink modified 21 months ago by Kevin Blighe45k • written 21 months ago by SandraGarcia0
0
gravatar for Kevin Blighe
21 months ago by
Kevin Blighe45k
Kevin Blighe45k wrote:

Edit: answer updated February 4th, 2019

You're implying that you want to merge the datasets together? There may be no precedent for merging FPKM and RSEM v2 counts. If I were forced to do it, I would process the RSEM counts independently via tximport/DESeq2 and ultimately produce variance-stabilised counts, which I would then transform to Z-scale. I would then also convert the FPKM counts to Z-scale via zFPKM (R package). On the Z-scale, the distributions are at least comparable. You would likely have to include, as a covariate, SOURCE (FPKM or RSEM) in all downstream modeling from that point forward.

If you go down the merging route, then you will always face criticism by reviewers when trying to publish.

Hope this helps!

Kevin

ADD COMMENTlink modified 5 months ago • written 21 months ago by Kevin Blighe45k

Thank you Kevin, I will go for it. Yes I want to merge the two datasets. We are also thinking of asking for the permissions to have the bam files of the second dataset so I will be able to re-analyze them with the same software.

ADD REPLYlink written 21 months ago by SandraGarcia0

Great!

Well, the BAM files would help, but they would still be produced very differently, i.e., a BAM produced by Bowtie/TopHat is very different from a BAM produced by some other aligner. You will just have to be very methodical when merging these datasets.

ADD REPLYlink modified 5 months ago • written 21 months ago by Kevin Blighe45k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 634 users visited in the last hour