Question: Is it possible to analyze together rna seq outputs from cufflinks and rsem?
0
gravatar for SandraGarcia
4 weeks ago by
Barcelona, Spain
SandraGarcia0 wrote:

Hello everyone! I want to analyze together the gene expression profiles from two different datasets and compare a number of genes between the two. One of them is in rsem v2 output format, and the other in fpkm (cufflink output). I have triend calculating the fpkm from rsem raw counts, but the distributions do not correlate at all so I am not sure if it would be possible to compare the gene expression among them as the data do not seem to be in the same scale.

Do you have any idea if it is possible to make these two data type comparable?

Thank you in advance,

Sandra

rsem rna-seq cufflinks fpkm • 129 views
ADD COMMENTlink modified 4 weeks ago by Kevin Blighe6.6k • written 4 weeks ago by SandraGarcia0
0
gravatar for Kevin Blighe
4 weeks ago by
Kevin Blighe6.6k
Republic of Ireland (√Čire)
Kevin Blighe6.6k wrote:

You're implying that you want to merge the datasets together? I think that the best approach would be to convert both datasets independently to the Z-scale and to then see how they line up in a histogram (and also do your correlation analyses). Another option could be to use one dataset as the training dataset, and the other as the validation dataset.

If you go down the merging route, then you will always be criticised by reviewers when trying to publish.

Hope this helps!

Kevin

ADD COMMENTlink written 4 weeks ago by Kevin Blighe6.6k

Thank you Kevin, I will go for it. Yes I want to merge the two datasets. We are also thinking of asking for the permissions to have the bam files of the second dataset so I will be able to re-analyze them with the same software.

ADD REPLYlink written 4 weeks ago by SandraGarcia0

Great!

Well, the BAM files would help, but they would still be produced very differently, i.e., a BAM produced by Bowtie/TopHat is very different from a BAM produced by some other aligner. You will just have to be very methodical when merging these datasets. I mentioned Z-scores because I had success with that in the past, but the differences between RSEM and FPKM are even great than the differences that I encountered.

ADD REPLYlink written 4 weeks ago by Kevin Blighe6.6k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1464 users visited in the last hour