Question: TPM VS FPKM VS TMM and K-mer specific expression quantification
1
gravatar for user230613
17 months ago by
user230613260
Europe
user230613260 wrote:

Hi Biostar's!

I'm trying to find the best approach to fill my goal. I would like to measure the expression of specific set of kmers - 10 aa long peptide sequences across different samples. I have RNA-Seq data for each sample. I'm planing to use alignment-independent methods such as Kallisto or Salmon. My first question is:

1) How can I measure the expression of specific Kmer when this Kmer is not unique in the genome? Let's say that it can be coded by different transcripts in the same gene or in different genes sharing the sequence... How can I measure the global abundance of that Kmer in the sample?

For other hand...:

2) Which method should I choose for measuring the expression? I have read that TPM has overcome FPKM. But both TPM and FPKM measure the relative abundance, maybe I should consider measuring absolute abundance with TMM. Is there any specific case when relative is preferred over absolute measurement?

Thank you in advance,

rna-seq kmer • 2.2k views
ADD COMMENTlink modified 15 months ago by Rob2.8k • written 17 months ago by user230613260
3
gravatar for h.mon
15 months ago by
h.mon21k
Brazil
h.mon21k wrote:

Kallisto and Salmon quantify transcript expression, not kmer expression. They use kmer matches between reads and transcripts to quantify transcript expression, as estimated by read counts.

1) How can I measure the expression of specific Kmer when this Kmer is not unique in the genome?

Expectation-maximization algorithm.

2) Which method should I choose for measuring the expression?

What you want to do? TPM and FPKM are within-sample normalizations, intended to allow comparison of expression levels of different genes from the same sample. They are not needed for differential transcript expression between different samples.

ADD COMMENTlink modified 13 months ago • written 15 months ago by h.mon21k
3
gravatar for Rob
15 months ago by
Rob2.8k
United States
Rob2.8k wrote:

To add to h.mon's answer, there is generally no "absolute" measurement for transcript expression. For example, the number of reads assigned to each transcript depends on sampling depth, relative abundnaces, etc. I have written a blog post on some of the different expression measurements that are common that you can read here. Salmon outputs both TPM and the estimated number of reads assigned to each transcript. The former is useful for within-sample analysis. To perform e.g., differential expression testing, you can read Salmon's output using a tool like tximport. This will allow you to import all of your quantified samples directly into R in a way that the built-in between-sample normalization approaches for tools like DESeq2 and EdgeR can be directly applied.

ADD COMMENTlink written 15 months ago by Rob2.8k

Hi Rob. Could you guide me a bit? If I want to find the expression of a given kmer (10nt) that is common between two transcripts, should I add the TPMs of both transcripts?

ADD REPLYlink written 13 months ago by user230613260

Hello, again, I wonder if you could guide a little bit with my previous question? Thank you :)

ADD REPLYlink written 11 months ago by user230613260
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1736 users visited in the last hour