Question: TPM VS FPKM VS TMM and K-mer specific expression quantification
2
gravatar for user230613
2.1 years ago by
user230613280
Europe
user230613280 wrote:

Hi Biostar's!

I'm trying to find the best approach to fill my goal. I would like to measure the expression of specific set of kmers - 10 aa long peptide sequences across different samples. I have RNA-Seq data for each sample. I'm planing to use alignment-independent methods such as Kallisto or Salmon. My first question is:

1) How can I measure the expression of specific Kmer when this Kmer is not unique in the genome? Let's say that it can be coded by different transcripts in the same gene or in different genes sharing the sequence... How can I measure the global abundance of that Kmer in the sample?

For other hand...:

2) Which method should I choose for measuring the expression? I have read that TPM has overcome FPKM. But both TPM and FPKM measure the relative abundance, maybe I should consider measuring absolute abundance with TMM. Is there any specific case when relative is preferred over absolute measurement?

Thank you in advance,

rna-seq kmer • 2.9k views
ADD COMMENTlink modified 23 months ago by Rob3.3k • written 2.1 years ago by user230613280
3
gravatar for h.mon
23 months ago by
h.mon26k
Brazil
h.mon26k wrote:

Kallisto and Salmon quantify transcript expression, not kmer expression. They use kmer matches between reads and transcripts to quantify transcript expression, as estimated by read counts.

1) How can I measure the expression of specific Kmer when this Kmer is not unique in the genome?

Expectation-maximization algorithm.

2) Which method should I choose for measuring the expression?

What you want to do? TPM and FPKM are within-sample normalizations, intended to allow comparison of expression levels of different genes from the same sample. They are not needed for differential transcript expression between different samples.

ADD COMMENTlink modified 20 months ago • written 23 months ago by h.mon26k
3
gravatar for Rob
23 months ago by
Rob3.3k
United States
Rob3.3k wrote:

To add to h.mon's answer, there is generally no "absolute" measurement for transcript expression. For example, the number of reads assigned to each transcript depends on sampling depth, relative abundnaces, etc. I have written a blog post on some of the different expression measurements that are common that you can read here. Salmon outputs both TPM and the estimated number of reads assigned to each transcript. The former is useful for within-sample analysis. To perform e.g., differential expression testing, you can read Salmon's output using a tool like tximport. This will allow you to import all of your quantified samples directly into R in a way that the built-in between-sample normalization approaches for tools like DESeq2 and EdgeR can be directly applied.

ADD COMMENTlink written 23 months ago by Rob3.3k

Hi Rob. Could you guide me a bit? If I want to find the expression of a given kmer (10nt) that is common between two transcripts, should I add the TPMs of both transcripts?

ADD REPLYlink written 20 months ago by user230613280

Hello, again, I wonder if you could guide a little bit with my previous question? Thank you :)

ADD REPLYlink written 18 months ago by user230613280
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 972 users visited in the last hour