Question: Interpretation of Salmon quant.sf result
saadleeshehreen wrote (10 months ago):

Hi,

I am trying to determine whether my gene of interest is expressed in a particular bacterial genome. In the quant.sf file from Salmon, I got TPM 1000000.0 and NumReads 111.00 for that gene.

What does this mean? Is the NumReads value too low to conclude that the gene is expressed?

Thanks in advance.

rna-seq salmon • 376 views

Looks like you mapped only against that gene? A TPM of 1000000 means all of the relative expression is allotted to that gene. This seems very unlikely if you've quantified the sample correctly (using the whole transcriptome as the reference).
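To make the point about a TPM of 1000000 concrete, here is a minimal sketch of the standard TPM normalization (per-transcript read rate divided by the sum of all rates, scaled to one million). It is not Salmon's code, and the effective lengths and the extra transcripts are made-up values for illustration only.

```python
def tpm(num_reads, eff_lengths):
    """Return TPM values from per-transcript read counts and effective lengths."""
    # Reads per base of effective length for each transcript.
    rates = [n / l for n, l in zip(num_reads, eff_lengths)]
    total = sum(rates)  # assumes at least one transcript has reads
    return [r / total * 1e6 for r in rates]

# Reference with a single target: it receives all of the relative
# abundance, whatever its read count is (effective length is hypothetical).
single_high = tpm([111.0], [150.0])
single_low = tpm([5.0], [150.0])
print(single_high, single_low)

# Hypothetical three-transcript reference: the same 111 reads now
# compete with the other targets for a share of the million.
multi = tpm([111.0, 5000.0, 900.0], [150.0, 1200.0, 800.0])
print(multi)
```

With only one sequence in the reference, the result carries no information about expression level: 5 reads and 111 reads both give the same TPM.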

written 10 months ago by Rob

Hi,

I downloaded the SRA file for the corresponding genome and used the nucleotide sequence of my gene of interest as the reference. The gene is 300 bp long. Since I plan to study its function, my intention was to see whether the gene is expressed in that particular genome. If it is not expressed, I will not proceed with downstream experiments.

I think it is a clear indication that the gene was expressed in that particular genome, isn't it?

written 10 months ago by saadleeshehreen (modified 10 months ago)
h.mon wrote (10 months ago):

"I think it is a clear indication that the gene was expressed in that particular genome, isn't it?"

No, I disagree. Strictly speaking, the way you quantified is incorrect and does not support your conclusion. As Rob pointed out, you didn't use Salmon correctly, and the way you used it has two potential problems:

1) If you map to only one gene, reads that would otherwise map perfectly to other genes (similar in sequence, but different), and not to this gene, may now map to this gene;

2) If this gene has high similarity to other genes, reads would map to multiple locations, but their assignment could still be estimated accurately by Salmon's EM algorithm. In the absence of these similar genes, you may be overestimating the counts.

Use Salmon as intended: quantify the reads against the whole set of transcripts from your species, then examine the counts and TPM for the gene of interest.
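Once the reads have been quantified against the full transcript set, looking up one gene in quant.sf can be sketched as below. quant.sf is a tab-separated table with the columns Name, Length, EffectiveLength, TPM and NumReads. The transcript names, the effective lengths and the two other rows are hypothetical, and the table is inlined as a string so the snippet runs on its own; with real output you would pass an open file handle instead.

```python
import csv
import io

def read_quant(handle):
    """Parse a Salmon quant.sf table into {Name: dict of numeric fields}."""
    rows = {}
    for rec in csv.DictReader(handle, delimiter="\t"):
        rows[rec["Name"]] = {
            "Length": int(rec["Length"]),
            "EffectiveLength": float(rec["EffectiveLength"]),
            "TPM": float(rec["TPM"]),
            "NumReads": float(rec["NumReads"]),
        }
    return rows

# Made-up excerpt of a quant.sf file (not a complete quantification,
# so the TPM column does not sum to one million here).
example = (
    "Name\tLength\tEffectiveLength\tTPM\tNumReads\n"
    "geneA\t1200\t1050.0\t15.2\t48.0\n"
    "gene_of_interest\t300\t150.0\t299.9976\t87.0\n"
    "geneB\t900\t750.0\t0.0\t0.0\n"
)

quant = read_quant(io.StringIO(example))
hit = quant["gene_of_interest"]
print(f"TPM={hit['TPM']}, NumReads={hit['NumReads']}")
```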

written 10 months ago by h.mon

Thanks for correcting me. I have downloaded the cDNA of the genome from Ensembl and examined the TPM and NumReads again.

The TPM is 299.9976 and NumReads is 87. Is it now a clear indication that the gene was expressed in that particular genome?

Cheers

written 10 months ago by saadleeshehreen

Yes, it is a clear indication it is expressed. You could plot a histogram of TPM values to have a visual indication of the level of expression of your gene of interest compared to the rest of the genes.
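As a rough, dependency-free stand-in for the histogram, the sketch below bins TPM values on a log10 scale, prints a text histogram, and reports what fraction of transcripts fall below the gene of interest. The list of TPM values is hypothetical apart from 299.9976; with real data it would come from the TPM column of quant.sf, and a plotting library could draw the same bins graphically.

```python
import math
from collections import Counter

def tpm_histogram(tpms, gene_tpm):
    """Bin TPMs by floor(log10(TPM + 1)) and rank the gene among them."""
    bins = Counter(math.floor(math.log10(t + 1)) for t in tpms)
    gene_bin = math.floor(math.log10(gene_tpm + 1))
    # Fraction of transcripts with a strictly lower TPM than the gene.
    frac_below = sum(1 for t in tpms if t < gene_tpm) / len(tpms)
    return bins, gene_bin, frac_below

# Hypothetical TPM column; only 299.9976 comes from the thread.
tpms = [0.0, 0.5, 3.2, 12.0, 45.0, 150.0, 299.9976, 800.0, 2500.0, 12000.0]
bins, gene_bin, frac_below = tpm_histogram(tpms, 299.9976)

for b in sorted(bins):
    marker = "  <- gene of interest" if b == gene_bin else ""
    print(f"10^{b} <= TPM+1 < 10^{b + 1}: {'#' * bins[b]}{marker}")
print(f"{frac_below:.0%} of transcripts have a lower TPM")
```

The +1 inside the logarithm is only there so that transcripts with a TPM of zero can be binned.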

written 10 months ago by h.mon