Question: Help with Salmon --> tximport --> edgeR
7 months ago, u3005992 wrote:

Hi all,

I am new to RNA-seq analysis. I am currently trying to use the Salmon, tximport, edgeR pipeline to process my human RNA-seq results on Galaxy. The cDNA library for my RNA-seq was generated by poly(A) selection.

I am a bit confused about the normalisation steps.

  1. For Salmon, I have aligned my reads to the human transcriptome and supplied the human GFF file to get the quant.genes.sf output. However, the TPM values are still labelled with ENST00000XXXXXX.X instead of ENSGXXXXXXXXXXX. Does that mean Salmon failed to recognise the GFF file, and my TPM values are still for transcripts and not genes?

  2. If Salmon failed to produce the correct quant.genes.sf files, I would like to use tximport to aggregate my transcripts to genes from my quant.sf files. But tximport offers 4 options for "Summarization using the abundance (TPM) values?": i) No, ii) scaled up to library size, iii) scaled using the avg. transcript length over samples and then the library size, iv) scaled using the median transcript length among isoforms of a gene, and then library size.

Which option should I use if I want to follow up with edgeR on Degust? Will I "over-normalise" my results if I choose the wrong option for edgeR?
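To make options ii) and iii) concrete, here is a minimal Python sketch of what the two scalings appear to compute, based on their descriptions above. It is not tximport's actual code: the function names, the toy numbers, and the use of a per-gene average length are assumptions for illustration only.

```python
# Illustrative sketch only: toy numbers, not real Salmon output.
# Each matrix is a list of rows (genes); each row has one value per sample.

def column_sums(matrix):
    """Sum each column (sample) of a list-of-rows matrix."""
    return [sum(col) for col in zip(*matrix)]

def counts_from_abundance(counts, tpm, lengths, use_length=False):
    """Derive counts from TPM, rescaled to each sample's original library size.

    use_length=False mimics option ii) (scaled up to library size).
    use_length=True mimics option iii): TPM is first multiplied by the
    gene's average length across samples, then scaled to library size.
    """
    lib_sizes = column_sums(counts)
    new = []
    for tpm_row, len_row in zip(tpm, lengths):
        factor = sum(len_row) / len(len_row) if use_length else 1.0
        new.append([t * factor for t in tpm_row])
    new_sums = column_sums(new)
    return [
        [v * lib / s for v, lib, s in zip(row, lib_sizes, new_sums)]
        for row in new
    ]

# Hypothetical data: 2 genes x 2 samples
counts = [[100.0, 300.0], [900.0, 700.0]]
tpm = [[200000.0, 500000.0], [800000.0, 500000.0]]
lengths = [[1000.0, 1000.0], [2000.0, 2000.0]]

scaled = counts_from_abundance(counts, tpm, lengths, use_length=False)
length_scaled = counts_from_abundance(counts, tpm, lengths, use_length=True)
```

In both cases the per-sample totals still equal the original library sizes; only the way the total is split between genes changes. The result is on the count scale rather than a matrix that has already been normalised between samples.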

Any help would be appreciated. Many thanks in advance!



If you already ran Salmon at the transcript level, there is no longer any need to provide it with a GFF file of human genome annotations (I don't think that will even work).

You can safely continue to tximport, which will do the summarisation to gene level.
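As a rough illustration of what gene-level summarisation amounts to, the sketch below sums transcript TPM values per gene using a transcript-to-gene map. The IDs, values, and the function name `summarise_to_gene` are placeholders made up for this example; tximport itself is an R package and its implementation may differ in detail.

```python
from collections import defaultdict

# Hypothetical transcript-to-gene map; in practice it is derived from the annotation.
tx2gene = {
    "ENST_A.1": "ENSG_X",
    "ENST_B.2": "ENSG_X",
    "ENST_C.1": "ENSG_Y",
}

# Toy per-transcript TPM values for one sample (the TPM column of a quant.sf file).
tx_tpm = {"ENST_A.1": 10.0, "ENST_B.2": 5.0, "ENST_C.1": 20.0}

def summarise_to_gene(tx_values, tx2gene):
    """Sum transcript-level values into gene-level values."""
    gene_values = defaultdict(float)
    for tx, value in tx_values.items():
        gene = tx2gene.get(tx)
        if gene is None:
            continue  # assumption: transcripts missing from the map are skipped
        gene_values[gene] += value
    return dict(gene_values)

gene_tpm = summarise_to_gene(tx_tpm, tx2gene)
```

Here the two transcripts of `ENSG_X` collapse into a single gene-level value, which is why the output rows carry gene IDs rather than ENST IDs.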

One thing you might consider doing is to use a transcriptome version with one transcript per locus?

written 7 months ago by lieven.sterck