Question: Transcript vs. Gene TPM counts?
gravatar for mike-zx
12 days ago by
mike-zx140 wrote:

Working with some GTEx portal data right now and I've noticed that at GTEx's downloads page there are both a "Gene TPMs" and a "Transcript TPMs" file. My question is how exactly do these files differ from each other in terms of the steps for obtaining such files? I guess another way to phrase it would be why are there two files like this if RNA-Seq is supposed to output reads for transcripts in general? I would expect only 1 file with all the transcripts from GTEx instead of one that makes a distinction of gene vs. transcript... I'm obviously missing out on something rather elemental here but I don't know what it is.

Another minor question is whether or not if it is safe to assume that data from these files is normalized. As I understand, the data being TPMs implies the read counts have been normalized in the process of converting to the TPMs themselves, but I'm not 100% sure about this Thanks for any help.

gtex rna-seq • 129 views
ADD COMMENTlink modified 11 days ago by kristoffer.vittingseerup2.4k • written 12 days ago by mike-zx140
gravatar for MatthewP
12 days ago by
MatthewP260 wrote:

Hey, RNA-seq can output both gene and transcript read counts. TPM is normalized data, yes.

ADD COMMENTlink written 12 days ago by MatthewP260

what is exactly being measured in "gene counts" though?

ADD REPLYlink written 12 days ago by mike-zx140

Reads mapped to the gene.

ADD REPLYlink written 12 days ago by shoujun.gu240

Mike, I am very sorry if I am being pedantic and what I say below is too simplistic.

Here, the word "transcript" does not mean the mRNA product of the gene. The "Gene" and the "Transcript" are those defined in the gene definition file (gtf, or gff/gff3). For example, Hoxa1 gene in human has two transcripts according to ensembl. So if GTEx has used ensembl gene definition the "transcript TPM" file will have two values while the "gene TPM" file will have only one value.

ADD REPLYlink written 11 days ago by vj410
gravatar for kristoffer.vittingseerup
11 days ago by
European Union
kristoffer.vittingseerup2.4k wrote:

1) Gene expression is obtained by summing the expression of all transcripts belonging to the same gene. 2) Yes TPM are normalised values - but you might still need to perform a inter-library normalization. You can read more about that here.

ADD COMMENTlink written 11 days ago by kristoffer.vittingseerup2.4k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1272 users visited in the last hour