Question: How to use the data from Affymetrix Human Exon 1.0 ST Array for microarray gene expression analysis?
3.4 years ago by
fernandov10 wrote:

Hello, I was reading this article from the TCGA research group about glioblastoma multiforme. It uses microarray data from three platforms, one of which is the Affymetrix Human Exon 1.0 ST Array, but this platform measures exons. I looked at the article's supplementary data on this website and found that they transformed the data so that it ends up as gene expression values. So my question is: how is that done?

I'm very new to this, so even if this is a silly question, the answer will help me a lot. Thanks in advance.

exon microarray tcga • 1.7k views
modified 5 months ago by ATpoint24k • written 3.4 years ago by fernandov10

This GUI may be able to help you:

written 3.4 years ago
3.4 years ago by
Charles Warden7.3k (Duarte, CA) wrote:

In the website link, LBL202.txt is a gene-level summary of expression values. This is the most helpful file if you want an overall expression level for each gene that you can compare with the other arrays, whose probes target different parts of the gene. A gene-level summary will also be more robust than the exon-level summarization.

If you have access to raw .CEL files for HuExon arrays, your question would be similar to this post:

Computing Expression From Affymetrix Exon Array Data


Thank you very much for your answer. I recently found that TCGA also provides the gene-level expression values (although when I am not working with TCGA data I will need the information in the links posted in this thread). And now I have another problem:

I downloaded data from TCGA using the Data Matrix option:

  1. Select a disease: GBM - Glioblastoma multiforme
  2. Center/Platform: LBL (HuEx-1_0-st-v2)
  3. Batch Number: Batch 1
  4. Data Level: level 3

Then I opened the file lbl.gov_GBM.HuEx-1_0-st-v2.1.gene.txt, which has 29 signal measures. But when I open the file FILE_SAMPLE_MAP, only 25 samples are listed for lbl.gov_GBM.HuEx-1_0-st-v2.1.gene.txt. What does that mean? Could you explain this?

modified 3.4 years ago • written 3.4 years ago by fernandov10

That is a good question - I can see that you would need that file in order to map the expression values to patients, and the number of columns in the gene file is greater than the number of samples in the mapping file.

However, if you look at the .sdrf.txt file, there are mappings for all the samples (in the "Normalization Name" or "Hybridization Name" columns), even though that file contains far more than 30 rows. It also looks like the discrepancy is at least partly due to control samples having been run, while the mapping file lists only patient IDs.
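
One way to see which columns of the gene file are missing from the mapping file is to compare the two sets of names directly. The snippet below is only a sketch: it assumes the gene file's first line is a tab-delimited header whose first field is the gene identifier, and the array and control names (`array_01`, `control_01`) are made up for illustration, not taken from the TCGA files.

```python
def unmapped_columns(matrix_header, mapped_names):
    """Return the data columns of the matrix header that the sample map does not list."""
    # Assumption: tab-delimited header, first field is the gene identifier column.
    columns = matrix_header.rstrip("\n").split("\t")[1:]
    mapped = set(mapped_names)
    return [name for name in columns if name not in mapped]


# Hypothetical header line and sample-map entries (not real TCGA identifiers).
header = "gene\tarray_01\tarray_02\tcontrol_01\n"
sample_map_names = ["array_01", "array_02"]

extra = unmapped_columns(header, sample_map_names)
print(extra)
```

Any names printed here could then be looked up in the .sdrf.txt file to check whether they correspond to control samples.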

written 3.4 years ago by Charles Warden7.3k
5 months ago by
ATpoint24k wrote:

The gene-level expression estimate is calculated from the several probes covering that gene / exon on the array. For an end-to-end workflow for Affymetrix arrays, see this paper on F1000Research.
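
To illustrate the idea of collapsing several probes into one gene-level value, here is a minimal sketch that takes the median of the log2 probe intensities for each gene in each sample. This is a deliberately simplified stand-in, not the summarization used by TCGA or by standard pipelines such as RMA (which also applies background correction and normalization). The probe IDs, gene names, and intensities are invented for the example.

```python
import math
from collections import defaultdict
from statistics import median


def summarize_genes(probe_intensities, probe_to_gene):
    """Collapse probe-level intensities to one log2 value per gene per sample.

    probe_intensities: {probe_id: [intensity in sample 0, sample 1, ...]}
    probe_to_gene:     {probe_id: gene name}; unannotated probes are skipped.
    Intensities must be positive, since they are log2-transformed.
    """
    rows_by_gene = defaultdict(list)
    for probe_id, values in probe_intensities.items():
        gene = probe_to_gene.get(probe_id)
        if gene is None:
            continue
        rows_by_gene[gene].append([math.log2(v) for v in values])
    # zip(*rows) regroups the values by sample, so the median is taken across probes.
    return {
        gene: [median(sample_values) for sample_values in zip(*rows)]
        for gene, rows in rows_by_gene.items()
    }


# Hypothetical toy data: three probes for GENE_A, one for GENE_B, two samples.
probes = {
    "p1": [8.0, 16.0],
    "p2": [16.0, 32.0],
    "p3": [32.0, 64.0],
    "p4": [4.0, 4.0],
}
annotation = {"p1": "GENE_A", "p2": "GENE_A", "p3": "GENE_A", "p4": "GENE_B"}

gene_levels = summarize_genes(probes, annotation)
print(gene_levels)
```

The median is used here because it is less sensitive to a single outlying probe than the mean, which is one reason a gene-level summary tends to be more stable than individual probe or exon values.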
