Question: Annotation Of Affymetrix Human Exon St. 1.0
5
gravatar for Some
9.1 years ago by
Some50
Some50 wrote:

Dear community,

it is the first time for me working with the Affy Human Exon St. 1.0. I followed the instructions of the 'oligo' package in Bioconductor - doing the RMA with 'core' parameter. Thus, I get an ExpressionSet and an expression matrix using exprs(exonCore).

Now I wanted to map the Affy Probe Ids using Biomart - so I got a lookup table, having all ids from the HuEx St.1.0 and e.g. RefSeq IDs.

When I do a simple lookup, only 1365 out of 22011 have RefSeq IDs.

1) What am I doing wrong? Why is that?

2) How is the best way to retrieve annotation?

3) My biologists want to see the expression of several genes - one value! Not only probe ids. What is the best way to do this?

Thank you very much in advance

ADD COMMENTlink modified 22 months ago by popantrop40 • written 9.1 years ago by Some50
7
gravatar for Neilfws
9.1 years ago by
Neilfws48k
Sydney, Australia
Neilfws48k wrote:
  1. The best way to retrieve annotation is definitely BioMart. If you are not using it already, try the Bioconductor biomaRt package. Search this site and you will find some examples of how to use it. I also wrote a couple of short tutorials: here and here.
  2. It is possible that some of your probeset IDs do not map to a RefSeq ID, but 1365/22011 seems like a very small proportion. However, note that there are ~ 1.4 million exon probeset IDs, but currently, 531 333 are represented in the BioMart database. This issue came up in a question on the aroma.affymetrix list, but take care with the code example in the answer (it contains incorrect spaces).
  3. If your biologists want to see expression values for single genes, not exons, perhaps they should not have used a platform designed specifically for exon expression! People deal with this in a couple of ways. One is to combine exon values for each gene (e.g. by taking the median). Another is to generate plots for each gene showing the exon expression values: these can be very informative. The packages GenomeGraphs and exonmap are good tools for this task.
ADD COMMENTlink modified 7 months ago by RamRS26k • written 9.1 years ago by Neilfws48k

your tutorials look great, thanks

ADD REPLYlink written 9.1 years ago by Some50
2
gravatar for Bert Overduin
9.1 years ago by
Bert Overduin3.7k
Edinburgh Genomics, The University of Edinburgh
Bert Overduin3.7k wrote:

Hi,

Your BioMart data indeed don't make sense to me:

If I use BioMart v61, it turns out that of the 53630 annotated human genes, 21450 have a RefSeq DNA ID. Of these 21104 have a probe from the Affy Human Exon St. 1.0 mapped to them.

If I only look at the protein-coding genes, out of 21244 Ensembl genes, 18976 have a RefSeq DNA ID. Of these 18932 have a probe from the Affy Human Exon St. 1.0 mapped to them.

ADD COMMENTlink written 9.1 years ago by Bert Overduin3.7k

nice stats, thanks for doing that.

ADD REPLYlink written 9.1 years ago by Some50
1
gravatar for Michael Imbeault
9.1 years ago by
Michael Imbeault30 wrote:

Use the standaloneannotation functions from OneChannelGUI in BioC - you don't have to use the actual GUI, just look in the docs for those functions.

ADD COMMENTlink written 9.1 years ago by Michael Imbeault30

thank you, i will check that out

ADD REPLYlink written 9.1 years ago by Some50
0
gravatar for popantrop
22 months ago by
popantrop40
popantrop40 wrote:

Your specific problem was most likely the fact that at the "core" annotation level RMA produces a file annotated with transcription cluster IDs, and biomart only speaks probeset IDs. So using biomart you can't translate from transcription cluster IDs to gene names.

I've produced a csv file, which maps from transcription cluster IDs to HGNC gene names (per most recent genecode annotations).

ADD COMMENTlink written 22 months ago by popantrop40
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1517 users visited in the last hour