Question: Highly Expressed Genes
gravatar for virag
8.9 years ago by
virag30 wrote:

Hi all,

I am looking into some bacterial genomes. For the moment though, it is OK to assume that I am dealing with just E coli. I want to get an idea of the genes which are highly expressed in E coli.

One of the most common techniques is to first of all identify a set of known highly expressed genes (which are generally ribosomal protein gene). Then use this reference set (ribosomal proteins) to get a codon usage table (CUT) for highly expressed genes. And then calculate Codon Adapatation Index (CAI) for each gene using the CUT generated in the previous step. And subsequently rank genes based on their CAI value This is a classical method which was first published by Paul Sharp and his colleagues in 1987. Since then people have come up with minor variants of CAI, but the principle essentially remains the same.

However, I was wondering if I could make use of some publicly available microarray data to get a list of highly expressed genes instead of using some theoretical measure like CAI. I do not know if it is really possible since microarrays are designed for relative studies implying that all the differentially expressed genes that are identified by microarrays show a relative overexpression or underexpression. i.e when a wild type is compared to say a drug treat condition, then all the genes which will be found to be overexpressed in wild type condition may not necessarily be highly expressed. But they could be highly expressed relative to their expression in, say drug treated state. So is there a way to use data from NCBI's GEO to get a list of experimentally determined highly expressed genes in wild type condition.

Thanks and regards Sankalp

gene expression microarray • 3.3k views
ADD COMMENTlink modified 8.9 years ago by seidel7.4k • written 8.9 years ago by virag30

Highly expressed in E coli in terms of quantity ? or are you looking at essential genes ?

ADD REPLYlink written 8.9 years ago by Khader Shameer18k
gravatar for seidel
8.9 years ago by
United States
seidel7.4k wrote:

In general there are two kinds of microarrays, those that are comparative and measure relative values as you describe, and those that measure absolute values of a given transcript based on hybridization intensity (such as arrays produced by affymetrix). Affymetrix produces an array for E. coli, so you could grab data from GEO that has been produced using those arrays and rank transcripts by their hybridization intensity - which is a general proxy for "abundance". I would imagine you could also find some RNA Seq data for various prokaryotes, and do something similar to find high abundance transcripts.

ADD COMMENTlink written 8.9 years ago by seidel7.4k

Thanks Seidel, I was not aware of microarrays that are used to get a measure of transcript abundance in absolute terms though the thought of using RNA-seq data did cross my mind at some stage. Thanks a lot.

ADD REPLYlink written 8.9 years ago by virag30

Microarrays provide only a crude measure of abundance, in general, but it might very well be better than using CAI.

ADD REPLYlink written 8.9 years ago by Sean Davis26k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2607 users visited in the last hour