Normalisation Of Microarray Data
1
0
Entering edit mode
12.9 years ago
Ananth • 0

I want to write a simple program to calculate the fold change from Affymetrix soft files. Should normalisation be carried out for soft files (if so how ?) or are the soft files already normalised ?

data microarray • 4.0k views
ADD COMMENT
0
Entering edit mode

Where are the SOFT files from? If they're from a public repository, there should be metadata describing how they were processed. Can you give us an accession number?

ADD REPLY
3
Entering edit mode
12.9 years ago

Edited after comment by Neil who pointed out that the SOFT file can in fact contain data.

The format of the the SOFT file is described here: http://www.ncbi.nlm.nih.gov/geo/info/soft2.html). For Affymetrix it will normally not contain expression result data. These are contained in external raw data files, e.g., Affymetrix .CEL files which are referenced within the SOFT file using the !Sample_supplementary_file attribute.

The Affymetrix files will usually be bundled in an archive that contains the real data. For Affymetrix that will normally be .CEL files, which you would need to normalize. (You could use our arrayanalysis.org for that but of course there are many other options).

ADD COMMENT
1
Entering edit mode

Is that strictly true? SOFT files can contain data tables after a "!Sample_table_begin" line, like this example file: http://www.ncbi.nlm.nih.gov/geo/info/soft_ex_affy.txt. The data table may or may not be normalized. But it's true that you need raw CEL images for true normalization.

ADD REPLY
0
Entering edit mode

I'm also confused about this. Whether the data in "!Sample_table" is normalised or not. And how can I know that.

ADD REPLY
0
Entering edit mode

You need to look at the metadata in the SOFT file. In the link to the example file, for example, you'll see "#VALUE = MAS5-calculated Signal intensity", which tells you that the arrays were processed using the MAS5 algorithm. Usually if unprocessed, values will be described as "raw".

ADD REPLY
0
Entering edit mode

If you are still in doubt after checking the metadata you can also look at the intensity distributions for the arrays. These should be centered around the same value when the data are normalized.

ADD REPLY

Login before adding your answer.

Traffic: 1328 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6