I am trying to be sure about the working of background adjust using MAS5 in R with relation to filtering. I have 9 samples all based on MoGene-1_0-st-v1.r3. This chipset only has PM intensities.
Here is what I actually want to achieve,
- Determine the highest brightness value that is considered to be equivalent to no expression, using PM intensity values
- Remove genes under this threshold from the data if none of the samples have any values over this threshold.
My questions are:
- Is this included in background adjust (RMA function in R)?
- Is this a sensible way of filtering data, are there any better alternative?
I am aware of the function:
nsFilter(exprSet_rma,require.entrez=TRUE,remove.dupEntrez=FALSE, var.filter = TRUE, var.cutof=LOG2_EXPRESSION_MEASURE_CUT_OFF_BY_QUANTILE)$eset;
but I am unsure what LOG2EXPRESSIONMEASURECUTOFFBYQUANTILE stands for and how to use it to do what I actually intend to do.]
Is there a proble that gives this intensity threshold information and how can I access it?
Thanks for answering my question. After some reading I realized how wrong I was. Although I would like to ask you this to implement your own function for deciding 'no expression', how is the threshold usually decided, if not a rule, general guidelines might help as well. Thanks again
For general guidelines, you can read the help page of genefilter and related publications. As already given above, in expression arrays it is taken as rule of thumb that about half of them will not be expressed. But you should plot several diagnostic plots of your data to arrive at a decision.