I have a microarray dataset (700 patients) in which I identified genes whose expression correlates with an oncogene of interest. From these I built two gene sets:
1) the genes most strongly correlated with the oncogene
2) the genes most strongly anti-correlated with the oncogene
I would like to separate the patients into two groups: those in whom the gene signature is present and those who do not express it.
In the end, I would like to generate a dummy variable (1 if the signature is present in the patient, 0 if not). How can I establish whether the signature is present in a given patient? Is there also a test or metric to evaluate whether this is significant? If you could also suggest how to implement this in R, that would be great.
Thank you in advance for your help,