Hello, I'm a beginner in bioinformatics and I have some question about cmap, precisely, regarding Landmark gene selection. I am grateful for all of your responses. here are a few questions. It's a question about paper "A Next Generation Connectivity Map: L1000 Platform and the First 1,000,000 Profiles. Subramanian A, Narayan R et al
- I want to learn about the Affymetrix HGU133A microarray gene expression dataset that was exactly used when choosing the 978 landmark genes. I've tried to find some data, but there are so many GPLs with Affymetrix HGU133A microarray, and their number of rows is not 12063 as mentioned in the published paper. I think the data may have been updated, or perhaps I'm misunderstanding something.
- I'm curious about the PCA method in paper. What software tool did you use to obtain the 386 components?
I'm researching how to improve inference, so I want to understand how you reduced the 12063 genes to 978 genes. Thank you for reading, and thank you for your help. I'll be waiting for your response!
thank you for reply, but i want some information for what microarray data are selected for L1000. thank you!
Link for the data is included above.