I am a complete beginner at 'R', so bear with me. Currently, I have an RNA seq data set that identifies two clusters (High exp samples and Low exp samples) based on a gene set. I now want to explore the differential expression of the remaining 20k genes between these two clusters. My advisor has told me to use limma, and I found a pretty useful link : RNA-Seq analysis is easy as 1-2-3. My approach was to conduct the limma analysis separately for the two clusters. Half way through my attempt I began to question if that was right, as the article kept mentioning DGE compares between conditions..
My question is: How do I prepare my RNA-Seq data. Currently, it is framed as Rows: Samples Columns: Genes How do I factor in the two clusters in this data frame, to use for DGE? If there is a complete dumbed down version, please do tell. Any help is appreciated.
Thank you for being kind enough to explain the crux of this. Yeah, I understand that it is necessary to take the time to go through tutorials, but it's frustratingly slow. I guess I will have to go through it any which ways. Thank you again!