I used calcNormFactors from edgeR to normalize my RNA seq data matrix with raw counts. Here is the code I am using. d_des is my design matrix and cm is my contrast matrix
counts_matrix <- as.matrix(raw_counts_dt, rownames = TRUE)
de_List <- DGEList(counts_matrix)
de_List %<>% calcNormFactors
res_Voom <- voom(de_List, d_des, plot = TRUE)
lm_Fit <- lmFit(res_Voom, d_des)
eb_Fit <- eBayes(contrasts.fit(lm_Fit, cm), trend = F, robust = T)
I have the following questions:
- If I understand the document correctly,
TMMis the default normalization method? Is this the correct method to be used for normalizing raw counts to be used for DE analysis? If not which one should be used?
what is it normalized for? It is normalizing for all genes in my list across all samples ie. replicates for each groups?
Is the
voomusing the normalized counts from myde_Listor is it still using the raw counts? If it is using raw counts, how could I use the normalized counts?I am using
res_Voom$Ematrix to create the PCA plot to compare my conditions. Is this using the normalized counts or the raw counts? how do I use normalized counts if it is not?I am using
topTablefromeb_Fitto list the top differentially expressed genes. Is this the correct table to use?
Thank you in advance for your answers!