How to Normalize and log tranform the miRNA Seq data using DeSeQ2 / EdgeR?
1
1
Entering edit mode
3.7 years ago

I wish to obtain log-transformed and upper quantile normalized expression data for my miRNA-Seq raw count data. How should I implement the script using EdgeR or DeSEQ2? The default option in DeSEQ2 is TMM normalization. Is there a way out for obtaining upper-quartile as in EdgeR? How can i accommodate in my script.

rawCountTable <- read.delim(file.choose(), row.names=1)                                   ## miRNA-Seq raw data##
Col_data = read.table(file = "COL_LUAD_miRNA.txt", header = T, sep = "\t")     ## miRNA-Seq Annotation data##
dgeFull <- DGEList(rawCountTable, group = Col_data$Condition)
dgeFull <- DGEList(dgeFull$counts[apply(dgeFull$counts, 1, sum) != 0, ], group=dgeFull$samples$group)
dgeFull <- calcNormFactors(dgeFull, method="upperquartile")
dgeFull <- estimateCommonDisp(dgeFull)
normCounts <- cpm(dgeFull, log=TRUE, prior.count = 0.25)
## Perform batch correction of normalized and log transformed values##

After implementing this script in EdgeR I'm obtaining some negative values in my expression data at the end.

R rna-seq software error • 1.7k views
ADD COMMENT
0
Entering edit mode

You obtain some negative values because you log transform count >0.25. Every log transformation from 0.25 to 1 will be end up negative. Try to set up prior.count to 1.

ADD REPLY
0
Entering edit mode
3.7 years ago
Gordon Smyth ★ 7.0k

Your script already does what you say you want. Negative values on the log-scale are not a problem, although you could reduce the number of negative values by leaving prior.count at the default value instead of setting it so small.

ADD COMMENT

Login before adding your answer.

Traffic: 2928 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6