how to get Differentially expressed genes from TCGA normalized data.

0

Entering edit mode

6.2 years ago

oghzzang ▴ 50

I'm working on datas from TCGA's pipeline. we have 'rsem_isoforms_normalized.txt', 'rsem_gene_normalized.txt', of 215 samples. and this samples have number of 20501 genes.

I have these question:

I want to get differentially expression genes from 215 data. Group 1 dim is 195 (samples)* 20501 (genes) Group 2 dim is 20 (samples)*20501 (genes)

I have already run 2 t-tests

multtest's mt.maxT in r

PP=mt.maxT(Counts, groups, test="t", B=5000)
PP$fdr=p.adjust(PP$rawp, method = "fdr")

number of FDR =< 0.05 genes are only 35.

stats's t.test in r

t.result <- apply(TotalCounts, 1, function(x) t.test(x[1:ncol(Counts.C)], x[ncol(Counts.C)+1:ncol(TotalCounts)], paired=F, var.equal = F))
f$p_value <- unlist(lapply(t.result, function(x) x$p.value))
f$fdr <- p.adjust(TotalCounts$p_value, method = "fdr")

number of FDR =< 0.05 genes are 2000.

this 2 group's adjusted p values by BH are completely different.

In this situation, how can I get DEGs?

rna-seq • 1.5k views

ADD COMMENT • link updated 6.2 years ago by Kevin Blighe 87k • written 6.2 years ago by oghzzang ▴ 50

1

Entering edit mode

You might consider using either edgeR or DESeq2, both of which are designed to deal with RNA-seq data.

ADD REPLY • link 6.2 years ago by Sean Davis 26k

0

Entering edit mode

Thanks Davis,

Can I use edgeR by using quartile normalized RNA-seq data?

quartile normalization is conducted to make 'rsem_gene_normalized.txt'

ADD REPLY • link 6.2 years ago by oghzzang ▴ 50

Login before adding your answer.