Question

Finding DEGs using RNA-seq data

0

Entering edit mode

7.8 years ago

noorpratap.singh ▴ 330

Hey, it might seem a basic question but I wanted to find degs using RNA-seq data. First thing for finding degs should I take only tumor and tumor adjacent normal or should I compare the entire tumor samples even though for the patient I might not have normal samples against the few normal samples.

Further before finding DEGs the normalisation which we have to do has to applied on the entire samples and not only on those belonging to the same pateint?

At the moment I have a count matrix but as such if I look at total ensembl ids(row names) they are more than 60000. I understand they might contain noncoding regions so I map them to entrez genes but even after that I have some 24000 ids. Any further filtering which I need to apply.

Thanks in advance.

RNA-Seq DEGs • 2.0k views

ADD COMMENT • link 7.8 years ago by noorpratap.singh ▴ 330

score 2 · Accepted Answer · 2016-07-13

2

Entering edit mode

7.8 years ago

Devon Ryan 104k

You can exclude any samples without matching controls.
You should be using a package from Bioconductor like DESeq2 or edgeR. This will have methods for normalization, just read the appropriate vignette.
The genes should be rows, not columns, so transpose your matrix. Do not filter further unless it's by something like mean expression level (this is again covered in the aforementioned vignettes).

ADD COMMENT • link 7.8 years ago by Devon Ryan 104k

0

Entering edit mode

Thanks, 1)It was typing error genes were rows. 2)For normalisation do I need to exclude the other samples.