Question

Rnaseq using only coding genes

9.8 years ago

maurices ▴ 10

Hi!!!

If I restrict the counts used in DESeq2 to reads from coding genes only, excluding pseudogenes, are the results reliable?

deseq2 • 2.2k views

updated 3.4 years ago by Ram 45k • written 9.8 years ago by maurices ▴ 10

Ram · Answer 1 · 2016-02-06

9.8 years ago

Devon Ryan 105k

Sure, most people don't bother analyzing pseudogenes anyway.

Thanks so much!!

If I take all my RNA transcripts and annotate them using BioMart, I find many antisense transcripts, pseudogenes, and coding sequences. If I run unsupervised analyses on all RNA transcripts, my clusters are not primarily related to the factor I am trying to investigate. By contrast, if I use only coding genes, I find interesting things!

So is there no bias in choosing only protein-coding genes?
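
For illustration, the subsetting described here might look like the minimal Python sketch below. The gene IDs, biotype labels, and counts are made-up toy values, and `keep_biotype` is a hypothetical helper, not part of DESeq2 or BioMart; in practice the biotype table would come from a BioMart export. The idea is to filter the raw count table before it is handed to DESeq2.

```python
# Toy annotation: gene ID -> biotype (made-up values for illustration only).
biotypes = {
    "geneA": "protein_coding",
    "geneB": "processed_pseudogene",
    "geneC": "antisense",
    "geneD": "protein_coding",
}

# Toy raw counts: gene ID -> one integer count per sample (made-up values).
counts = {
    "geneA": [120, 98, 143],
    "geneB": [5, 0, 7],
    "geneC": [30, 41, 22],
    "geneD": [0, 12, 9],
}


def keep_biotype(counts, biotypes, wanted="protein_coding"):
    """Return only the rows of `counts` whose gene has the wanted biotype.

    Genes missing from the annotation are dropped as well.
    """
    return {
        gene: row
        for gene, row in counts.items()
        if biotypes.get(gene) == wanted
    }


coding_counts = keep_biotype(counts, biotypes)
print(sorted(coding_counts))
```

Since DESeq2 estimates its size factors from whichever genes are in the matrix, subsetting before the analysis may shift the normalization slightly compared with subsetting the results afterwards.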

updated 5.9 years ago by Ram 45k • written 9.8 years ago by maurices ▴ 10

Maybe there's a bias, maybe not. If these are datasets that weren't created by you then I imagine that library type and/or percentage of poly-A RNA is what's really mucking with what you hope to see (so just looking at protein coding genes helps minimize that).
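
One rough way to check whether library type or poly-A content differs between samples is to look at the share of each sample's counts that falls in protein-coding genes. A minimal Python sketch, using made-up toy counts and a hypothetical `coding_fraction` helper (neither comes from the thread or from DESeq2):

```python
# Toy annotation and raw counts (made-up values for illustration only).
biotypes = {
    "geneA": "protein_coding",
    "geneB": "processed_pseudogene",
    "geneC": "antisense",
}
counts = {
    "geneA": [80, 50, 90],
    "geneB": [10, 30, 5],
    "geneC": [10, 20, 5],
}


def coding_fraction(counts, biotypes, wanted="protein_coding"):
    """Per-sample fraction of all counts that belong to `wanted` genes."""
    n_samples = len(next(iter(counts.values())))
    total = [0] * n_samples
    coding = [0] * n_samples
    for gene, row in counts.items():
        is_wanted = biotypes.get(gene) == wanted
        for i, c in enumerate(row):
            total[i] += c
            if is_wanted:
                coding[i] += c
    # Guard against empty samples to avoid division by zero.
    return [c / t if t else float("nan") for c, t in zip(coding, total)]


fractions = coding_fraction(counts, biotypes)
print(fractions)
```

A wide spread in these fractions across samples would hint that the libraries differ in composition, which is one reason restricting to protein-coding genes could make the clustering look cleaner.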

updated 5.9 years ago by Ram 45k • written 9.8 years ago by Devon Ryan 105k