Question

DGE analysis suggestion: linearity between TF and target gene expression levels


8.8 years ago

bharata1803 ▴ 560

Hello,

I have done a gene expression analysis using DESeq2 and now have several genes that are differentially expressed between normal and cancer samples. I am trying to work out what causes the differential expression. I also have exome-seq variant data for the same samples (obtained from NCBI GEO). To find the cause for a DE gene, I listed all the TFs for that gene that I could find on the GeneCards website. My method is really simple: compute the Pearson correlation coefficient between the TF and the target gene, then plot the pair in a scatter plot to check linearity. With this method I found an interesting result: one TF correlates strongly with the down-regulation of my target gene (the target gene and the TF gene are both down-regulated). The other TFs did not show this. I used the assay function on the DESeq2 object to get the normalized read counts, not the raw counts or log-transformed expression levels.
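For readers who want to reproduce this kind of screen, here is a minimal sketch in plain Python. It assumes the normalized counts have already been exported from DESeq2; the gene names and count values below are hypothetical placeholders, not data from this post.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)

# Hypothetical normalized counts, one value per sample
# (first three samples normal, last three cancer).
target = [900.0, 850.0, 800.0, 300.0, 250.0, 200.0]
tf_expression = {
    "TF_1": [500.0, 480.0, 450.0, 150.0, 140.0, 100.0],
    "TF_2": [120.0, 300.0, 90.0, 310.0, 100.0, 280.0],
}

# Correlate each candidate TF with the target gene across samples.
correlations = {name: pearson(values, target)
                for name, values in tf_expression.items()}

# Rank candidates by absolute correlation, strongest first.
ranked = sorted(correlations, key=lambda name: abs(correlations[name]),
                reverse=True)
for name in ranked:
    print(f"{name}: r = {correlations[name]:.3f}")
```

Note that when the samples fall into two groups (normal and cancer), any two genes that are both differentially expressed will tend to correlate, so a high coefficient here is a lead for follow-up rather than evidence of regulation.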

My question is, is my method acceptable, both biologically and mathematically?

Also, most TF and target gene pairs did not show any strong linear correlation. Do I need to use a non-linear correlation method instead?

Thank you for your suggestion.

RNA-seq • 2.1k views

ADD COMMENT • link updated 16 months ago by Ram 43k • written 8.8 years ago by bharata1803 ▴ 560


What did you calculate the Pearson correlation for, i.e. which values are you comparing with each other?

ADD REPLY • link 8.8 years ago by Friederike 8.9k


I use the values from the DESeq2 object, which I get with the assay function. I use the Pearson correlation coefficient just to check which TFs have a strong linear correlation before I plot them in a scatter plot.

ADD REPLY • link 8.8 years ago by bharata1803 ▴ 560


And what value did you use for the TF?

ADD REPLY • link updated 16 months ago by Ram 43k • written 8.8 years ago by Friederike 8.9k


The same: I compare the expression levels from DESeq2 for the target gene and the TF.

ADD REPLY • link 8.8 years ago by bharata1803 ▴ 560

Ram · Answer 1 · 2015-06-30


8.8 years ago

Devon Ryan 104k

While a non-linear relationship is probably more biologically realistic, if you get a good candidate using a simple linear relationship then that's a good start. You'll obviously need to do more experiments (e.g., is it expressed in your cells of interest, what happens if you inhibit/activate it in your system, ...), but you seem to have a good start on things. BTW, keep in mind that when you test many candidates, you'll randomly get some significant findings simply due to testing many things (i.e., always be wary of multiple comparisons).
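To make the multiple-comparisons point concrete, here is a minimal sketch of one common correction, the Benjamini-Hochberg false discovery rate adjustment. The answer does not name a specific method, so this choice is an assumption, and the TF names and p-values are hypothetical.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(p_values)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted

# Hypothetical p-values, one per TF/target correlation test.
tf_p_values = {"TF_A": 0.001, "TF_B": 0.02, "TF_C": 0.03, "TF_D": 0.5}
names = list(tf_p_values)
adjusted = dict(zip(names, benjamini_hochberg([tf_p_values[n] for n in names])))

for name in names:
    print(f"{name}: p = {tf_p_values[name]}, adjusted = {adjusted[name]:.3f}")
```

The more TFs are screened per target gene, the larger the adjustment becomes, which is why a single strong hit among many candidates still needs experimental follow-up.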

ADD COMMENT • link 8.8 years ago by Devon Ryan 104k


Thank you for your reply. I've read in another post that Spearman correlation is also good for checking this, because it can capture some non-linearity.

ADD REPLY • link updated 16 months ago by Ram 43k • written 8.8 years ago by bharata1803 ▴ 560

0

Entering edit mode

Yeah, I would agree that a rank-based method like Spearman's correlation would be preferred.
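As an illustration of why a rank-based measure can help, the sketch below compares Pearson and Spearman coefficients on a monotonic but non-linear relationship. The saturating curve and the count values are invented for the example and are not a claim about how any real TF behaves.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)

def ranks(values):
    """1-based ranks; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result

def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))

# Hypothetical normalized counts: the target saturates as the TF increases.
tf_counts = [10.0, 20.0, 40.0, 80.0, 160.0, 320.0, 640.0, 1280.0]
target_counts = [1000.0 * t / (t + 100.0) for t in tf_counts]

r_pearson = pearson(tf_counts, target_counts)
r_spearman = spearman(tf_counts, target_counts)
print(f"Pearson:  {r_pearson:.3f}")
print(f"Spearman: {r_spearman:.3f}")
```

Because the relationship is strictly increasing, the ranks agree perfectly and Spearman's coefficient reaches 1, while Pearson's is lower because the curve is not a straight line. Spearman still only detects monotonic trends, so a scatter plot remains worth checking.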

ADD REPLY • link 8.8 years ago by Devon Ryan 104k