I have a question about fold change and how it should be calculated for enhancer regions. I am aware that packages such as DESeq2 and edgeR will calculate the fold change for genes, but I am interested in calculating fold change at enhancers.
I am quite naive and new to RNA-seq data, and it's possible that something like this is not entirely feasible, but i'd rather ask the experts.
So my questions can be summed up as the following:
Is it possible to calculate the fold change of 'expression' at enhancer regions? Is this something that can be feasibly done with RNA-seq data?
Is the information in this thread still relevant and if so could I implement this idea of log2 coverage / read counts at enhancer regions to compare against enhancer associated promoter fold change?
I am trying to use this information to answer whether there is any correlation between eRNA expression at enhancers and gene expression at promoters. One thought is that if I have high eRNA expression, I would see high gene expression in the 'associated' gene.
I appreciate all the help! I'm looking to learn, so any information regarding anything in this post is appropriate.
I have access to both GRO-seq and RNA-seq datasets. My RNA-seq datasets are three replicates of WT cells and 3 replicates of KO cells where my TF (identified to be found both at promoter regions and my enhancer regions of interest) has been knocked out.
Could you explain the concept of computing FC globally and randomization controls a bit more? I'm going through some google searches now, but any links or explanation would be appreciated.
some colleagues in http://www.ncbi.nlm.nih.gov/pubmed/23728302 did the analysis in the following way :
-- took a set of 1000 enhancers bound by estrogen receptor -- computed for each enhancer the counts of eRNAs (based on GRO-seq reads) -- let's say : enhancer 1 has 10 reads in -E2, and 20 reads in +E2 .... enhancer n has 15 reads in -E2, and 25 reads in +E2 -- then you can represent as boxplots, the number of reads on ALL 1000 enhancers in -E2, and in +E2 : i.e. a boxplot will have the values 10, 15 ...in -E2 and another boxplot will have the values of 20, 25 in +E2. -- you can do a t-test between the boxplot in -E2 and boxplot in +E2.
This paper combined with your other answer have solved the majority of my problem. I just need to learn how to calculate log2FC and I should be good!