I have some sequence data which are not expression. Is it possible to do GSEA or GO enrichment?
My understand:If I want to perform GSEA, I need expression level data.
My data are like this:
- A set of pair regions which interact with each other (RNA-DNA);
- CAGE-seq data (several regions which has "cap", probably they are TSSs or very close to enhancers or promoters).
I tried CAGE-seq (mainly) with GREAT and chipenrich (used as ChIP-seq). I think all of these methods focused on the areas close to the regions I have. So I think it could be a little dangerous because sometimes enhancers will not only regulate areas right near? The results may not be very accurate?
So my questions are:
- For non-coding regions, are there any standard workflow for GSEA and GO enrichment?
- If I use the interacted regions to do with GREAT or chipenrich, will this be reliable? Because I think 2 regions if they have interaction, the GREAT analysis or chipenrich analysis based on these regions are much reliable.
Sorry for late reply, and thank you so much for comment!
I am using assign statistics for CAGE peak right now. I think this could work
About interaction, your understand is correct but this interaction also includes the "physically close". Probably they don't have interaction, they were captured just because they are very close at that time. So I may choose like promoter-enhancer.