Question: Workflow to store and integrate multiple NGS datasets for easy access and analysis
gravatar for urjaswita
3.3 years ago by
United States
urjaswita90 wrote:

Our lab has a variety of NGS datasets that have been accumulating for many years (RNA-Seq, ChiP-Seq, DNase-Seq etc.) for different conditions and projects. In addition there are many relevant public datasets that are available too (e.g. ENCODE). I was wondering how to best organize and store these datasets so that an integrative analysis can be readily done.

I guess simplest can be that I have respected files for each processed file (e.g. ChiP-Seq peaks, RPKM values for RNA-Seq etc.) for different condition, but then it's difficult to summarize all the data for any new person. Have someone experienced similar issues, and found or developed a useful pipeline to store and integrate multidimensional genomic datasets?


rna-seq chip-seq next-gen ngs • 853 views
ADD COMMENTlink written 3.3 years ago by urjaswita90

Keeping lists of genes in GMT format from each experiment is a lightweight approach. Differential RNA-seq data can be collapsed into rank files which can be analysed using spearman correlation.

ADD REPLYlink written 3.3 years ago by mark.ziemann1.3k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1997 users visited in the last hour