Question: Integrative analysis of omics studies using machine learning
0
gravatar for Firas
13 months ago by
Firas0
Firas0 wrote:

Hi All,

I would like to use public omics datasets (ChIP-seq, RNA-seq, and ATAC-seq) from different studies to do an integrative analysis as follow:

  1. Normalise samples, within each type of omics, from different public datasets.
  2. Convert the normalised values into a uniform scale to make the comparison between ChIP-seq, RNA-seq and ATAC-seq possible.
  3. Feed the normalised uniformed values into machine learning to infer one feature (e.g. RNA expression) from other features (e.g. TF or histone marks ChIP-seq).

Does anyone have experience with this type of analysis? I would like to hear about preferable approaches, problems, caveats, etc.. that I need to worry about / take care of before I start working on it.

Many thanks.

ADD COMMENTlink modified 9 weeks ago by Kevin Blighe52k • written 13 months ago by Firas0

Hi Firas, We are interested to do exactly the same thing for diferent plant mutants. have you got positive experience with that?

we can interchange experiences by email, if you´r interested!!

ADD REPLYlink written 9 weeks ago by rnadi0

Please use ADD COMMENT, not the answer field. Fyi, OP has not logged-in for 9 months, unlikely to get a response.

ADD REPLYlink written 9 weeks ago by ATpoint26k
2
gravatar for Kevin Blighe
9 weeks ago by
Kevin Blighe52k
Kevin Blighe52k wrote:

The big question back to you is this: which biological question (or questions) are you hoping to answer by performing a 'multi-omics' analysis and 'machine learning'? From what I can see, and please correct me if I am incorrect on this, the majority of people are doing these studies without knowing the end goal, and/or without even understanding why they are doing them in the first place.

I write on related topics, here:

You can try your best to process these datasets to the point where they are on the same data distribution, however, by then, you may have processed them so much such that much information has been lost in the process. Elsewhere, for example, I saw a Scientific Reports manuscript whereby the authors managed to encode each 'omics' dataset as a kernel and summarises the data into matrices of binary values, which seems meaningless to me.

Kevin

ADD COMMENTlink written 9 weeks ago by Kevin Blighe52k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1980 users visited in the last hour