I am planning to develop an De-novo assembly tool similar to Trinity which will make use of Hadoop framework.
In order to do so I would use a hadoop de-novo assembler (cloudbrush or contrail) and add the analysis steps (gene expression, gene functionality, etc), so that we have an automated "pipeline" tool to perform analysis scenarios automatically.
This is going to be my Thesis project, so I wanted to ask if anyone knows of a similar work done already.
After searching for several hours I can't find anything identical, the most similar concept seems to be galaxy-hadoop integration, which is a different thing as far as I understand, since you need to write your own hadoop tools and then wrap them to galaxy.
Am I missing something?
Thanks for your time in advance,