Tool:Generate unit-tested, ready-to-run pipelines using natural language prompting
0
0
Entering edit mode
11 weeks ago
Kevin ▴ 50

Hey Biostars!

I'm building a web app that lets you generate ready-to-run bioinformatics pipelines using a simple chat interface.

Here's how it works:

  1. Describe your desired pipeline, in natural language. For example: "Build me a variant-calling pipeline for Illumina paired-end human data that uses GPU acceleration" or something like "a small RNA-seq pipeline with the following steps: fastqc -> cutadapt -> bowtie2 -> samtools -> deseq2"
  2. An AI-powered system builds your pipeline then performs live unit tests on each step, re-trying until each step produces appropriate output.
  3. Run your pipeline in the cloud on your own data, on our SOC2-compliant web platform.

I'll be sending out beta invites starting at the end of Feb. If you're interested, you can sign up for the waitlist here: https://shire.bio

Please let me know what you think! I'd love to hear your requests, doubts, concerns, questions, etc :)

NGS pipeline genomics cloud AI • 693 views
ADD COMMENT
1
Entering edit mode

Who writes those unit tests?

ADD REPLY
0
Entering edit mode

The unit tests are written and performed by the AI. For an alignment step, for example, the AI will try to align some small fastq's to a small reference, then it'll make sure a bam is produced. Of course, a human could write a more thorough test; here, we're really just checking to see if the step completes without errors. But for a completely hands-off test, I don't think it's too bad!

ADD REPLY
1
Entering edit mode

Does this mean bwa and touch $sample.bam will both pass the test? Or will the AI reuse existing tests designed by developers?

ADD REPLY
0
Entering edit mode

Yep right now touch $sample.bam would pass the test

ADD REPLY
0
Entering edit mode

So you're offering less than current cloud platforms do? Pipelines tested not by bioinformaticians employed by the company but by AI?

ADD REPLY
0
Entering edit mode

Hey Ram -- the AI builds the pipeline and takes care of the plumbing, but as a bioinformatician you'll be able to audit and modify what the AI produces. Think of it as a way to do rapid prototyping and hypothesis-testing. And if you need custom pipeline design by a real bioinformatician, we offer that service too, here: https://www.shirebio.com

ADD REPLY
1
Entering edit mode

So, ChatGPT + some test data? I like the idea because testing it on data sounds nice but there's no way you can tell if the pipeline works just from running it on some toy data.

Plus, given today's environment where you need to bring the code to the data, your site relies on me to bring you the data which is not great. If you were to give me a CWL script or a Snakemake file based on this AI powered pipelining + testing system, I'd use it a lot more, because as a bioinformatician I wish I were doing more analyses and writing fewer glue scripts.

Like "Design a CWL workflow that uses a Docker container to classify xenograft scATAC-seq FASTQs, where the Docker container has seqtk, fastq-tools and Xenome available in its environment". If your tool can design a pipeline and give me test xenograft scATAC-seq FASTQs, I'd pay good money to use it.

ADD REPLY
0
Entering edit mode

This is good feedback, thanks. It might not be too hard to adapt my current system so that it'll spit out a fully-containerized pipeline that you can run locally. Something like that. And I thought I was the only one who used fastq-tools all the time, glad to see it's not dead software.

ADD REPLY

Login before adding your answer.

Traffic: 1690 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6