Question

BD Rhapsody software - single cell seq


12 months ago

colindaven 6.4k

Dear all,

Has anyone worked with BD Rhapsody single cell data before?

According to this FAQ https://scomix.bd.com/hc/en-us/articles/360057714812-All-FAQ you can only use the analysis software online via Seven Bridges. That is, you cannot download the tool and use it locally to reanalyze data (which is a fantastic way to get rid of customers by the way, great job).

I'm sure there are alternatives such as dropSeqPipe and others, but has anyone found a decent solution for these data? Their library design does not seem that simple either, with variable bases in the V2 scheme: https://teichlab.github.io/scg_lib_structs/methods_html/BD_Rhapsody.html

Thanks

Edit: so local usage is possible, as LChart says below.

You basically have to install a Common Workflow Language (CWL) runner using their PDF instruction guide, then download their repository, which contains just yml and cwl files, from here: https://bitbucket.org/CRSwDev/cwl/src/master/. Then you edit the yml files to point to your input files (should be easy).

The next step is running it with, e.g.,

cwl-runner --outdir out1 rhapsody_wta_1.9.1.cwl template_wta_1.9.1.yml

and then experiencing cryptic CWL errors. I haven't been successful yet, and am not sure whether the problem is with the CWL from BD Rhapsody or with my yml syntax (which appears valid).

Edit 2: I was successful with the 2.0 version of the CWL some time ago. Previous versions did not work for me.
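One cheap thing to rule out before digging into the CWL itself is a wrong input path in the yml file. The following is only a rough sketch, not part of the BD tooling: `check_yaml_paths` is a hypothetical helper name, and it assumes the input files are given as `location:` or `path:` entries, one per line, which is the usual way a CWL `File` input is written. It is a plain text scan rather than a real YAML parser, so unusual layouts will be missed.

```shell
# Report whether each file referenced by a "location:" or "path:" key exists.
# Relative paths are resolved against the directory of the yml file.
# Returns 1 if at least one file is missing, 0 otherwise.
check_yaml_paths() {
    yml=$1
    ymldir=$(dirname "$yml")
    status=0
    # Extract the values, then strip trailing whitespace and surrounding quotes.
    # Lines starting with '#' (commented-out entries) do not match.
    paths=$(sed -n -E 's/^[[:space:]-]*(location|path):[[:space:]]*//p' "$yml" |
        sed -E "s/[[:space:]]+\$//; s/^[\"']//; s/[\"']\$//")
    while IFS= read -r p; do
        [ -n "$p" ] || continue
        case "$p" in
            /*) full=$p ;;
            *)  full=$ymldir/$p ;;
        esac
        if [ -e "$full" ]; then
            echo "ok      $full"
        else
            echo "MISSING $full"
            status=1
        fi
    done <<EOF
$paths
EOF
    return "$status"
}
```

Usage would be something like `check_yaml_paths template_wta_1.9.1.yml`. If every path is reported as ok, the problem is more likely in the workflow or the runner setup; `cwltool --validate` on the cwl file may help narrow that down.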

rhapsody bd single-cell • 1.7k views


score 3 · Answer 1 · 2023-10-27

I have managed to run Rhapsody on a compute node of an HPC Slurm cluster. Since it was not a head node, it had no network connection to the outside world, and no Docker.

You will need:

node (JavaScript) on the PATH. No clue why, but sometimes it did not work with the container; maybe toil was more picky.
apptainer (make sure you have unsquashfs; I got mine from conda via squashfs-tools).
In $CWL_SINGULARITY_CACHE: bdgenomics_rhapsody:2.0.sif, maybe also node_alpine.sif. You can sftp these from your workstation if the HPC head node also has restricted connections to the outside.
cwltool
TMPDIR set to some large partition, since the pipeline tends to fill up /tmp.
Reference files from:

http://bd-rhapsody-public.s3-website-us-east-1.amazonaws.com/Rhapsody-WTA/Pipeline-version2.x_WTA_references/
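The checklist above can be turned into a small preflight check to run on the compute node before submitting a long job. This is only a sketch under my own assumptions: `require_cmds` and `check_sif_cache` are hypothetical helper names, and the check only confirms that the tools and at least one .sif image are present, not that their versions are the right ones.

```shell
# Print the status of each command given as an argument.
# Returns 1 if at least one of them is not on the PATH.
require_cmds() {
    rc=0
    for c in "$@"; do
        if command -v "$c" >/dev/null 2>&1; then
            echo "found:   $c"
        else
            echo "missing: $c"
            rc=1
        fi
    done
    return "$rc"
}

# List the .sif images in a cache directory.
# Returns 1 if the directory does not exist or holds no .sif file.
check_sif_cache() {
    dir=$1
    if [ ! -d "$dir" ]; then
        echo "no such directory: $dir"
        return 1
    fi
    found=1
    for f in "$dir"/*.sif; do
        if [ -e "$f" ]; then
            echo "image: $f"
            found=0
        fi
    done
    [ "$found" -eq 0 ] || echo "no .sif images in $dir"
    return "$found"
}
```

A possible way to call it: `require_cmds node cwltool unsquashfs`, then `require_cmds apptainer || require_cmds singularity`, then `check_sif_cache "$CWL_SINGULARITY_CACHE"`. Which of apptainer or singularity the runner looks for depends on the cwltool version, so checking both is just a precaution.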

My script:

#!/usr/bin/bash

#SBATCH --job-name=rhapL1
#SBATCH --nodes=1
#SBATCH --time=10:00:00
#SBATCH --cpus-per-task=32
#SBATCH --mem=64G
#SBATCH --partition=foobar

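# Directory where cwltool looks for the pre-downloaded .sif images (no network on the node)
export CWL_SINGULARITY_CACHE=/path/to/singu_cache/
# cwltool needs node to evaluate the JavaScript expressions in the workflow
export PATH=/path/to/soft/progs/node_current/bin:$PATH

# Run the whole pipeline on this node, using Singularity/Apptainer instead of Docker
cwltool --singularity \
--outdir /some/output/path/rhapsody_lane1 \
--debug \
--cachedir /some/output/path/rhapsody_lane1/cache \
rhapsody_pipeline_2.0.cwl pipeline_inputs_lane1.yml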

The above is a quite brain-dead attempt at executing the whole pipeline sequentially on a single compute node. But at least it worked.

I was also experimenting with toil, but so far without success with Rhapsody.

Last but not least: if you are curious, poke around inside the container. There is a compiled QualCLAlign program which, I guess, executes STAR etc.
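The script does not show TMPDIR being set, although it is on the list of requirements. One possible way to do it, as a sketch only: `use_scratch_tmpdir` is a hypothetical helper, and the scratch location you pass to it is whatever large partition your cluster provides.

```shell
# Create a private scratch directory under a large partition and export it
# as TMPDIR, then print how much space is available there.
use_scratch_tmpdir() {
    base=$1
    if [ ! -d "$base" ] || [ ! -w "$base" ]; then
        echo "not a writable directory: $base" >&2
        return 1
    fi
    TMPDIR=$(mktemp -d "$base/rhapsody_tmp.XXXXXX") || return 1
    export TMPDIR
    # df -P prints one line per filesystem; column 4 is the available space in KiB.
    df -Pk "$TMPDIR" | awk 'NR==2 {print "free KiB in TMPDIR:", $4}'
}
```

It has to be called directly in the job script, before the cwltool line (for example `use_scratch_tmpdir /path/to/large/scratch || exit 1`), not inside `$( )`, otherwise the exported variable is lost in the subshell.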

Edit: updated the reference file link.

score 2 · Answer 2 · 2023-05-02

Yes, BD Rhapsody can be run locally. In fact it uses Docker, so the setup isn't really all that bad: https://www.bdbiosciences.com/content/dam/bdb/marketing-documents/BD_Single_Cell_Multiomics_Analysis_Setup_User_Guide.pdf

If you are deploying somewhere without Docker, you will run into trouble, as one particular step of the pipeline calls Docker from within Docker, so no amount of reverse-engineering the environment can get around it.

It should be pointed out that the FAQ you link does not say the software must be used via the Seven Bridges platform, only that it cannot be downloaded from Seven Bridges itself:

You cannot download the software from Seven Bridges. Please follow the BD Single-Cell Multiomics Analysis Setup User Guide on installing the pipeline for local use.