Question

How to start a website with jbrowse to show sequencing data?

1

5.6 years ago

I0110 ▴ 140

Hi,

I have some RNA-sequencing data and a transcriptome assembly for a non-model species. I would like to set up a JBrowse instance to share the information with people interested in this species. I assume that I would need a website with enough storage and computing power, but I have no idea how to start. Should I build my own web server, or is there a good commercial resource for this? Thanks!!!

jbrowse • 1.4k views

updated 5.5 years ago by Biostar 20 • written 5.6 years ago by I0110 ▴ 140

1

You can find information about how to set up JBrowse at this link. If you are not IT savvy, you will need appropriate help to get this set up.

5.6 years ago by GenoMax 141k

score 5 · Accepted Answer · 2018-10-01

Is JBrowse the right tool?

I have some RNA-sequencing data and transcriptome assembly for a non-model species.

JBrowse is a genome browser. A genome consists of relatively few scaffolds/contigs/chromosomes, each containing multiple structures (gene models), onto which other genomic feature tracks (e.g. quantitative ones) can be mapped.

A de novo transcriptome consists of relatively many (possibly 100,000+) and shorter contigs, with little additional structure or naming to display. Even if there are few structures per contig, a genome browser like JBrowse handles that many contigs poorly and puts a relatively high burden on the database back-end. It also gives the user little benefit: they will have a hard time finding the right gene or transcript, and will mostly need tblastx/tblastn to find anything sensible. That is why many assemblies simply sit in large repositories like the Trace/SRA, and sadly some published assemblies are not provided at all or are made available upon request only.
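To see how far your own assembly is from a "few long scaffolds" situation, you can count the contigs and look at their lengths. Below is a minimal sketch in plain Python; the function names and the tiny inline example are made up for illustration, and it assumes an uncompressed FASTA file.

```python
import io
from statistics import median


def contig_lengths(handle):
    """Yield (name, length) for every record in an open FASTA handle."""
    name, length = None, 0
    for line in handle:
        line = line.strip()
        if line.startswith(">"):
            if name is not None:
                yield name, length
            fields = line[1:].split()
            # Use the first word of the header as the contig name.
            name, length = (fields[0] if fields else ""), 0
        elif name is not None:
            length += len(line)
    if name is not None:
        yield name, length


def summarize(handle):
    """Return contig count, total bases, median and maximum contig length."""
    lengths = [n for _, n in contig_lengths(handle)]
    if not lengths:
        return {"contigs": 0, "total_bp": 0, "median_bp": 0, "max_bp": 0}
    return {
        "contigs": len(lengths),
        "total_bp": sum(lengths),
        "median_bp": median(lengths),
        "max_bp": max(lengths),
    }


# Tiny in-memory example; for a real file use: with open("assembly.fasta") as fh: ...
example = io.StringIO(">contig1\nACGT\nAC\n>contig2 some description\nA\n")
print(summarize(example))
```

If the contig count is in the tens or hundreds of thousands and the median length is short, the concerns above about browsing and finding transcripts apply.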

One way to share your data is simply to:

1. Upload your raw data to either SRA or ENA. This is the most crucial step: re-assembling the transcriptome has become relatively quick and straightforward, but if a commercial tool was used for the original assembly, that could be a blocker to exactly reproducing your results.
2. Submit your assembly to TSA: https://www.ncbi.nlm.nih.gov/genbank/tsaguide/
3. Provide a FASTA file as supplementary data with your paper (if you can't do 2).

That way, you ensure your assembly data stays available even if you need to discontinue your service in the future.
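Before submitting the assembly or attaching the FASTA file, it can help to drop very short contigs. Here is a minimal sketch in plain Python. The 200 bp cut-off is my assumption, not something stated above, so check the TSA guide for the current requirement; the function name is hypothetical.

```python
import io


def filter_fasta(src, dst, min_length=200):
    """Copy records of at least min_length bases from src to dst.

    min_length=200 is an assumed cut-off; adjust it to the repository's rules.
    Returns a (kept, dropped) tuple of record counts.
    """
    kept = dropped = 0
    header, chunks = None, []

    def flush():
        nonlocal kept, dropped
        if header is None:
            return
        seq = "".join(chunks)
        if len(seq) >= min_length:
            dst.write(header + "\n")
            # Re-wrap the sequence at 60 characters per line.
            for i in range(0, len(seq), 60):
                dst.write(seq[i:i + 60] + "\n")
            kept += 1
        else:
            dropped += 1

    for line in src:
        line = line.strip()
        if line.startswith(">"):
            flush()
            header, chunks = line, []
        elif line and header is not None:
            chunks.append(line)
    flush()
    return kept, dropped


# In-memory example; with real files, pass two open file handles instead.
src = io.StringIO(">long\n" + "A" * 250 + "\n>short\nACGT\n")
dst = io.StringIO()
print(filter_fasta(src, dst))
```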

There are many ways to get a sequence database web-site up and running.

Possibly one of the easiest ways to get JBrowse running on your local computer is to use Docker and pull one of the JBrowse images. The most up-to-date one I found just now is the Bioconda JBrowse repository (https://quay.io/repository/biocontainers/jbrowse?tab=info). That should also be the fastest way to get an instance up, including a web server and back-end (I think this should run more or less out of the box). It might even be doable for beginners in the field, but you need to be willing to learn the basics of the operating system, web servers, Docker, JBrowse, and their configuration.
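As a rough sketch of that first step, the Python helper below only assembles (and, if asked, runs) the `docker pull` command for that repository. The tag is a placeholder I have not filled in: as far as I know these images are published under specific version tags, so look one up on the repository's Tags page. The helper names are made up for illustration, and how to start the container afterwards depends on the image, so that part is left out.

```python
import shutil
import subprocess

# Pull name corresponding to the quay.io repository page linked above.
IMAGE = "quay.io/biocontainers/jbrowse"


def build_pull_command(tag):
    """Return the docker command (as an argument list) for one image tag."""
    if not tag or any(c.isspace() for c in tag):
        raise ValueError("tag must be a non-empty string without whitespace")
    return ["docker", "pull", f"{IMAGE}:{tag}"]


def pull_image(tag, dry_run=True):
    """Build the command; actually run it only when dry_run is False."""
    cmd = build_pull_command(tag)
    if dry_run:
        return cmd
    if shutil.which("docker") is None:
        raise RuntimeError("docker executable not found on PATH")
    subprocess.run(cmd, check=True)
    return cmd


# Dry run: prints the command without touching Docker.
# "<tag-from-quay.io>" is a placeholder, not a real tag.
print(" ".join(pull_image("<tag-from-quay.io>")))
```

Call `pull_image(tag, dry_run=False)` with a real tag once Docker is installed.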

Another option could be GMOD in the Cloud, but according to the docs the software in this image is very outdated (state of ~2013).

The essential decision is then where to deploy:

deploy on local computer
deploy 'in the cloud'

Here is an overview of some commercial cloud/Docker hosting providers; some offer limited 'free' resources: https://blog.codeship.com/the-shortlist-of-docker-hosting/

I recommend starting on a local Linux machine (any half-recent computer will do) to get some experience and practice without having to consider pricing.