Question: Cloud Computer Cluster VS Local Compute Cluster for RNAseq analysis
gravatar for Chen Mor
2.8 years ago by
Chen Mor0
Chen Mor0 wrote:

Hey everyone!

What are you thoughts about using a local cluster VS a cloud based one for doing RNASeq analysis? Any pros and cons you can share from you own experiences?

Best, Chen

rna-seq compute cloud cluster • 1.1k views
ADD COMMENTlink modified 2.8 years ago by genomax83k • written 2.8 years ago by Chen Mor0

Can't think of anything significant which would make one better than the other. Accessibility usually gives the edge to cloud based set ups (not everyone has the luxury of a private server or cluster). If you use cloud based VMs, you have the bonus of the server being 'all yours' for a while, so you can abuse it somewhat. Really depends what you need/already have.

ADD REPLYlink written 2.8 years ago by Joe16k
gravatar for Jean-Karim Heriche
2.8 years ago by
EMBL Heidelberg, Germany
Jean-Karim Heriche22k wrote:

Cost-wise choice would depend on how you get charged for using your local cluster and associated storage. For one-offs and short-term projects a cloud-based solution may be cheaper but for regular use in the long run, a local cluster tends to be cheaper (especially when taking into account mistakes, bugs ...). Cloud-based solutions may have a cost in terms of data transfer and upload/download of large amounts of data can be significantly slow (and may only be possible by using something like Amazon's snowball or Amazon's snowmobile).
The main advantage of cloud-based storage would be for sharing data with people outside your institute.

ADD COMMENTlink written 2.8 years ago by Jean-Karim Heriche22k
gravatar for h.mon
2.8 years ago by
h.mon29k wrote:

What kind of analyses do you need to run? For differential gene or transcript expression, the latest methods (such as Salmon or Kallisto) are so fast and light on resources that a regular laptop can perform them quickly, making cluster and cloud resources unnecessary. The constraint is the size of fastq files - do they fit on your disk or not?.

See some discussions and examples here, here, here and here.

ADD COMMENTlink written 2.8 years ago by h.mon29k
gravatar for genomax
2.8 years ago by
United States
genomax83k wrote:

Take into account local security policies at your institution/company. If that policy does not allow you to use external/cloud based resources then your choice would be limited to using local resources. If you work with human data (or data subject to privacy restrictions) that will add another layer of complexity and may require you to have specific agreements with the providers (e.g. if you use Amazon cloud then you may have to ask them to keep your data in a certain geographical jurisdiction).

That said, if you needed to get ~5000 samples analyzed in a week there is simply no substitute for using a cloud based provider like google compute/amazon AWS. Cost would be (relatively) inexpensive (when considering time/infrastructure) when you can dial up thousands of cores on demand.

ADD COMMENTlink modified 2.8 years ago • written 2.8 years ago by genomax83k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2108 users visited in the last hour