Question: The fastest way to download a list of SRR accessions from Sequence Read Archive with sratoolkit
0
gravatar for Denis
3 days ago by
Denis140
Denis140 wrote:

I've installed sratoolkit.2.10.0 at my home on a cluster. I have to download a numerous SRR accessions to my home directory. There are a several options, as i can understand:

  1. fastq-dump
  2. run prefetchutility, then convert resulted sra files to fastq by fastq-dump
  3. fasterq-dump (able to use multi-threading, but if i'm correct can not employ list of SRR accessions as input)

Which option is the fastest? Could you please provide a command line example which will be suitable for my purposes?

I've found very useful a post here: download from SRA However it seems too old.

sequence genome • 92 views
ADD COMMENTlink modified 3 days ago • written 3 days ago by Denis140

Your best bet is to use this: Fast download of FASTQ files from the European Nucleotide Archive (ENA) instead.

ADD REPLYlink written 3 days ago by genomax76k

This link covers the (in my opinion) two fastest options. The first is to download directly in fastq format from ENA, and the second is prefetch followed by parallel-fastq-dump. See the thread for details including code examples. Don't use any of the "dump" commands to download data directly, too slow and too unstable in my experience.

ADD REPLYlink modified 3 days ago • written 3 days ago by ATpoint28k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1533 users visited in the last hour