Question: fastq-dump error for restricted data from dbGAP
0
13 months ago by
Canada
Rashedul Islam310 wrote:

I have been approved for, and have downloaded, the sra files of a restricted dataset from dbGaP. However, I get the following error when dumping them to fastq. Here are the steps I used.

Step 01: download sra (successful)

prefetch --ascp-path  '/fullPath/ascp2.7/asperaweb_id_dsa.putty' --max-size 100G --ascp-options "-T -k 2 -l 400M" SRR3819129

Step 02: dump fastq files (error)

/fullPath/sratoolkit.2.8.2-1-ubuntu64/bin/fastq-dump --split-files --gzip /fullPath/phs000435/SRR3819129.sra

Error message:

2018-07-24T03:41:19 fastq-dump.2.8.2 err: item not found while constructing within virtual database module - the path '/fullPath/phs000435/SRR3819129.sra' cannot be opened as database or table

However, fastq-dump works for the public run SRR390728 using the same procedure.

fastq-dump -X 5 -Z SRR390728

Additionally, I tried "vdb-config --restore-defaults", but it did not help. It would be great if you could help me with this.


Thanks in advance!

sequencing next-gen • 664 views
modified 13 months ago by finswimmer12k • written 13 months ago by Rashedul Islam310

See if this thread is helpful.

written 13 months ago by genomax70k
2
12 months ago by
Canada
Rashedul Islam310 wrote:

I finally figured out the issue after email communication with the SRA team. I only had download permission, so I was not able to extract the raw sequences. I then got the NGC file with full permission from my PI. The instruction was: "The dbGaP repository key is only available from the dbGaP account of primary PI of approved data access request. It is not available from downloader’s dbGaP account. Downloaders should get the key directly from the PI."

The error message I got did not indicate that I lacked the proper permission; I only learned that from the SRA team. Here is a summary of how I downloaded the dbGaP restricted data.

1) From the project directory, configure the SRA Toolkit: $ vdb-config --import prj_xxxx.ngc ./.
2) Download the sra files using Aspera.
3) Test the sra file: $ test-sra SRRxxxxxxx.sra
4) Dump the fastq files: $ fastq-dump --split-files --gzip ./sra/SRRxxxxxxx.sra
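
The four steps above could be scripted roughly as follows. This is only a sketch: `run`, `dbgap_workflow` and `DRY_RUN` are hypothetical names, the `prj_xxxx.ngc` and `SRRxxxxxxx` placeholders are kept as in the post, the `./.` in step 1 is read as the current directory, and `prefetch --max-size 100G` (taken from the question) stands in for the Aspera download. By default it only prints the commands.

```shell
#!/bin/sh
# Sketch of the four steps above; nothing is executed unless DRY_RUN=0.
set -eu

DRY_RUN="${DRY_RUN:-1}"   # 1 = only print the commands, 0 = execute them

# run CMD ARGS...: print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        printf '%s\n' "$*"
    else
        "$@"
    fi
}

# dbgap_workflow NGC_FILE ACCESSION: call this from the project directory.
dbgap_workflow() {
    ngc_file="$1"
    accession="$2"
    run vdb-config --import "$ngc_file" ./                        # 1) import the key
    run prefetch --max-size 100G "$accession"                     # 2) download the sra file
    run test-sra "$accession.sra"                                 # 3) test the sra file
    run fastq-dump --split-files --gzip "./sra/$accession.sra"    # 4) dump fastq
}
```

Calling `dbgap_workflow prj_xxxx.ngc SRRxxxxxxx` with the default `DRY_RUN=1` prints the four commands for inspection; setting `DRY_RUN=0` executes them, which requires the NGC file to come from the PI's account as described above.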

modified 12 months ago • written 12 months ago by Rashedul Islam310
0
13 months ago by
ATpoint21k
Germany
ATpoint21k wrote:

You have to execute the fastq-dump command from within /fullPath/phs000435/, i.e. the same directory that you specified in vdb-config as the directory to load the restricted data into.

modified 12 months ago • written 13 months ago by ATpoint21k

Thanks a lot for your quick reply. I still have the problem. The steps I used:
- go to the working directory
$ cd /fullPath/phs000435/
- configure vdb
$ vdb-config --import /fullPath/prj_1xxx1_D20xxx.ngc /fullPath/phs000435/
- download the sra file
$ prefetch --ascp-path '/fullPath/ascp2.7/asperaweb_id_dsa.putty' --max-size 100G --ascp-options "-T -k 2 -l 400M" SRRxxx2334
- dump
$ /fullPath/sratoolkit.2.8.2-1-ubuntu64/bin/fastq-dump --split-files --gzip /fullPath/phs000435/sra/SRRxxx2334

I ran fastq-dump from the directory specified in vdb-config, i.e. /fullPath/phs000435/.

modified 13 months ago • written 13 months ago by Rashedul Islam310

Hmm, I just tried to re-download a restricted file that we have access to, and that I successfully loaded and dumped last year, but now it always reports that access was denied. Maybe it is a problem with dbGaP itself. In that case I would contact their support.

written 13 months ago by ATpoint21k

Thanks a lot for your help. I also checked other discussions of this issue, and I have emailed the SRA developers.

Update: (1) I set the NCBI root directory to my project directory, (2) imported the config file into the project directory, (3) downloaded the sra file to the project directory, and (4) I still get the same error from fastq-dump when executing it from the project directory.

I ran both test-sra SRR3652334.sra and plain test-sra from the project directory. In both cases it reports a total space of 0 and a negative free space, although I have 8 TB of space in my project directory. Can you check the test-sra output from your directory?

<Space free="-442525664" total="0" units="KBytes"/>
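
One way to cross-check that figure against what the operating system reports is plain `df`. This is a minimal sketch, independent of the SRA Toolkit; `free_kbytes` is a hypothetical helper name.

```shell
#!/bin/sh
# free_kbytes DIR: print the free space, in KBytes, of the filesystem holding DIR
# (defaults to the current directory).
free_kbytes() {
    # -P selects the POSIX one-line-per-filesystem layout, -k reports 1024-byte units;
    # column 4 of the data line is the "Available" figure.
    df -Pk "${1:-.}" | awk 'NR == 2 { print $4 }'
}

# Run from the project directory to compare with the free="..." value from test-sra.
free_kbytes .
```

If `df` shows the expected free space while test-sra does not, that suggests the odd numbers come from how the tool reads the filesystem rather than from the disk itself.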

modified 12 months ago • written 13 months ago by Rashedul Islam310