sra to fastq
7.6 years ago
vimlakany • 0

What does an SRA file look like, and how is it converted into two FASTQ files for paired-end data?

RNA-Seq • 3.5k views

ProTip: You can avoid using sratoolkit altogether. Search EBI-ENA with the SRA accession number to download the FASTQ files directly (disclaimer: this may not work for SRA submissions from the last day or two, but ENA will eventually catch up).
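
The ENA lookup described above can also be scripted. Here is a minimal sketch, assuming the ENA Portal API "filereport" endpoint and the read_run/fastq_ftp field names are as written below (check the current ENA documentation before relying on them). The function name ena_filereport_url is made up for illustration; it only builds the URL, and the actual download is left as a commented-out curl call.

```shell
#!/bin/sh
# Build an ENA Portal API "filereport" URL for one run accession.
# Assumption: endpoint path and field names are as documented by ENA;
# ena_filereport_url is a hypothetical helper, not part of any toolkit.
ena_filereport_url() {
    accession="$1"
    printf 'https://www.ebi.ac.uk/ena/portal/api/filereport?accession=%s&result=read_run&fields=run_accession,fastq_ftp\n' "$accession"
}

# Usage (needs network access, so it is left commented out):
# curl -s "$(ena_filereport_url SRR2393592)"
```

The report, if the request succeeds, should list the FASTQ file locations for the run, which can then be fetched with any download tool.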

7.6 years ago
GenoMax 141k

An SRA file is binary (not human-readable), but it looks just like any other file in a directory listing :-).

Use this guide to the SRAtoolkit from NCBI, which is the tool you would use to convert an SRA file to FASTQ.


Can you tell me how SRA is converted into FASTQ, I mean the algorithm used in the conversion? How does fastq-dump identify paired-end data and split it into two FASTQ files, while writing a single FASTQ file for single-end data?

7.6 years ago
Benn 8.3k

You don't need to download SRA files first; you can use fastq-dump from the SRAtoolkit (as @genomax2 already mentioned).

If you have the toolkit installed, you just have to give the accession of the SRA run you want. It will be downloaded as a FASTQ file.

E.g., in a Linux terminal:

~/sratoolkit/bin/fastq-dump SRR2393592

Does SRR2393592_1.fastq represent reads from the forward strand and SRR2393592_2.fastq reads from the reverse strand? If not, how are they split from the SRA file? For example, if the SRA file is 6.9 GB and the generated FASTQ file is 50.8 GB, how is it being processed?


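Yes, in the sense that _1 holds the first read of each pair and _2 holds its mate. SRA files are binary and compressed, which is why the FASTQ output is so much larger. Think of this as similar to using tar or gzip to compress files.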

If you are interested in the software itself, the source code for the SRA software/utilities is available on this page.
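
To make the splitting question more concrete, here is a conceptual sketch only; it is not fastq-dump's actual implementation. It assumes that a paired-end "spot" stores both mates back to back and that the length of the first mate is known from the run metadata. The function name print_mate, the spot name, and the /1 and /2 header suffixes are all made up for illustration.

```shell
#!/bin/sh
# Conceptual sketch, NOT the real fastq-dump algorithm.
# Assumption: one spot = mate 1 followed directly by mate 2, and LEN1
# (the length of mate 1) comes from the run's metadata.
# Usage: print_mate NAME BASES QUALS LEN1 MATE   (MATE is 1 or 2)
print_mate() {
    name="$1"; bases="$2"; quals="$3"; len1="$4"; mate="$5"
    if [ "$mate" -eq 1 ]; then
        range="1-$len1"            # first LEN1 characters
    else
        range="$((len1 + 1))-"     # everything after mate 1
    fi
    # Emit one four-line FASTQ record for the requested mate.
    printf '@%s/%s\n' "$name" "$mate"
    printf '%s\n' "$bases" | cut -c"$range"
    printf '+\n'
    printf '%s\n' "$quals" | cut -c"$range"
}

# Example: an 8-base spot whose first mate is 4 bases long.
# print_mate spot1 ACGTTTGA IIIIHHHH 4 1 >> demo_1.fastq
# print_mate spot1 ACGTTTGA IIIIHHHH 4 2 >> demo_2.fastq
```

Under these assumptions, a single-end run has only one read per spot, so there is nothing to split and a single FASTQ file results.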


For an SRA file of 2.2 GB, the FASTQ file generated with fastq-dump was 10.8 GB, but the FASTQ file for the same sample at EBI was only 2.7 GB. Why is that?


What was the exact command used? Which SRA# are you looking at?


The command used to convert SRA to FASTQ was fastq-dump --split-3 ERR738423.sra (this run is single-end data). The SRA file is 2.2 GB; the FASTQ file produced by fastq-dump is 10.2 GB; the FASTQ file in ENA is 7 GB. Why is there such a huge difference in size?
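
One plausible contributor to the size gap, offered as a hedged illustration rather than a diagnosis of this particular run: ENA distributes FASTQ as gzip-compressed .fastq.gz files, whereas fastq-dump writes uncompressed text unless compression is requested. The sketch below builds a small synthetic FASTQ-like file (the read names and sequences are invented) and compares its raw and gzip-compressed sizes. Real reads are far less repetitive, so the ratio here is not representative.

```shell
#!/bin/sh
# Compare the size of a plain-text FASTQ-like file with its gzip size.
# Assumption: the synthetic records below stand in for real reads; real
# data compresses much less well than this repetitive example.
tmp=$(mktemp)
i=0
while [ "$i" -lt 1000 ]; do
    printf '@read%s\nACGTACGTACGTACGTACGT\n+\nIIIIIIIIIIIIIIIIIIII\n' "$i"
    i=$((i + 1))
done > "$tmp"

# Arithmetic expansion strips any padding that wc may print.
raw_bytes=$(($(wc -c < "$tmp")))
gz_bytes=$(($(gzip -c "$tmp" | wc -c)))
echo "raw: $raw_bytes bytes, gzip: $gz_bytes bytes"
rm -f "$tmp"
```

When comparing a local fastq-dump result with an ENA download, it helps to compare like with like, for example by compressing the local file with gzip first.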

7.6 years ago
chen ★ 2.5k

Use the fastq-dump --split-3 command from sra-toolkit.

7.6 years ago
ThePresident ▴ 180

For paired-end data, use this: fastq-dump --split-3 SRR2393592
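
As a rough guide to what the --split-3 command writes, the sketch below lists the file names to expect for a given accession and library layout. The helper name expected_outputs and its "paired"/"single" arguments are made up for illustration. With --split-3, a paired-end run may additionally produce ACCESSION.fastq for reads that have no mate.

```shell
#!/bin/sh
# List the FASTQ files that fastq-dump --split-3 is expected to write.
# expected_outputs is a hypothetical helper; LAYOUT is "paired" or "single".
# Usage: expected_outputs ACCESSION LAYOUT
expected_outputs() {
    acc="$1"; layout="$2"
    case "$layout" in
        paired) printf '%s_1.fastq\n%s_2.fastq\n' "$acc" "$acc" ;;
        single) printf '%s.fastq\n' "$acc" ;;
        *) echo "unknown layout: $layout" >&2; return 1 ;;
    esac
}

# Example: check that a finished paired-end conversion left both files.
# for f in $(expected_outputs SRR2393592 paired); do
#     [ -s "$f" ] || echo "missing or empty: $f"
# done
```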

The easiest way to install the SRA toolkit is via the brew package. Follow this link to install brew first; once that's done, simply run brew install sratoolkit in a terminal.


brew, apt-get, or yum will install an old version of the SRA toolkit; I suggest downloading the source and compiling it.
