SRA to fastq conversion, 2Gb to 11Gb
1
0
Entering edit mode
7.9 years ago
bxia ▴ 180

Just begin to learn, on youtube, they said after fastq-dump, the file become smaller.

But I am undergoing the other way..my SRA file only 2Gb, after fastq-dump, it become 11Gb...

RNA-Seq ChIP-Seq • 3.6k views
ADD COMMENT
2
Entering edit mode

SRA is a compressed file format so if your output fastq file became larger that is not unexpected. As long as the format looks fine don't worry about the file size.
As a sanity check you can search with that SRA accession number at ENA to confirm. ENA provides direct links to fastq files (without having to use sratoolkit).

ADD REPLY
3
Entering edit mode
7.9 years ago
piet ★ 1.8k

But I am undergoing the other way..my SRA file only 2Gb, after fastq-dump, it become 11Gb

FASTQ files are pure ASCII and very large. Thus most people use gzip to shrink their size, typically down to about 20 %. You may want to use the option ' --gzip' with fastq-dump in order to generate gzipped FASTQ.

Moreover, you may also use option '-F' with fastq-dump to shrink the size of the description line.

Finally, fastq-dump produces an extraordinarily verbose variant of FASTQ with the description line doubled for each read. You may want to remove the doubled description line by filtering your FASTQ with a tool like seqtk.

ADD COMMENT

Login before adding your answer.

Traffic: 1987 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6