Question: Storing Fastq As Unaligned Bam
gravatar for Abhi
7.2 years ago by
United States
Abhi1.5k wrote:

Hey Guys

Just wondering if anyone there is now storing the raw fastq data as unaligned bam files. We are reaching a stage where any space we could potentially save would be beneficial.

Any con of storing fastq as bam ? I see some discussion about this on seqanswers :

Also any tools that people already have that converts a fastq to bam and vice-a-versa. I know there are few which can do bam to fastq like picard but not sure if fastq to bam is there.

Thanks! -Abhi

fastq bam • 6.5k views
ADD COMMENTlink written 7.2 years ago by Abhi1.5k

This doesn't read like a stackexcahnge question to me - if you want a discussion why not continue on SEQanswers?

ADD REPLYlink written 7.2 years ago by Peter5.7k

This seems relevant to me, handling large NGS data files is an increasing bioinformatic issue.

ADD REPLYlink written 7.2 years ago by Ian5.3k
gravatar for Pierre Lindenbaum
7.2 years ago by
France/Nantes/Institut du Thorax - INSERM UMR1087
Pierre Lindenbaum116k wrote:

I think a bgzipped fastq file will be always smaller than a BAM file as the BAM file also contains the positions of the alignments.

See also:

Compression of genomic sequences in FASTQ format


Efficient storage of high throughput sequencing data using reference-based compression

ADD COMMENTlink written 7.2 years ago by Pierre Lindenbaum116k
gravatar for toni
7.2 years ago by
toni2.1k wrote:

Hi Abhi,

yes we do this in our team. You can use Picard 'FastqToSam' utility.

Compared to 2 fastq files (plain, not gzipped as suggested by Pierre), an unaligned BAM file allows to save 60°% to 65% of storage space. It is also practical because you can store some useful information (Sample, Library, Run, any useful comments ...) in the header if you want to.

ADD COMMENTlink modified 7.2 years ago • written 7.2 years ago by toni2.1k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1620 users visited in the last hour