Puzzle about the size of a BAM file
9.8 years ago

I found one weird thing about a BAM file. My original BAM file is about 13 GB. I converted it to a SAM file and extracted the reads with a mapping score below 10. I put these reads into a new mini SAM file and converted that into a mini BAM file, which is about 2.5 GB.

However, I plotted a histogram of the mapping scores in the original BAM file, and the proportion of reads with a mapping score below 10 should be far less than 20%.

Could anyone tell me why the mini BAM file is around 20% of the size of the original one?

9.8 years ago

You cannot easily predict what the size of a file will be after compression, so don't worry about it.
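To illustrate the point, here is a minimal sketch using Python's zlib module (BAM blocks are compressed with a DEFLATE-based scheme, so this is a reasonable stand-in, not the actual BAM writer). The two inputs are made up for the demonstration: both are about 200,000 bytes of "bases", but one is highly repetitive and the other is random. It makes no claim about which kind of content your low-scoring reads actually contain.

```python
import random
import zlib


def compressed_fraction(data: bytes) -> float:
    """Return compressed size divided by original size (DEFLATE, level 6)."""
    return len(zlib.compress(data, 6)) / len(data)


rng = random.Random(0)  # fixed seed so the run is repeatable
n = 200_000

# Highly repetitive toy "reads": the same 50-base line repeated many times.
repetitive = (b"ACGTACGTAC" * 5 + b"\n") * (n // 51)

# Diverse toy "reads": random bases, so there is little redundancy to exploit.
diverse = bytes(rng.choice(b"ACGT") for _ in range(n))

print(f"repetitive input compresses to {compressed_fraction(repetitive):.3f} of its size")
print(f"diverse input compresses to    {compressed_fraction(diverse):.3f} of its size")
```

The same amount of uncompressed data ends up at very different sizes on disk, which is why the ratio of two BAM file sizes need not match the ratio of their read counts.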

9.8 years ago

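As Istvan said, don't worry about the size. A BAM file is binary, so rather than counting lines, count the number of alignment records in each BAM file (for example with samtools view -c) to see the effect of your filter.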
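If you would rather do the count on SAM text, something like the sketch below should work. It assumes standard SAM records, where header lines start with "@" and the mapping quality (MAPQ) is the fifth tab-separated column. The function name count_mapq and the three toy records are made up for illustration; for real data you could feed it the lines of your SAM file instead.

```python
def count_mapq(sam_lines, threshold=10):
    """Count alignment records, and those with MAPQ below the threshold.

    Returns (total_records, records_below_threshold).
    """
    total = 0
    low = 0
    for line in sam_lines:
        # Skip header lines and blank lines; they are not alignment records.
        if line.startswith("@") or not line.strip():
            continue
        fields = line.rstrip("\n").split("\t")
        mapq = int(fields[4])  # column 5 of a SAM record is MAPQ
        total += 1
        if mapq < threshold:
            low += 1
    return total, low


# Toy SAM content (hypothetical reads), just to show the call.
toy_sam = [
    "@SQ\tSN:chr1\tLN:1000\n",
    "read1\t0\tchr1\t100\t60\t10M\t*\t0\t0\tACGTACGTAC\tIIIIIIIIII\n",
    "read2\t0\tchr1\t200\t3\t10M\t*\t0\t0\tACGTACGTAC\tIIIIIIIIII\n",
    "read3\t16\tchr1\t300\t10\t10M\t*\t0\t0\tACGTACGTAC\tIIIIIIIIII\n",
]

total, low = count_mapq(toy_sam)
print(f"{low} of {total} records have MAPQ < 10")
```

Comparing the fraction of records that pass the filter against the fraction of bytes in the two files shows whether the filter behaved as expected, regardless of how well each file compressed.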
