Large *.bam files difficult to open in UGENE
0
0
Entering edit mode
2.2 years ago
Askhat • 0

Hello everyone!

I have a .bam file and bai file.

.bam file's size is 13.8 Gb. I attempted to open it in UGENE in order to see what's inside (reference gene sequence, in this case it's CYP2D6, and reads mapped to it), but the loading starting from 50% (cause I created bai file via samtools index before that) only proceeded to 56% in 5 hours and eventually got stuck.

file opening progress bar stuck in UGENE

I have the following PC specs:

  • Processors: 8 × Intel® Core™ i7-4810MQ CPU @ 2.80GHz
  • Memory: 23,1 GiB of RAM
  • Graphics Processor: Mesa DRI Intel® HD Graphics 4600
  • Graphics Processor: Quadro K1100M

Honestly, I have no idea how to open this file and see the reads aligned. Is there any idea what to do with the *.bam file in order to make it smaller? Is there any way to "reduce the depth of the reads"? I assume this is why the file is so big.

What I mean by the depth of the reads is pretty much the coverage depth, or the number of reads going deeper (shown on the screenshot below)

general view of reads alignments in *.bam files in UGENE

visualization UGENE BAM • 779 views
ADD COMMENT
0
Entering edit mode

How about 'just' using IGV ?

ADD REPLY
0
Entering edit mode

Thank you for suggestion! I will try it out

ADD REPLY
0
Entering edit mode

Is there any idea what to do with the *.bam file in order to make it smaller?

samtools view -o subset.bam -O BAM indexed.bam "chrom1:234-567"

Is there any way to "reduce the depth of the reads"?

samtools view with option -s (Subsampling shorthand option)

ADD REPLY
0
Entering edit mode

Thank you very much, Pierre!

This is a very helpful info.

ADD REPLY

Login before adding your answer.

Traffic: 3146 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6