Hey Guys
I am wondering if there is a slick way access all the possible alignments for a read present in sam or bam file given the read header. Since the existing codebase is in perl I would prefer something which can be done in/via perl.
By default BAM's are indexed by location so the inbuilt samtools indexing wont work I guess.
I should also say the input bam file will have in the order of 500 million total alignments and many reads are expected to be aligned to more than one place in the genome. Given the size of the data loading it all in one big hash is not turning out to be memory friendly.
Thanks!
-Abhi
Actually, you don't need the Kent source tree (that would be needed for
Bio::DB::BigFile
), but you do need a recent version of samtoolsThanks, Chris. That was my error in memory. It looks like
Bio::DB::BigFile
is for BigWig and BigBed anyway, so at leastBio::DB::Sam
is appropriate for the original question.@SES @Chris : Thanks guys .. on my weekend test list
Actually, you don't need the Kent source tree (that would be Bio::DB::BigFile), but you do need a recent version of samtools.