Counting Number Of Bam Reads Directly Within Set Of Intervals With Bedtools
12.6 years ago
User 9996 ▴ 840

How can I count the number of BAM reads that fall entirely within a set of intervals given in GFF format? Note that I do not want reads that merely overlap the intervals, only reads that are fully contained in them.

I tried the following:

intersectBed -abam reads.bam -b exons.gff -wb -f 1

This output contains redundant records, so I pipe it into coverageBed as follows:

intersectBed -abam reads.bam -b exons.gff -wb -f 1 | coverageBed -abam stdin -b exons.gff

Is this correct? Thanks.

bedtools bed bam gff rna • 5.1k views

Yes, that is correct. I suggest using version 2.13.1 with the -counts option of coverageBed. This uses less memory and runs much faster than the default coverageBed.
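
To make the effect of -f 1 concrete, here is a minimal Python sketch of the containment logic: a read is counted only when the interval covers 100% of the read's length. The function names and toy coordinates are hypothetical, the intervals are assumed to be 0-based and half-open on a single chromosome, and strand and spliced alignments are ignored. It illustrates the idea and is not how BEDTools is implemented.

```python
def overlap_fraction(read_start, read_end, iv_start, iv_end):
    """Fraction of the read's length covered by the interval.

    Coordinates are assumed to be 0-based, half-open, on one chromosome.
    """
    overlap = min(read_end, iv_end) - max(read_start, iv_start)
    return max(0, overlap) / (read_end - read_start)


def count_contained(reads, intervals, min_fraction=1.0):
    """Count, per interval, the reads whose overlap fraction reaches min_fraction.

    min_fraction=1.0 mirrors the idea behind -f 1: the whole read must lie inside.
    """
    return {
        iv: sum(
            1
            for r in reads
            if overlap_fraction(r[0], r[1], iv[0], iv[1]) >= min_fraction
        )
        for iv in intervals
    }


# Toy data (hypothetical): two reads inside the exon, one hanging off its end.
reads = [(100, 150), (140, 190), (180, 230)]
intervals = [(100, 200)]
counts = count_contained(reads, intervals)
print(counts)
```

With the toy data, the read at (180, 230) overlaps the interval but only 40% of it lies inside, so it is excluded from the count.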

Both BAM (unlike SAM, which is 1-based) and BED use zero-based, half-open coordinates. BEDTools handles the conversion automatically anyway.
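
For anyone converting coordinates by hand, the difference between the two conventions can be sketched as follows. The helper names are hypothetical. A 1-based, closed interval (as in SAM positions or GFF features) and a 0-based, half-open interval (as in BED) differ only in the start coordinate.

```python
def one_based_closed_to_bed(start, end):
    """Convert a 1-based, closed interval to 0-based, half-open (BED style)."""
    return start - 1, end


def bed_to_one_based_closed(start, end):
    """Convert a 0-based, half-open interval back to 1-based, closed."""
    return start + 1, end


# A hypothetical feature spanning bases 101..200 in 1-based, closed coordinates.
bed_start, bed_end = one_based_closed_to_bed(101, 200)
length = bed_end - bed_start  # half-open length is simply end - start
print(bed_start, bed_end, length)
```

The end coordinate is unchanged because the last included base in 1-based, closed notation has the same number as the first excluded position in 0-based, half-open notation.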

Is there a caveat about the 0-based versus 1-based coordinates of the SAM and BED formats?

How can this be extended to get all mapped read pairs from a BAM file that land within the boundaries of certain intervals?
