Question

extract the number of reads around TSS


7.9 years ago

Ben ▴ 60

I have written a Python script to count the reads around each TSS from a ChIP-seq SAM file, but it is too slow. Does anyone have suggestions for speeding this up? Thanks in advance!

ChIP-Seq • 2.3k views

ADD COMMENT • link updated 7.9 years ago by Alex Reynolds 36k • written 7.9 years ago by Ben ▴ 60

score 3 · Answer 1 · 2017-08-11

Via BEDOPS (bedmap expects sorted input, so sort TSSs.bed with sort-bed first if it is not already sorted):

$ bedmap --echo --count TSSs.bed <(sam2bed < reads.sam) > answer.bed

If you want to pad the TSSs, say by 1 kb on both sides, add the --range N option (here N = 1000):

$ bedmap --echo --count --range 1000 TSSs.bed <(sam2bed < reads.sam) > answer.bed

This will likely increase the number of overlaps between reads and (padded) TSSs.
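If you would rather stay in Python, the same kind of count can be sketched with sorted coordinates and binary search, which avoids comparing every read against every TSS. This is only an illustration of the idea, not how bedmap is implemented. The function names (build_index, count_reads_near_tss) and the input layout (tuples of chrom, start, end in 0-based half-open BED coordinates) are assumptions made for this example, and a read is counted if it overlaps the padded TSS window by at least one base.

```python
from bisect import bisect_left, bisect_right
from collections import defaultdict


def build_index(reads):
    """Collect read starts and ends per chromosome and sort each list.

    reads: iterable of (chrom, start, end), 0-based half-open (BED-like).
    """
    starts = defaultdict(list)
    ends = defaultdict(list)
    for chrom, start, end in reads:
        starts[chrom].append(start)
        ends[chrom].append(end)
    for chrom in starts:
        starts[chrom].sort()
        ends[chrom].sort()
    return starts, ends


def count_reads_near_tss(tss_list, reads, pad=1000):
    """Return one count per TSS: reads overlapping [start - pad, end + pad)."""
    starts, ends = build_index(reads)
    counts = []
    for chrom, start, end in tss_list:
        win_start = max(0, start - pad)
        win_end = end + pad
        s = starts.get(chrom, [])
        e = ends.get(chrom, [])
        # Reads starting before the window ends, minus reads that
        # already ended at or before the window start.
        counts.append(bisect_left(s, win_end) - bisect_right(e, win_start))
    return counts


if __name__ == "__main__":
    # Toy data, made up for illustration only.
    reads = [
        ("chr1", 100, 150),
        ("chr1", 900, 950),
        ("chr1", 5000, 5050),
        ("chr2", 10, 60),
    ]
    tss = [("chr1", 1000, 1001), ("chr2", 3000, 3001)]
    print(count_reads_near_tss(tss, reads, pad=1000))
```

Sorting costs O(n log n) once, and each TSS is then answered with two binary searches. To use it on real data you would still need to parse the SAM records into (chrom, start, end) tuples yourself, for example from the output of sam2bed.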

score 1 · Answer 2 · 2017-08-11


7.9 years ago

h.mon 35k

bam-readcount.
