Intersection between genomic regions
6.9 years ago
niutster ▴ 110

Hi,

I have two files containing genomic regions. Each row holds a chromosome name, a start site and an end site, so each row defines one genomic region. I would like to count how many regions are shared between the two files. The regions do not have to be exactly identical; partially overlapping regions should count as well. How can I identify them? Are there any tools for this?

6.9 years ago

You can use BEDOPS bedmap --count to count the number of overlaps between reference and map files:

$ bedmap --echo --count --delim '\t' fileA.bed fileB.bed > answer.bed

The default overlap criterion is a minimum of one base of overlap. You can adjust this to be more stringent, if need be. See bedmap --help or the online documentation for more detail.
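To make the overlap criterion concrete, here is a minimal sketch in plain shell and awk of what "at least N bases of overlap" means for BED intervals. It is not how bedmap is implemented, and `count_overlaps` and its `min_bp` argument are hypothetical names made up for this illustration. It assumes tab-delimited, headerless BED files with 0-based, half-open coordinates, and it compares every pair of regions, so it is only suitable for small files.

```shell
# Hypothetical helper, not part of BEDOPS: for each region in A.bed, count the
# regions in B.bed on the same chromosome that share at least min_bp bases.
# Assumes tab-delimited BED with 0-based, half-open coordinates and no header.
# usage: count_overlaps A.bed B.bed [min_bp]
count_overlaps() {
  awk -v bfile="$2" -v min_bp="${3:-1}" '
    BEGIN {
      FS = OFS = "\t"
      # Load every region of B into memory.
      while ((getline line < bfile) > 0) {
        split(line, f, "\t")
        n++
        chr[n] = f[1]; s[n] = f[2] + 0; e[n] = f[3] + 0
      }
      close(bfile)
    }
    {
      a_start = $2 + 0; a_end = $3 + 0; c = 0
      for (i = 1; i <= n; i++) {
        if (chr[i] != $1) continue
        lo = (a_start > s[i]) ? a_start : s[i]   # larger of the two starts
        hi = (a_end < e[i]) ? a_end : e[i]       # smaller of the two ends
        if (hi - lo >= min_bp) c++               # shared bases = hi - lo
      }
      print $1, $2, $3, c
    }' "$1"
}

# Example (file names are placeholders):
#   count_overlaps fileA.bed fileB.bed | awk '$NF > 0' | wc -l
```

The commented pipeline at the end counts how many regions of the first file overlap at least one region of the second, which is the number the question asks for. Two regions that merely touch, such as 50-60 and 60-70, share zero bases under half-open coordinates and are not counted.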


Thanks. Does it install on Windows?


If that is a requirement then:

  • You should mention that at the beginning of your post
  • You should realise that many tools are unavailable for you

It installs in Cygwin under Windows, or you can run Linux inside VirtualBox under Windows, and use this kit within a Linux environment. It does not run directly under Windows, however, and not many bioinformatics tools do.
