checking overlap between two genomic coordinates
3
0
Entering edit mode
3.7 years ago
kanwarjag ★ 1.2k

I have two different Chipseq datasets files with peak coordinates. and other information like gene name etc. I want check the overlap of coordinates in two files. Is there any online tool that can help me in checking the overlap between two files. Thanks

Kanwar

ChIP-Seq • 2.2k views
ADD COMMENT
0
Entering edit mode

In my opinion the most clean way to do this is using bedtools intersect. This is the most standard and "old school" way, and can be easily integrated into pipelines with only one easy to install dependency (bedtools)

ADD REPLY
0
Entering edit mode

Hi, an answer with bedtools intersect already exists. If you have more to add, please add it as a comment to that answer so people going down that path can find all relevant information in the same thread.

ADD REPLY
0
Entering edit mode

If you have gene annotation in gff/gtf format and reference genome fasta with you, then it is very easy to annotate peaks using annotatePeaks.pl, a perl script which is a part of HOMER package.

Command goes like this:

perl annotatePeaks.pl sample_peak.bed Reference_Genome.fasta -gft Reference_genome_annotation.gft   >homer_annotation.txt
ADD REPLY
0
Entering edit mode

Original question is not about annotating peaks. It is for finding overlap between two sets.

ADD REPLY
1
Entering edit mode
3.7 years ago
Ram 43k

findOverlaps from GenomicRanges might be of use. See: https://stackoverflow.com/questions/19101849/overlapping-genomic-ranges

ADD COMMENT
1
Entering edit mode
3.7 years ago

You can also try bedtools intersect. See here for more details

ADD COMMENT
0
Entering edit mode
3.7 years ago
Shalu Jhanwar ▴ 520

Have a look at BedSect and Galaxy online platforms.

ADD COMMENT

Login before adding your answer.

Traffic: 1907 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6