How to get the no-overlapping exome region (.bed file)?
1
0
Entering edit mode
7.3 years ago
weiyuchung • 0

I want to get the hg38 whole exome regions (.bed) for the downstream analysis. However, some regions in the .bed file browsed from the UCSC table are overlapped. How to get the no-overlapping exome region .bed file?

My settings for the UCSC table browser as follows:

  • clade: mammal
  • genome: human
  • assembly: Dec.2013 (GRCh38/hg38)
  • group: Genes and Gens Predictions
  • track: GENCODE v24
  • table: knownGene
  • position: chr22:1-50818468
  • output format: BED

Click "Get Output", select “Exons plus 0 bases at each end”, and get the .bed file.

In this .bed file, the regions from row 29:15690025-15690709 and row 40:15690077-15690314 are overlapped. There are some other overlapped regions in this file. How to get the .bed file with no-overlapping exome regions? BEDtools?

Thank you!!!

exome exon bed UCSC hg38 • 3.1k views
ADD COMMENT
0
Entering edit mode
7.3 years ago

With BEDOPS bedops -n:

$ bedops -n 1 regions.bed regions.bed > non-overlapping-regions.bed

What comes out of UCSC is sorted correctly, usually, so I think you can skip presorting.

ADD COMMENT

Login before adding your answer.

Traffic: 3097 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6