Blacklisted regions for mm10
4
0
Entering edit mode
7.0 years ago
rbronste ▴ 420

Hi,

Trying to find out where I can get the most recent file of mm10 blacklisted regions? Thank you.

Rob.

ChIP-Seq blacklisted regions • 6.8k views
ADD COMMENT
0
Entering edit mode

What is the definition of blacklisted regions?

ADD REPLY
0
Entering edit mode

"regions in the human genome that have anomalous, unstructured, high signal/read counts in next gen sequencing experiments independent of cell line and type of experiment."

I have this info for mm9 which I guess I can liftover but was wondering if there were any updated BED files.

Thanks

ADD REPLY
0
Entering edit mode

Just curious: where you got this info from (url)? Lift over is a good idea indeed (and may be the only possibility!)

ADD REPLY
0
Entering edit mode

Lifting over blacklisted regions generally doesn't work, since it's typically the case that those regions have been resolved in subsequent releases.

ADD REPLY
0
Entering edit mode

Hi Devon, as I understand, the blacklisted regions refers to NextGen experiments, not to the Genome assembly per se. A new genome assembly might resolve the regions not assembled earlier. But these regions are not always the same regions which will have anomalous read counts in Next-Gen experiments. These two seem to be different things to me.

ADD REPLY
0
Entering edit mode

They're often one and the same. The regions tend to overlap assembly issues. Yes, this won't always be the case, but this is much of the reason for the difference in blacklisted regions between GRCh38 and GRCh37.

ADD REPLY
0
Entering edit mode

I'm getting more confused, sorry :) If the regions are not in the genome assembly in the very first place, how can NGS reads map there? For me, it seems like the blacklisted regions are mostly repeats, which are present in the assembly, but it is difficult to map NGS reads on them. In this sense, they are "resolved" in assembly, but difficult to map NGS reads on them.

ADD REPLY
0
Entering edit mode

Often the copy number in the assembly has been fixed. For repeats, this makes the resulting alignments actually repetitive in the newer assembly so there's no reason to blacklist since you no longer get aberrant peaks.

ADD REPLY
1
Entering edit mode
7.0 years ago
James Ashmore ★ 3.4k

Select mm10 in the drop-down box

https://www.encodeproject.org/annotations/ENCSR636HFF/

ADD COMMENT
1
Entering edit mode
7.0 years ago
BioinfGuru ★ 1.7k

The ATAC-seq authors recently created a mitochondrial blacklist (found here) for use on ATAC-seq data which represents high signal regions on the nuclear genome caused by read sequence homology with the mitochondrial genome. A signal artifact blackist has also been created by ENCODE (found here).

In the command line:

for i in *.bedfile; do bedtools intersect -v -a $i -b [PATH]/mitochondrial.blacklist.bed [PATH]/signal.artifact.blacklist.bed > $i.bed; done

Be careful to not create an infinite loop with this command (all the files may end in .bed)

ADD COMMENT
1
Entering edit mode
7.0 years ago

Probably you are looking for this

https://sites.google.com/site/anshulkundaje/projects/blacklists

ADD COMMENT
0
Entering edit mode
3.6 years ago
ATpoint 82k

Current NGS blacklists based on the official paper:

Official Nature Paper (Amamiya, Kundaje, Boyle et al 2019)

https://www.nature.com/articles/s41598-019-45839-z

Download from GitHub (link from the paper)

https://github.com/Boyle-Lab/Blacklist/archive/v2.0.zip

ADD COMMENT

Login before adding your answer.

Traffic: 2178 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6