Question on removing duplicates with ATAC-Seq and ChIP-Seq
0
1
Entering edit mode
12 months ago
tamu.anand ▴ 30

Hi

I am new to ATAC-Seq and ChIP-Seq and I have a question on removing duplicates when it comes to paired-end ATAC-Seq and paired-end ChIP-Seq pipelines.

I have seen some papers where clumpify is used in the first step to remove duplicates followed by bwa-mem and passing the bam files (after shifting for ATAC-Seq) to MACS2 with no usage of Picard MarkDuplicates.

https://www.cell.com/cell-genomics/pdf/S2666-979X(23)00019-8.pdf
Single nucleus multiomics identifies ZEB1 and MAFB as candidate regulators of Alzheimer’s disease-specific cis-regulatory elements
ChIP-seq analysis
Prior to analysis, reads were processed to remove optical duplicates with clumpify (BBMap v38.20; https://sourceforge.net/projects/
bbmap/) [dedupe = t optical = t dupedist = 2500]

I have seen this post - Did you remove ChIP-seq duplicates - where Picard MarkDuplicates is used

Hence just curious - is there benefits with one approach or the other (Approach 1 - using clumpify as step 1 and no MarkDuplicates; Approach 2 - use aligned bam files and then do MarkDuplicates)

If using clumpify, is the above correct - dedupe=t optical=t dupedist=2500

I would like to seek your advice and guidance on the above for both ATAC-Seq and ChiP-Seq.

Thanks in advance.

clumpify chip-seq atac-seq bbmap • 823 views
ADD COMMENT
1
Entering edit mode

Using clumpify.sh will allow you to do duplicate removal in an alignment independent manner. It will also allow you to remove just optical (really clustering) duplicates, which is what the paper you linked above seems to be doing. It will also make the size of the input files smaller thus reducing time required for alignment.

Keep this statement from @Ian sudbery's answer linked above in mind

If you are using MACS for your peak-calling, you'll want to mark duplicates rather than remove them.

ADD REPLY
0
Entering edit mode

Thanks GenoMax for your answer.

ADD REPLY

Login before adding your answer.

Traffic: 3587 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6