Hi,
I’m fairly new to ATAC-seq and have successfully run MACS2 separately for each of my samples. I now have individual *.narrowPeak files as output.
My experimental design looks like this:
Sample_ID Cell_type Condition Donor
Sample_1 T_cells Tumor Donor_1
Sample_2 T_cells Normal Donor_1
Sample_3 T_cells Tumor Donor_2
Sample_4 T_cells Normal Donor_2
...
Sample_11 Dendritics Tumor Donor_10
Sample_12 Dendritics Normal Donor_10
Sample_13 Dendritics Tumor Donor_11
Sample_14 Dendritics Normal Donor_11
As you can see, I have two cell types (T_cells and Dendritics), and for each donor, I have paired Tumor and Normal samples.
My goal is to perform a differential accessibility analysis (Tumor vs Normal), accounting for both Donor and Cell_type. I’m also interested in comparing Tumor (T_cells) vs Tumor (Dendritics).
I heard that it is possible to use DESeq2 for ATAC-seq data, so my design will look like this: https://bioconductor.org/packages/3.21/bioc/vignettes/DESeq2/inst/doc/DESeq2.html#group-specific-condition-effects-individuals-nested-within-groups
I have a few questions:
1) Can I use the exact same code from the vignette, as typically done for RNA-seq data? Or are there any parameters or steps specific to ATAC-seq that I should consider?
2) I’m struggling with how to convert my individual *.narrowPeak files into a count matrix. Do you have any recommendations or tools to help with this step?
3) Are there alternative methods to DESeq2 that would be better suited for this kind of analysis? I guess limma should work the same no ?
Thank you in advance for your help !