Hi friends,
I'm handling some ATAC-seq data. I used FastQC to get a look at the FASTQs. The data (paired-end) are high quality and have no other relevant "issues", but in the "Per base sequence content" plot I see noticeable 5' base composition bias in both R1 and R2:
I detected Nextera adapter contamination and looked around to find that apparently the Nextera kit introduces some bias. And my plot does look similar to the one described here.
I'm guessing that this is an enrichment bias (similar to RNA-seq random hexamers) so nothing much can be done about it. But as I found little information I was hoping someone with more ATAC-seq (or Nextera) experience could clarify whether this is a normal issue and needs no preprocessing, or redirect me to more resources.
Thanks!
OK! thanks for the advice and I will take a look at the package!