ATAC-seq FastQC showing 5' sequence content bias
3.1 years ago
Papyrus ★ 2.9k

Hi friends,

I'm handling some ATAC-seq data. I used FastQC to get a look at the FASTQs. The data (paired-end) are high quality and have no other relevant "issues", but in the "Per base sequence content" plot I see noticeable 5' base composition bias in both R1 and R2:

[Image: FastQC "Per base sequence content" plot]

I detected Nextera adapter contamination and looked around to find that apparently the Nextera kit introduces some bias. And my plot does look similar to the one described here.

I'm guessing that this is an enrichment bias (similar to the random-hexamer priming bias in RNA-seq), so not much can be done about it. But since I found little information, I was hoping someone with more ATAC-seq (or Nextera) experience could clarify whether this is normal and needs no preprocessing, or point me to more resources.

Thanks!

ATAC-seq fastqc • 1.9k views
3.1 years ago
ATpoint 82k

My best guess is that this is the Tn5 transposase sequence bias: Nextera (and therefore ATAC-seq) uses a modified Tn5 to cleave chromatin and insert sequencing adapters. I always see this pattern in my data as well. It is relatively well described if you search for "Tn5 bias", and there are even attempts to correct for it; see, for example, the chromVAR (a Bioconductor package) paper.
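If you want to check the pattern outside FastQC, a minimal Python sketch along these lines tallies the base composition at the first few read positions. It is only an illustration, not FastQC's actual implementation: the function names `open_fastq` and `per_base_content` are made up here, and it assumes standard 4-line FASTQ records (no wrapped sequence lines).

```python
import gzip
from collections import Counter
from itertools import islice


def open_fastq(path):
    """Open a plain or gzip-compressed FASTQ file in text mode (hypothetical helper)."""
    return gzip.open(path, "rt") if str(path).endswith(".gz") else open(path, "rt")


def per_base_content(lines, n_positions=20, max_reads=100_000):
    """Return, for each of the first n_positions, the fraction of A/C/G/T/N.

    `lines` is any iterable of FASTQ lines. Assumes 4 lines per record,
    with the sequence on the 2nd line of each record.
    """
    counts = [Counter() for _ in range(n_positions)]
    # Take every 4th line starting at index 1, i.e. the sequence lines.
    seqs = islice(lines, 1, None, 4)
    for seq in islice(seqs, max_reads):
        seq = seq.strip().upper()
        for i, base in enumerate(seq[:n_positions]):
            counts[i][base] += 1
    result = []
    for c in counts:
        total = sum(c.values())
        # Positions not covered by any read get all-zero fractions.
        result.append({b: (c[b] / total if total else 0.0) for b in "ACGTN"})
    return result
```

To use it, pass an open file handle, e.g. `with open_fastq("R1.fastq.gz") as fh: table = per_base_content(fh)` (the file name is a placeholder). With Tn5 bias you would expect the fractions at the first several positions to deviate from the roughly flat composition further into the read, in both R1 and R2.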


OK! Thanks for the advice, I will take a look at the package!

