GC/AT base content crossed at the tail of read
0
0
Entering edit mode
2.5 years ago
tomas4482 ▴ 390

I used fastp and fastqc for quality control and trimming of some concatenated fq files. But I found something weird.

The GC/AT base content line at the tail crossed at around 90bp.

crossed tail.

At first I thought trimming the tail could solve this problem. But it fails. When trim the front 15bp and tail 10bp using fastp -f 15 -t 15 , the crossed base content line still crossed at around 60bp.

after trimming

This situation occurs only in a few concatenated fq. Others are fine. It seems concatenation is not the cause. Does anyone know what happens here?

Thanks.

DNA sequence WGS WES • 740 views
ADD COMMENT
0
Entering edit mode
ADD REPLY
0
Entering edit mode

I've read this document before. I don't think the front bias is problematic.

The real problems are: 1. I don't understand why and how the nt base content largely changed at the tail (but the ratio of G:C and A:T remains normal). 2. No matter how long I trim the tail (I've tried 10bp and 15bp), it does not remove the bias. This bias will move to a upstream position after filtering rather than disappear.

What does it mean? Do you have any idea?

ADD REPLY
0
Entering edit mode

What kind of data is this? Have you tried to align it? That pattern may be a result of library prep method.

ADD REPLY

Login before adding your answer.

Traffic: 3274 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6