bwa mem hard clipping
8.1 years ago
ARB ▴ 120

Hi all,

I apologize if this question has been posted or answered before, but I could not find a reasonable explanation, hence I am posting it here. Can anybody please help me understand how bwa mem decides whether a read is hard clipped or soft clipped? In other words, what criterion does bwa mem use to choose between the two?

Thanks in advance.

8.1 years ago
piet ★ 1.8k

When I started processing NGS data, it took me quite a while to understand this. 'bwa mem' is a local aligner, so the ends of a read that do not align get clipped. Soft-clipped alignments have the full sequence of the read reported in the SEQ column, while hard-clipped alignments only have the part that actually aligns to the reference sequence. This is a kind of optimization (the same read sequence is not stored repeatedly), but it has some unpleasant side effects.
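To make the difference concrete, here is a small Python sketch that reads a CIGAR string and works out how many bases should appear in SEQ. The two CIGAR strings are made-up examples for a hypothetical 100 bp read, not output from a real run; the rule that only M, I, S, = and X operations consume SEQ bases comes from the SAM specification.

```python
import re

# One CIGAR operation: a length followed by an operation letter.
CIGAR_OP = re.compile(r"(\d+)([MIDNSHP=X])")

# Operations whose bases are present in the SEQ column (per the SAM spec).
# Note that H (hard clip) is NOT in this set, while S (soft clip) is.
SEQ_CONSUMING = set("MIS=X")


def parse_cigar(cigar):
    """Split a CIGAR string into (length, operation) tuples."""
    return [(int(n), op) for n, op in CIGAR_OP.findall(cigar)]


def seq_length(cigar):
    """Number of bases expected in SEQ for this CIGAR."""
    return sum(n for n, op in parse_cigar(cigar) if op in SEQ_CONSUMING)


def clipped_bases(cigar):
    """Count soft-clipped and hard-clipped bases separately."""
    ops = parse_cigar(cigar)
    return {
        "soft": sum(n for n, op in ops if op == "S"),
        "hard": sum(n for n, op in ops if op == "H"),
    }


# Hypothetical 100 bp read split into two alignments.
soft_clipped = "60M40S"   # full read kept in SEQ
hard_clipped = "60H40M"   # only the aligned 40 bases kept in SEQ

print(seq_length(soft_clipped), clipped_bases(soft_clipped))
print(seq_length(hard_clipped), clipped_bases(hard_clipped))
```

In both cases the read is 100 bp long, but only the soft-clipped record still carries all 100 bases.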

Usually the first (primary) alignment of a read is soft clipped, i.e. written to the SAM file with its full sequence, while any further (supplementary) alignments of the same read are hard clipped. Often you will reorder the SAM file with 'samtools sort'. After this operation it is no longer guaranteed that the first occurrence of a read has the full sequence.
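Because of this, it is safer to look at the FLAG column than at record order. The sketch below assumes single-end reads and plain SAM text; the two records are toy data invented for illustration, and the helper names are my own. The bit values 0x100 (secondary) and 0x800 (supplementary) are the ones defined in the SAM specification.

```python
SECONDARY = 0x100      # SAM FLAG bit: secondary alignment
SUPPLEMENTARY = 0x800  # SAM FLAG bit: supplementary alignment


def is_primary(flag):
    """True if the record is neither secondary nor supplementary."""
    return (flag & (SECONDARY | SUPPLEMENTARY)) == 0


def full_sequence_records(sam_lines):
    """Yield (QNAME, SEQ) for primary records, skipping header lines."""
    for line in sam_lines:
        if line.startswith("@"):
            continue
        fields = line.rstrip("\n").split("\t")
        qname, flag, seq = fields[0], int(fields[1]), fields[9]
        if is_primary(flag):
            yield qname, seq


# Toy coordinate-sorted records: the hard-clipped supplementary
# alignment happens to come before the primary one.
toy_sam = [
    "@HD\tVN:1.6\tSO:coordinate",
    "read1\t2048\tchr1\t100\t60\t6H4M\t*\t0\t0\tACGT\tIIII",
    "read1\t0\tchr1\t500\t60\t6M4S\t*\t0\t0\tTTGACCACGT\tIIIIIIIIII",
]

print(list(full_sequence_records(toy_sam)))
```

For paired-end data both mates share the same QNAME, so you would additionally need to tell them apart with the 0x40 and 0x80 bits.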

Hard-clipping does indeed cause unexpected unpleasantness. I consider it bad practice (a premature optimization, essentially).

Can you disable hard clipping in bwa altogether?

Have you tried 'bwa mem -Y'? The -Y option makes bwa mem use soft clipping for supplementary alignments as well.
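If you drive the aligner from a script, a minimal sketch of assembling such a call could look like this. The file names ref.fa and reads.fq are placeholders and the helper function is hypothetical; the snippet only builds and prints the command, it does not run bwa.

```python
import shlex


def bwa_mem_command(reference, reads, soft_clip_supplementary=True):
    """Build the argument list for a single-end 'bwa mem' run.

    reference and reads are placeholder paths supplied by the caller.
    """
    cmd = ["bwa", "mem"]
    if soft_clip_supplementary:
        # -Y: use soft clipping for supplementary alignments
        cmd.append("-Y")
    cmd += [reference, reads]
    return cmd


cmd = bwa_mem_command("ref.fa", "reads.fq")
print(shlex.join(cmd))
```

The resulting list can be passed to subprocess.run with stdout redirected to a SAM file, assuming bwa is installed and the reference has been indexed.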
