Question: How Does Bwa Report Multi-Mapped Reads
0
gravatar for bbio
5.2 years ago by
bbio80
United Kingdom
bbio80 wrote:

I am mapping some RNA-seq data with bwa and would like to do some analysis on where multi-mapped reads fall.

I know that I can extract multi-mapped reads by looking for mapq < 23 and/or the XA flag on the reads. However, I am wondering how bwa decides which location to report for a read that can be mapped to two different locations equally well. Does it choose a random one? Does it always report the first one? Something else?

Does anybody know what exactly bwa does here?

mapping bwa • 3.9k views
ADD COMMENTlink modified 5.2 years ago by Istvan Albert ♦♦ 80k • written 5.2 years ago by bbio80
3
gravatar for Istvan Albert
5.2 years ago by
Istvan Albert ♦♦ 80k
University Park, USA
Istvan Albert ♦♦ 80k wrote:

A random location is selected.

Also see this post, although not specifically bwa related but it goes to show that things can go wrong:

ATTENTION: bowtie2 and multiple hits

ADD COMMENTlink modified 5.2 years ago • written 5.2 years ago by Istvan Albert ♦♦ 80k

Do you have any idea if the MAPQ will be 0 in case of multiple mapping or something else? (I read two opinions about that)

ADD REPLYlink written 21 months ago by Medhat8.2k

The MAPQ=0 is a convention that bwa uses and not a standard.

And even considering it a convention it is not quite right. Having a multi-mapped read does not mean that the chance of the alignment being correct is zero.

The best way to detect multimapping is the check the SAM tag for alternative mappings.

ADD REPLYlink written 21 months ago by Istvan Albert ♦♦ 80k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1873 users visited in the last hour