Question

Mapping Quality Scale Filtering

0

Entering edit mode

10.6 years ago

rob234king ▴ 610

I have a highly repetitive, 3x versions of chromosome containing reference with a great deal of similarity to map to.

I've mapped reads using bowtie2 but want to exclude those of reads that could have mapped to two or more locations so were not identified a unique position. I was thinking best way was to filter SNP calling i.e. samtools mpileup based upon mapping score as the better the mapping the higher the score? But I don't know what value of mapping score (-q) to start filtering from because I don't know the scale or any experience where to start from. Any advise helpful.

Thanks

mapping • 4.7k views

ADD COMMENT • link updated 10.6 years ago by Ashutosh Pandey 12k • written 10.6 years ago by rob234king ▴ 610

score 4 · Answer 1 · 2013-10-04

4

Entering edit mode

10.6 years ago

Ashutosh Pandey 12k

When you used bowtie2 you should have gone for the stringent mode. In case of an aligner like BWA, if you have a mpileup then I would suggest to go with a mapping score of 20 that will correspond to 1 in 100 false positives. But Bowtie2 doesnt produces smooth mapping quality scores and the only possible values are

255 = unique mapping

3 = maps to 2 locations in the target

2 = maps to 3 locations

1 = maps to 4-9 locations

0 = maps to 10 or more locations.

I would suggest you to just go with 255 or uniquely mapped reads. I am not sure how much data you would loose like this but you can try. At least it will give you very less false positives.

ADD COMMENT • link 10.6 years ago by Ashutosh Pandey 12k

0

Entering edit mode

You're thinking of tophat2, bowtie2 produces smooth MAPQs.

ADD REPLY • link 10.6 years ago by Devon Ryan 104k

0

Entering edit mode

Yup you are right. My bad. In that case going with a MAPQ of 20 will be better. Though this is not the best approach but it should be fine.

ADD REPLY • link 10.6 years ago by Ashutosh Pandey 12k

0

Entering edit mode

So 20 for Bowtie2 and BWA?

ADD REPLY • link 10.6 years ago by rob234king ▴ 610

1

Entering edit mode

I will go with 20 for both.

ADD REPLY • link 10.6 years ago by Ashutosh Pandey 12k