Average depth/coverage before and after PCR duplicates cleanup
1
0
Entering edit mode
4.1 years ago

Hello everyone,

I am using a pipeline (Paleomix) which mark duplicates in BAM files using Picard MarkDuplicates, but without removing them from the outputs.

I wish to compare the coverage in these files both with and without PCR duplicates filtering.

Would you know any tool or command adapted to this task?

I have tried to use different samtools depth scripts, but they all give me different results.

Thanks for your help!

markduplicates samtools meandepth BAM • 2.0k views
ADD COMMENT
0
Entering edit mode

Try Mosdepth.

ADD REPLY
0
Entering edit mode

Thanks for the advice, Mosdepth looks a very interesting tool! Unfortunately, on the Github page I don't see the flag or any option for allowing/ignoring the PCR duplicates annotations in the BAM file. Would you remember the command used for it?

ADD REPLY
1
Entering edit mode
4.1 years ago
ATpoint 82k

-F --flag <FLAG> exclude reads with any of the bits in FLAG set [default: 1796]

The duplicate flag is 1024, see http://broadinstitute.github.io/picard/explain-flags.html. I second the advice for mosdepth (unfortunately only available on Linux) because it is wickedly fast.

Edit 10.5.20: I managed to build mosdepth from source on Mac, see comment at the bottom of this issue: https://github.com/brentp/mosdepth/issues/103

ADD COMMENT
0
Entering edit mode

That's very precious information as well ATpoint!

I did not make the relation between this flag and the annotations in the BAM file. Definitely going to use Mosdepth. Thanks!

ADD REPLY

Login before adding your answer.

Traffic: 2908 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6