Question: Picard duplicate removal leaves some unpaired reads
5.1 years ago, billzt wrote:

Hi all. I have Illumina paired-end reads mapped and stored in BAM format. I used Picard to remove PCR duplicates:

java -jar picard.jar MarkDuplicates INPUT=a.sortpos.bam OUTPUT=a.rmdup.bam METRICS_FILE=a.rmdup.log REMOVE_DUPLICATES=true MAX_FILE_HANDLES_FOR_READ_ENDS_MAP=1000

Then I used bamToFastq (from bedtools) to extract the reads from the output file "a.rmdup.bam". However, bamToFastq warns about a lot of unpaired reads. My raw input reads were all exactly paired, so it must be Picard's duplicate removal that leaves reads unpaired; that is, it might remove only one read of a pair and leave its mate behind.
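To check whether mates are really missing from the BAM, one option is to count the ends seen per read name. The following is only a minimal sketch, not part of Picard or bedtools: the function name `find_orphans` and the example records are made up for illustration, and it assumes SAM-formatted text lines as input (for example, as produced by `samtools view -h`). It ignores secondary and supplementary alignments and reports paired reads for which only one end is present.

```python
from collections import defaultdict

# SAM flag bits (from the SAM specification)
PAIRED, READ1, READ2 = 0x1, 0x40, 0x80
SECONDARY, SUPPLEMENTARY = 0x100, 0x800


def find_orphans(sam_lines):
    """Return sorted names of paired reads whose mate is absent from sam_lines.

    Hypothetical helper for illustration; expects SAM text lines.
    """
    seen = defaultdict(set)
    for line in sam_lines:
        # Skip blank lines and header lines (headers start with '@')
        if not line.strip() or line.startswith("@"):
            continue
        fields = line.rstrip("\n").split("\t")
        name, flag = fields[0], int(fields[1])
        # Only primary alignments of paired reads are counted
        if not flag & PAIRED or flag & (SECONDARY | SUPPLEMENTARY):
            continue
        if flag & READ1:
            seen[name].add(1)
        if flag & READ2:
            seen[name].add(2)
    # A complete pair has both end 1 and end 2 present
    return sorted(name for name, ends in seen.items() if ends != {1, 2})


# Made-up example records: r1 has both mates, r2 has only its first mate
example = [
    "@HD\tVN:1.6\tSO:coordinate",
    "r1\t99\tchr1\t100\t60\t50M\t=\t300\t250\t*\t*",
    "r2\t99\tchr1\t150\t60\t50M\t=\t350\t250\t*\t*",
    "r1\t147\tchr1\t300\t60\t50M\t=\t100\t-250\t*\t*",
]
print(find_orphans(example))  # ['r2']
```

If such a check finds no orphans in "a.rmdup.bam", the warnings may come from read order rather than from Picard: bamToFastq expects the two mates of a pair to be adjacent in the file when writing paired output, which is not the case in a position-sorted BAM.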

How to deal with this problem?


picard next-gen duplicate • 2.3k views

What exactly is the problem you want to solve? Do you want Picard's duplicate removal to remove both the duplicate read and its mate?

written 5.1 years ago by Dan D