Picard remove duplicate leads to some unpaired reads
0
0
Entering edit mode
7.2 years ago
billzt ▴ 20

Hi to all. I have a Illuminna pair-end reads mapping result in bam format. I use picard to remove PCR duplicate reads:

java \
  -jar picard.jar \
  MarkDuplicates \
  INPUT=a.sortpos.bam \
  OUTPUT=a.rmdup.bam \
  METRICS_FILE=a.rmdup.log \
  REMOVE_DUPLICATES=true \
  MAX_FILE_HANDLES_FOR_READ_ENDS_MAP=1000

Then I use bamToFastq (from bedtools) to extract the reads from the output file "a.rmdup.bam". However bamToFastq warns a lot of unpaired reads. My raw input reads were all exactly paired. Therefore it must be picard's removing duplicate reads leading to unpaired, that it, it might only remove one read but left its mate.

How to deal with this problem?

next-gen duplicate picard • 3.0k views
ADD COMMENT
1
Entering edit mode

What exactly is the problem you want to solve? Are you wanting Picard's Remove Duplicates tool to remove both the duplicate read and its mate?

ADD REPLY

Login before adding your answer.

Traffic: 1917 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6