Hi to all. I have a Illuminna pair-end reads mapping result in bam format. I use picard to remove PCR duplicate reads:
java \ -jar picard.jar \ MarkDuplicates \ INPUT=a.sortpos.bam \ OUTPUT=a.rmdup.bam \ METRICS_FILE=a.rmdup.log \ REMOVE_DUPLICATES=true \ MAX_FILE_HANDLES_FOR_READ_ENDS_MAP=1000
Then I use bamToFastq (from bedtools) to extract the reads from the output file "a.rmdup.bam". However bamToFastq warns a lot of unpaired reads. My raw input reads were all exactly paired. Therefore it must be picard's removing duplicate reads leading to unpaired, that it, it might only remove one read but left its mate.
How to deal with this problem?