Our routine QC procedures include using fastqc to estimate duplicate reads. Some recently added datasets caught my attention. We noticed a subset of these samples have wildly different estimated duplicated reads in each end. What could be the issue here?
A related post: High level of duplicate in one reads of paired-end data