The purpose of removing duplicates is to mitigate the effects of PCR amplification bias. However, this step can lead to removing reads that were not a consequence of PCR amplification, so removing important info. Some works suggest that duplicate removal is not necessary because the impact of doing so is minimal when calling variants (see link).
What do you recommend? Do you know any paper or work which points benefits of doing duplicate removal?