Entering edit mode
11 months ago
archanaverma433
▴
10
Hi,
I have single cell RNA seq data and cell ranger count output. How to calculate duplicated reads in each sample.
How this duplicated reads affect the quality or further analysis?
thank you for the reply.
https://kb.10xgenomics.com/hc/en-us/articles/115003646912 In this article they have mentioned this method, they have used "samtools flagstat"
...
In this duplicated reads is more than secondary.
i have also run same for my sample :
in my bam you can see duplicated reads is much more. Is this correct way to calculate and what does it means 19909293 reads are duplicated in 31596859 reads mapped?
Is this affect my downstream analysis?
Thank you!!!
Honestly, don't do these sorts of analysis. Single-cell data have tremendous duplication, and this is expected. That's why one uses UMIs, and that is all that you can do about it. Continue with downstream analysis, there is most likely no novel and interesting biology you are going to generate from counting duplicate reads.