Entering edit mode
16 months ago
Mo
▴
50
Hi,
I have mapped my reads from nanopore RNAseq to the reference database using the minimap. Minimap SAM files have MAPQ values from 0 to 60 in the fifth column. While 0 indicates the reads that can possibly map to many locations and have the lowest quality, 60 indicates the highest quality and unique reads.
I am losing a bit more reads at the MAPQ 60 and the reads are way shorter (up to 200-300bp, I want at least 1000bp). I was wondering what could be the minimum MAPQ values that I can use to retain the unique reads with better length and quality.
Thanks a lot.
Look at the distribution of MAPQ values and decide. See: https://github.com/lh3/minimap2/issues/447
It always depends on what you want to do with the data.
That is not too specific, do you mean genome, transcriptome, or something else?
Hi, I am sequencing RNA of the GFP library, so I am mapping to GFP sequences (database having 5'UTR-CDS-barcodes_3'UTR of all the variants).