Hello everyone, I have an expression matrix of log(TPM + 1) values for 14,000 cells and 23,000 genes (GSE87544). In the paper (https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5782816/#SD9), the authors started from these 14,000 cells and reduced the data to 3,000 cells and 2,000 genes before clustering the cells with Seurat.
I am new to single-cell sequencing and still learning. I would appreciate help with pre-processing single-cell sequencing data (or, as in this case, with finding the gene/transcript distribution), since I could not find sources that discuss data pre-processing in detail.
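To make the question concrete, this is roughly the kind of reduction I imagine, written as a minimal NumPy sketch. Everything in it is my own assumption rather than the authors' method: the function name `filter_cells_and_genes` is hypothetical, the thresholds are placeholders, and ranking genes by plain variance is only a stand-in for whatever variable-gene selection the paper actually used.

```python
import numpy as np


def filter_cells_and_genes(log_tpm, min_genes_per_cell, n_top_genes):
    """Hypothetical helper: filter a genes x cells log(TPM + 1) matrix.

    Keeps cells with at least `min_genes_per_cell` detected genes, then
    keeps the `n_top_genes` genes with the highest variance across the
    retained cells. Both thresholds are assumptions, not published values.
    """
    log_tpm = np.asarray(log_tpm, dtype=float)

    # A gene counts as detected in a cell if its value is above zero.
    genes_per_cell = (log_tpm > 0).sum(axis=0)
    keep_cells = genes_per_cell >= min_genes_per_cell
    filtered = log_tpm[:, keep_cells]

    # Rank genes by variance across the retained cells (highest first).
    gene_var = filtered.var(axis=1)
    n_top = min(n_top_genes, filtered.shape[0])
    top_genes = np.sort(np.argsort(gene_var)[::-1][:n_top])

    return filtered[top_genes, :], keep_cells, top_genes


# Toy data only: 50 genes x 20 cells of simulated log(TPM + 1) values.
rng = np.random.default_rng(0)
toy = np.log1p(rng.poisson(0.5, size=(50, 20)).astype(float))
subset, kept_cells, kept_genes = filter_cells_and_genes(
    toy, min_genes_per_cell=15, n_top_genes=10
)
print(subset.shape, int(kept_cells.sum()), len(kept_genes))
```

The distribution of detected genes per cell (`genes_per_cell` above) is what I would plot as a histogram to pick a cell cutoff, but I do not know whether that is how the 3,000 cells were chosen.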
Thank you very much!