Hello, I am working with Alzheimer's single nucleus rna-seq data. The data from post-mortem human brain tissue. Clearly Alzheimer's pathophysiology affects cell collection quality.
I was given FASTQ files which I processed with the standard 10X cell ranger pipeline. I ended up getting raw bc matrices and filtered bc matrices. I am not getting a strong neuronal signal when using the filtered bc matrices. Should I be using the raw bc matrices and then adding my own filters for better neuronal identification during clustering?
I'm not sure why I am not seeing any real neuronal signal. I am seeing a mixed array of immune cells but not neuron signatures and definitely not layer specific markers in my highly variable genes.
Is it appropriate to start with raw bc matrices, I suspect cell ranger might be getting rid of neurons in its filtering process?
Thanks,
Cellranger is usually quite harsh on cell selection and easily discard non optimal cells, but nuclei should be more resilient to bursting. How many nuclei are present in your raw compare to your filtered ?
Not finding neuronal genes expressed in a brain tissue is quite disturbing. Do you see any expression plotting neuronal gene markers on your UMAP/tSNE ?
Do you have the whole brain sequenced or a specific area ?
Its on a sample by sample basis, but raw has ~2 million, but cell ranger filtered h5 files have ~7k cells. None of key neuronal markers make it past HVG (top 5000), since they are not variable. None of clusters have a clear neuronal signal. The brain region of interest is the MTG.
The filters i use are min_genes=100 and min_counts=900 for sc.pp.filter_cells in scanpy, mt_thresh=5, ribo_thresh=25, hb_thresh=1
Should I change them?