Unable to validate biomarkers obtained from single-cell RNA-seq data
1
0
Entering edit mode
14 months ago
Gene_MMP8 ▴ 240

I have compiled a list of biomarkers related to chemoresistance in triple-negative breast cancer patients using single-cell RNA-seq data. I am now performing validation on independent patient cohorts with microarray measurements. Some of the top genes identified by my method include a lot of single nuclear RNA (snRNA) genes such as SNAR-A10, SNAR-A11, SNAR-A7, SNAR-B8, etc. I cannot find these genes in the microarray dataset and am stuck at the validation step. My questions are as follows:

  1. Should I remove these RNA genes upfront and continue with the analysis? That way, these won't show up as biomarkers.
  2. Is there any way to map these genes to the microarray data (GPL96 HG-U133A Affymetrix Human Genome U133A Array)?

Any help is highly appreciated.

biomarkers single-cell Seurat • 641 views
ADD COMMENT
0
Entering edit mode
14 months ago
ATpoint 82k

These short RNA species (snoRNA, miRNA etc) are usually not covered in arrays or RNA-seq since the array or library preparation as well as initial RNA extraction do not capture them well. I am not sure whether scRNA-seq should capture them or whether this is rather noise, check the literature. As for your problem, you need dedicated smallRNA-seq datasets or arrays for this analysis.

ADD COMMENT
0
Entering edit mode

Thank you for your comment. I will look through the literature and try to find dedicated arrays. Would you suggest removing these genes from the single-cell data? I am thinking in lines of preprocessing.

ADD REPLY
0
Entering edit mode

I would consider that indeed. Maybe remove all small RNA types or retain only biotypes such as lncRNAs and protein-coding genes. This can be fone directly be filtering the count matrix.

ADD REPLY

Login before adding your answer.

Traffic: 2447 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6