Question

Normalisation on subset of genes

1

Entering edit mode

22 months ago

Mia ▴ 20

Hi everyone

My lab collected blood plasma cfRNA samples from breast cancer patients and non-cancer patients as controls. The PI designed a custom gene-chip to sequence 65 genes he predicts will be upregulated in cancer patients. He did this to save cost (cheaper than sequencing entire transcriptome) and to reduce noise (since his previous study showed that cfRNA data can be very noisy).

We now have the data and I'm meant to start analysing it, but I have no idea how to normalise it...

No housekeeping genes were sequenced, and many of the genes are expected to be differentially expressed. This makes TMM, RPKM, and other commonly used methods like DESEq2 inappropriate.

Any idea what I could do?

I thought perhaps to CLR or log transform it and then doing a Welch t-test between the two groups.

Thank you in advance for your feedback.

genechip RNA-Seq differential-expression • 861 views

ADD COMMENT • link updated 22 months ago by ATpoint 88k • written 22 months ago by Mia ▴ 20

1

Entering edit mode

So it's only 65 genes and all of these are expected to be DE? If so, most terrible possible design. I see no way to analyse this for DE since there is no reference. No matter which test you do, it all comes down to the same question, which is the baseline? You have none, so no analysis can be done. Absolutely terrible design...

RNA-seq these days costs 200$ per sample at commercial providers, I can hardly imagine PI saved costs here.

ADD REPLY • link 22 months ago by ATpoint 88k

score 2 · Answer 1 · 2023-09-05

2

Entering edit mode

22 months ago

Trivas ★ 1.9k

Look into some microarray DGE analysis. This is a handy slidedeck that could be a starting point https://www3.nd.edu/~steve/Rcourse/Lecture11v1.pdf

and a Plos Comp Bio paper https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2762517/

ADD COMMENT • link 22 months ago by Trivas ★ 1.9k