Question

Integration Seurat different healthy samples (fresh vs frozen)

2


2.0 years ago

Tolga ▴ 30

Hi Bio-community,

I am analyzing a single-cell dataset with the Seurat workflow. I have 8 samples in total, each from a different patient. Seven of them were frozen, and S8 is the only fresh sample. The UMAP plot shows a clear batch effect between S8 and all the other samples. How should I proceed? I have several questions:

1. Should I integrate only between fresh vs frozen?
2. Should I integrate on both fresh vs frozen and the fact that the samples originate from different donors/patients?
3. How can I address 1 and 2?
4. Would it be better to analyze fresh and frozen separately?

Best, Tolga

Before integration: [figure: UMAP plot of the samples before integration]

Integration Seurat • 2.4k views


score 3 · Answer 1 · 2023-10-28

Should I integrate only between fresh vs frozen?

If you consider fresh vs frozen to be the confounding factor, integrating on that variable will attempt to remove the differences between the two groups, so the variation that remains will reflect differences between the individual samples.

Should I integrate on both fresh vs frozen and the fact that the samples originate from different donors/patients?

The best approach is to define each sample as a batch, which generally produces the strongest batch correction. You can also add fresh vs frozen as a second batch variable alongside the per-sample batch. I would evaluate both options separately and judge which result makes biological sense. For example, if you are using Harmony, you can do something like this:

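library(harmony)
# Treat each sample (donor) as its own batch; SO needs a PCA reduction already computed.
harmonized_SO <- RunHarmony(SO,
                            group.by.vars = "Sample",
                            reduction = "pca", assay.use = "SCT", reduction.save = "harmony")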

OR

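# Correct for sample and preservation method together;
# "fresh_vs_frozen" must exist as a metadata column in SO.
harmonized_SO <- RunHarmony(SO,
                            group.by.vars = c("Sample", "fresh_vs_frozen"),
                            reduction = "pca", assay.use = "SCT", reduction.save = "harmony")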
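To make "evaluate them separately" a bit more concrete, here is a minimal sketch of one way to set up the `fresh_vs_frozen` column and inspect the result after Harmony. The helper names (`label_preservation`, `sample_mix_by_cluster`, `embed_and_cluster`) are hypothetical and not part of Seurat or harmony, `dims = 1:30` is an arbitrary assumption, and the per-cluster sample proportions are only a rough heuristic for batch mixing, not a formal metric.

```r
# Hypothetical helper: label each cell as fresh or frozen from its sample ID.
# Assumes S8 is the only fresh sample, as described in the question.
label_preservation <- function(sample_ids, fresh_samples = "S8") {
  ifelse(sample_ids %in% fresh_samples, "fresh", "frozen")
}

# Hypothetical helper: fraction of cells from each sample within each cluster.
# Each row (cluster) sums to 1; a cluster made up almost entirely of S8
# would suggest that the batch effect is still present.
sample_mix_by_cluster <- function(clusters, sample_ids) {
  prop.table(table(cluster = clusters, sample = sample_ids), margin = 1)
}

# Hypothetical helper: UMAP and clustering on the Harmony embedding.
# Uses the reduction name "harmony" from reduction.save in the calls above;
# dims = 1:30 is an assumption, adjust to your data.
embed_and_cluster <- function(SO, dims = 1:30) {
  SO <- Seurat::RunUMAP(SO, reduction = "harmony", dims = dims)
  SO <- Seurat::FindNeighbors(SO, reduction = "harmony", dims = dims)
  SO <- Seurat::FindClusters(SO)
  SO
}

# Usage (commented out because it needs a real Seurat object):
# SO$fresh_vs_frozen <- label_preservation(SO$Sample)   # before RunHarmony
# harmonized_SO <- embed_and_cluster(harmonized_SO)     # after RunHarmony
# Seurat::DimPlot(harmonized_SO, group.by = "Sample")
# sample_mix_by_cluster(harmonized_SO$seurat_clusters, harmonized_SO$Sample)
```

Running this once for each `group.by.vars` choice and comparing the UMAPs and the mixing tables side by side is one way to decide which correction makes more biological sense.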

Would it be better to analyze fresh and frozen separately?

You can follow this approach too. Once again, it depends on what you want to achieve. I recommend reading the data integration material in Single-cell best practices, which discusses these approaches very well.
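If you do go the separate-analysis route, a minimal sketch of the split could look like this. It assumes a metadata column named `fresh_vs_frozen` with the values "fresh" and "frozen" (names taken from the Harmony example, values assumed), and `split_by_preservation` is a hypothetical wrapper around Seurat's `SplitObject`.

```r
# Hypothetical helper: split a Seurat object into one object per preservation group.
# Assumes a metadata column named "fresh_vs_frozen" exists in SO.
split_by_preservation <- function(SO, column = "fresh_vs_frozen") {
  Seurat::SplitObject(SO, split.by = column)
}

# Usage (commented out because it needs a real Seurat object):
# parts  <- split_by_preservation(SO)
# frozen <- parts[["frozen"]]   # the 7 frozen samples, still from different donors
# fresh  <- parts[["fresh"]]    # S8 only
```

Note that the frozen subset would still contain seven donors, so per-sample integration may remain relevant there, while the fresh subset consists of a single donor.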

score 0 · Answer 2 · 2023-10-28


2.0 years ago

Ming Tommy Tang ★ 4.7k

It depends on what you want to do. They are all healthy samples, so if you expect the cell types to be similar across samples, you can use Harmony or another method for batch correction. I recommend reading https://bioconductor.org/books/3.17/OSCA.multisample/differential-abundance.html#comments-on-interpretation