Harmonized RNA-Seq Analysis of Different GDC Queried Projects
1
0
Entering edit mode
23 months ago
dk0319 ▴ 70

Are there any vignettes or resources anyone can recommend for performing RNA-seq in which different projects from GDC are compared?

In my specific case I am comparing unmatched metastatic prostate samples to prostate samples from a primary tumor, using harmonized star-counts.

I have tried TMM/voom/limma and when I performed PCA the majority of samples cluster based on sample type (eg. Met or Primary). However, this could easily be due to the projects being (Project 1=499/500 samples are Primary and Project 2=99/99 samples are Mets)

Ideally, I am interested in advice on how to best handle this sort of analysis along with advice on how to remove potential bias due to the samples coming from different projects.

GDC Normalization DGE RNA-Seq • 602 views
ADD COMMENT
0
Entering edit mode
22 months ago
Zhenyu Zhang ★ 1.2k

Batch effect correction is time consuming, and requires a lot of knowledge and investigations. I assume you can check those PanCan or PCWAG papers for some examples. Or you can play lazy and use automatic batch effects analysis (e.g. Combat).

A good resource for TCGA RNA-Seq batch effects can be found here https://bioinformatics.mdanderson.org/public-software/tcga-batch-effects/. You can also adopt their algorithm.

ADD COMMENT

Login before adding your answer.

Traffic: 2642 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6