Getting a Count Matrix from GEO for RNA-eq
1
0
Entering edit mode
7.9 years ago
ahnje770 ▴ 20

Hi,

Could anyone help me get some sort of a count matrix when processed RNA-seq data is extracted from GEO. I have some experience using microarray data from GEO.

For example, for GSE78220, the data has been processed by: "FASTQ files were mapped by Tophat2 Tophat BAMs were quantified and normalized using Cuffnorm (for gene analysis) and by htseq-count followed by edgeR's log CPM (for gene-set analysis) Genome_build: hg19 Supplementary_files_format_and_content: The normalized expression levels by cuffnorm"

After geting the GSE by:

g <- getGEO("GSE78220")

What would be the next step?

RNA-Seq • 5.3k views
ADD COMMENT
4
Entering edit mode
7.9 years ago
GZ1995 ▴ 410

I've been recently working on reanalyzing this data set. The data for this series was provided in the supplementary file and cannot be directly accessed by exprs(). So just download the supplementary file "GSE78220_PatientFPKM.xlsx" into your working directory.

assay <- read.csv("GSE78220_PatientFPKM.csv", row.names = 1)

For phenotypic data,

e <- getGEO("GSE78220", destdir = ".")
e <- e[[1]]
pheno <- pData(e)
ADD COMMENT

Login before adding your answer.

Traffic: 3480 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6