Entering edit mode
3.6 years ago
Rob
▴
170
Hi friends I am using R code to download HT-Seq count data Does anybody know what line should I add to the code to download only coding genes? because my code output is all genes (coding& non-coding)
my code:
CancerProject <- "TCGA-KIRC"
query <- GDCquery(project = CancerProject,
data.category = "Transcriptome Profiling",
data.type = "Gene Expression Quantification",
sample.type = c("Primary Tumor"),
workflow.type = "HTSeq - Counts")
#download raw counts for DESEq2
GDCdownload(query)
data <- GDCprepare(query, save = TRUE, save.filename = "exp.rda")
rna <- as.data.frame(SummarizedExperiment::assay(data)) # exp matrix
write.csv(rna, "rna.csv")
clinical <- data.frame(data@colData) # associated clinical data
write.csv(clinical, "clinical.csv")