[TCGAbiolinks] miRNA expression analysis error: unidentified columns
Entering edit mode
18 months ago


I am new to using R and have wanted to use to analyze expression data from TCGA. Right now I am using TCGAbiolinks and I am following their documentation on miRNA expression analysis.

I have made a few changes to code outlined there to fit analyzing the L-AML dataset, which is as follows:

Getting L-AML data

require(TCGAbiolinks) LAMLdata <- "TCGA-LAML" DataDirectory <- paste0("LAML", gsub("-", "_", LAMLdata)) FileNameData <- paste0(DataDirectory, "_", "miRNA_gene_quantification" , ".rda")

query.miR <- GDCquery(project = LAMLdata, data.category = "Transcriptome Profiling", data.type = "miRNA Expression Quantification")

samplesDown.miR <- getResults(query.miR, cols=c("cases"))

dataSmTRBM.miR <- TCGAquery_SampleTypes(barcode = samplesDown.miR, typesample = "TRBM")

dataSmNBM.miR <- TCGAquery_SampleTypes(barcode = samplesDown.miR, typesample = "NBM")

dataSmTRBM_short.mir <- dataSmTRBM.miR[1:10]

dataSmNBM_short.mir <- dataSmNBM.miR[1:10]

queryDown.miR <- GDCquery(project = LAMLdata, data.category = "Transcriptome Profiling", data.type = "miRNA Expression Quantification", barcode = c(dataSmTRBM_short.mir, dataSmNBM.miR))

GDCdownload(query = queryDown.miR, directory = DataDirectory)

dataAssay.miR <- GDCprepare(query = queryDown.miR, save = TRUE, save.filename = FileNameData, summarizedExperiment = TRUE, directory = DataDirectory)

rownames(dataAssay.miR) <- dataAssay.miR$miRNA_ID

using read_count's data

read_countData <- colnames(dataAssay.miR)[grep("count", colnames(dataAssay.miR))] dataAssay.miR <- dataAssay.miR[,read_countData] colnames(dataAssay.miR) <- gsub("read_count_", "", colnames(dataAssay.miR))

dataFilt <- TCGAanalyze_Filtering(tabDF = dataAssay.miR, method = "quantile", qnt.cut = 0.25)

dataDEGs <- TCGAanalyze_DEA(mat1 = dataFilt[,dataSmNBM_short.mir], mat2 = dataFilt[,dataSmTRBM_short.mir], Cond1type = "Normal", Cond2type = "Tumor", fdr.cut = 0.01, logFC.cut = 1, method = "glmLRT")

When I source the code, I am ultimately met with this error:

Error in [.data.frame(dataFilt, , dataSmNBM_short.mir) : undefined columns selected

What I have noticed is that dataSmNBM.miR and dataSmTRBM.miR have a character (empty) value assigned to them, and thus I suspect this to be the first cause of my woes.

Can someone explain to me where things might have gone awry? I have gone over the code multiple times and have consulted documentation and everything still seems to be logical to me.

Thank you for your time and many thanks!

tcgabiolinks • 387 views

Login before adding your answer.

Traffic: 1565 users visited in the last hour
Help About
Access RSS

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6