Issue with DESeq2: Unable to create a DESeqDataSet because names in colData don't match column names in countData
0
0
Entering edit mode
5.8 years ago
catglen012 ▴ 10

Hello! I loaded my gene(/transcript) count matrix and labels into DESeq2:

> countData <- as.matrix(read.csv(file.choose(), row.names = "gene_id"))
> colData <- read.table(text = readLines(file.choose(), warn = FALSE), header = TRUE, sep = "," )

But when I check that all sample IDs in colData are also in CountData and match their orders, I obtain the following:

> all(rownames(colData) %in% colnames(countData))
[1] FALSE

It then checked to see how my gene_count_matrix.csv looks like and the column has MSTRG as the gene ID and it does not match the ID names found in phenotype_data.

I was wondering how do I make them both match so that I am able to generate a DESeqDataSet? My phenotype_data looks just like the data found in the nature paper

https://www.nature.com/articles/nprot.2016.095

and I followed exactly the protocol for Using StringTie with DESeq2.

RNA-Seq R DESeq2 • 3.1k views
ADD COMMENT
0
Entering edit mode

Please post the first line of countData and the contents of colData.

ADD REPLY
0
Entering edit mode

countData:

gene_id    Ceratodon_GG1    Ceratodon_R40
MSTRG.9134      3497                  998

colData:

ids                 sex
Ceratodon_GG1   female
ADD REPLY
0
Entering edit mode

You only have 2 samples?

ADD REPLY
0
Entering edit mode

hello! I was able to resolve the issue by fixing how the colData is arranged. Thanks for the help nevertheless.

ADD REPLY
0
Entering edit mode

Hi catglen012, could you mention what exactly you did to the colData to fix it? I have the same issue

ADD REPLY

Login before adding your answer.

Traffic: 2036 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6