Question

WGCNA sequence of commands

0

Entering edit mode

7.9 years ago

aj123 ▴ 120

Hi,

Im trying to do a WGCNA on my data. I have 3 columns:

gene1 expression_value_for_condition_1 expression_value_for_condition_2

im trying to follow the commands on the WGCNA tutorial to generate the heatmap and matrix but im getting this error:

Error in goodGenes(datExpr, goodSamples, goodGenes, minFraction = minFraction,  : 
  Too few genes with valid expression levels in the required number of samples.

further:

gsg = goodSamplesGenes(ecvssmc, verbose = 3)
 Flagging genes and samples with too many missing values...
  ..step 1
Error in goodGenes(datExpr, goodSamples, goodGenes, minFraction = minFraction,  : 
  Too few genes with valid expression levels in the required number of samples.
> gsg$allOK
Error: object 'gsg' not found

i have installed the gsg package and loaded it also.

is this a case of too few samples?

WGCNA R gene expression corelation statistics • 4.3k views

ADD COMMENT • link updated 7.7 years ago by Lluís R. ★ 1.2k • written 7.9 years ago by aj123 ▴ 120

0

Entering edit mode

Why are you running WGCNA on two samples ?

ADD REPLY • link 7.9 years ago by GouthamAtla 12k

0

Entering edit mode

Goutham-are you the same guy who was with me at NCBS? i notice youve been making random comments on my posts. your comment here is misleading and appears to be giving others the wrong impression of what my file is like, which is not very helpful. i intented to mean that this was a sample file with those titles. note i said these are the columns. not the actual samples themselves.

ADD REPLY • link 7.9 years ago by aj123 ▴ 120

1

Entering edit mode

The error is very much related to the number of samples. The error says "Too few genes with valid expression levels in the required number of samples."

WGCNA is recommended to run on at least 15 samples but you have only 2 samples. I was trying to point out that.

And gsg is an object you were trying to create, so no need to install gsg package which might introduce unknown naming conflicts.

ADD REPLY • link 7.9 years ago by GouthamAtla 12k

0

Entering edit mode

Ok yes - I understand what youre trying to say. But like i mentioned these are the column headers. I have more than 40 samples so I was not sure why I was getting the error.

ADD REPLY • link 7.9 years ago by aj123 ▴ 120

0

Entering edit mode

You need a 'header-field' for each sample - an mxn table as commonly said. Every column needs explicitly the name of the sample from which the (normalized!) counts are derived.

ADD REPLY • link 7.9 years ago by WouterDeCoster 47k

0

Entering edit mode

enter image description here each of my columns has headers. here is a sample of how my data looks like:

edit: apologies. image attached.

ADD REPLY • link 7.9 years ago by aj123 ▴ 120

0

Entering edit mode

What is your starting point? You performed counting using htseq-count or similar?

ADD REPLY • link 7.9 years ago by WouterDeCoster 47k

0

Entering edit mode

these are PCR Ct values

ADD REPLY • link 7.9 years ago by aj123 ▴ 120

1

Entering edit mode

I was under the impression that you had microarray intensities or RNA-seq counts. I'm not sure that qPCR Cts is a datatype valid for WGCNA and would recommend contacting the WGCNA authors or use the Bioconductor forum.

ADD REPLY • link 7.9 years ago by WouterDeCoster 47k

0

Entering edit mode

Since all fields are just following each other it's not clear to me how your data looks like. Formatting is important.

ADD REPLY • link 7.9 years ago by WouterDeCoster 47k

0

Entering edit mode

once you read the data in to an R object, can you post the output of dim(object) ? to see if the data has loaded properly ? It should not be a problem what values are they as long as they are numeric. It should work.

ADD REPLY • link 7.9 years ago by GouthamAtla 12k

1

Entering edit mode

Random comments? He addresses exactly the issue you are facing.

ADD REPLY • link 7.9 years ago by WouterDeCoster 47k

score 0 · Answer 1 · 2016-05-16

0

Entering edit mode

7.9 years ago

liangjiao.xue ▴ 100

Yes. In this file, too few samples are included. No gene pass the QC filtering.

ADD COMMENT • link 7.9 years ago by liangjiao.xue ▴ 100

score 0 · Answer 2 · 2016-08-03

As per your comments and question I think you have your expression data in the wrong format, you need to have it in the following format:

        gene1   gene2   gene3   gene4   gene5   gene6   gene.
sample1 
sample2
sample3
sample4
sample5
sample6
sample7
sample.

If you have different conditions that should go on a different object (as trait condition)