Question: read count normalization
0
gravatar for Liftedkris
10 months ago by
Liftedkris10
Germany
Liftedkris10 wrote:

hi experts

i am very new in R. i am trying to normalize my read counts generated from HTseq using R. first of all i have up to 28 samples. i am working on console. now i created a metadata containing all the experimental variables using the command

> group <- factor(c(rep("D1T0", 3), rep("D4T0", 3), rep("D5T0", 3), rep("D6T0", 3), rep("D1T30", 3), rep("D4T30", 3), rep("D5T30", 3), rep("D6T30", 3), rep("D1T90", 3), rep("D4T90", 3), rep("D5T90", 3), rep("D6T90", 3), rep("D1T180", 3), rep("D2T180", 3), rep("D4T180", 3), rep("D6T180", 3), rep("D1C30", 3), rep("D4C30", 3), rep("D5C30", 3), rep("D6C30", 3), rep("D1C90", 3), rep("D4C90", 3), rep("D5C90", 3), rep("D6C90", 3), rep("D1C180", 3), rep("D4C180", 3), rep("D5C180", 3), rep("D6C180", 3)))

next i tried to combine count files into a DGEList in R using edgeR by running the following command

> counts.host <- readDGE(list.files(pattern = ".count"), data, columns = c(1,2))

but it gives the following error message:

Error in file.path(path, fn) : 
  cannot coerce type 'closure' to vector of type 'character'

please i need help on how to resolve this error.

thanks

liftedkris

rna-seq • 428 views
ADD COMMENTlink modified 10 months ago • written 10 months ago by Liftedkris10
1

You already posted earlier today: read count normalization

Please follow-up on my answer. It is the least that one could do.

ADD REPLYlink written 10 months ago by Kevin Blighe53k

thanks for the response. i have been trying to add the read counts to gallaxy but it keeps saying: "no tabular dataset available" and the column is not even clickable... i do not know exactly what i am doing wrong

ADD REPLYlink written 10 months ago by Liftedkris10

Hey, cool, there is a help site for Galaxy: https://help.galaxyproject.org/

For the current issue here with EdgeR, what is the output of list.files(pattern = ".count")? To what does data relate?

ADD REPLYlink written 10 months ago by Kevin Blighe53k

yes the list.file(pattern = ".count"). the data is related to human neutrophils infected with aspargillus

ADD REPLYlink written 10 months ago by Liftedkris10

That is not what I asked. When you execute just list.files(pattern = ".count") at the command prompt in R, what is the output to your screen?

What is contained in your object called data? - type head(data)

ADD REPLYlink written 10 months ago by Kevin Blighe53k
> list.files(pattern = ".count")
 [1] "D1C180_Host_infected_rep1.sorted_name.count"
 [2] "D1C30_Host_infected_rep1.sorted_name.count" 
 [3] "D1C90_Host_infected_rep1.sorted_name.count" 
 [4] "D1T0_Host_infected_rep1.sorted_name.count"  
 [5] "D1T180_Host_infected_rep1.sorted_name.count"
 [6] "D1T30_Host_infected_rep1.sorted_name.count" 
 [7] "D1T90_Host_infected_rep1.sorted_name.count" 
 [8] "D2T180_Host_infected_rep1.sorted_name.count"
 [9] "D4C180_Host_infected_rep1.sorted_name.count"
[10] "D4C30_Host_infected_rep1.sorted_name.count" 
[11] "D4C90_Host_infected_rep1.sorted_name.count" 
[12] "D4T0_Host_infected_rep1.sorted_name.count"  
[13] "D4T180_Host_infected_rep1.sorted_name.count"
[14] "D4T30_Host_infected_rep1.sorted_name.count" 
[15] "D4T90_Host_infected_rep1.sorted_name.count" 
[16] "D5C180_Host_infected_rep1.sorted_name.count"
[17] "D5C30_Host_infected_rep1.sorted_name.count" 
[18] "D5C90_Host_infected_rep1.sorted_name.count" 
[19] "D5T0_Host_infected_rep1.sorted_name.count"  
[20] "D5T30_Host_infected_rep1.sorted_name.count" 
[21] "D5T90_Host_infected_rep1.sorted_name.count" 
[22] "D6C180_Host_infected_rep1.sorted_name.count"
[23] "D6C30_Host_infected_rep1.sorted_name.count" 
[24] "D6C90_Host_infected_rep1.sorted_name.count" 
[25] "D6T0_Host_infected_rep1.sorted_name.count"  
[26] "D6T180_Host_infected_rep1.sorted_name.count"
[27] "D6T30_Host_infected_rep1.sorted_name.count" 
[28] "D6T90_Host_infected_rep1.sorted_name.count"
ADD REPLYlink modified 10 months ago by Benn7.9k • written 10 months ago by Liftedkris10

Cool, and, to what does data refer? - a directory in your current working directory? Current working directory can be seen with the command getwd().

Please take a look here to see if the answers help: https://support.bioconductor.org/p/83881/#83885

ADD REPLYlink written 10 months ago by Kevin Blighe53k

galaxy is giving me this error:

Fatal error: Exit code 1 () Error: Sample IDs in factors file and count matrix don't match

ADD REPLYlink written 10 months ago by Liftedkris10

For Galaxy-related issues, you can post here: https://help.galaxyproject.org/

ADD REPLYlink written 10 months ago by Kevin Blighe53k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2069 users visited in the last hour