Question: How should I analyze the following dataset (GSE150847)?
gravatar for arjuncsubra
7 weeks ago by
arjuncsubra0 wrote:

Hi all, new to Bioconductor/scRNA-seq analyses here.

For an upcoming project, I have chosen to analyze this dataset, "Generation of a Broadly Useful Model for COVID-19 Pathogenesis, Vaccination, and Treatment".

I have preprocessed and loaded the .tsv file (from supplemental files in GEO) into a SingleCellExperiment. My understanding is that the row names are the individual mouse genes and the column names are the six samples that were taken.

Here is how my SingleCellExperiment looks in R:

> #what my sce is called
class: SingleCellExperiment 
dim: 55339 6 
assays(1): counts
rownames(55339): 0610005C13Rik 0610006L08Rik ... n-TStga1 SARS-CoV-2
rowData names(0):
colnames(6): Ad5.Empty.rep1 Ad5.Empty.rep2 ... Ad.hACE2.rep2 Ad.hACE2.rep3
colData names(0):

What is your advice going forward/what should my end product be?

Furthermore, can I create a clustering map with this type of data/how applicable will that be with this dataset (individual cells do not seem to be in the columns, as there are only 6)?

rna-seq bioconductor R • 141 views
ADD COMMENTlink written 7 weeks ago by arjuncsubra0
gravatar for RamRS
7 weeks ago by
Baylor College of Medicine, Houston, TX
RamRS30k wrote:

Are you asking us what you can do with a publicly available dataset, or do you have a specific hypothesis/scientific question in mind?

ADD COMMENTlink written 7 weeks ago by RamRS30k

I do not really have a particular hypothesis in mind; I would like to know what end product makes the most sense given this dataset and data type.

ADD REPLYlink written 7 weeks ago by arjuncsubra0

That is not really a way to approach a project - you should have some idea of what you're looking for. Without that, the dataset is just a bunch of random data points. Why did you pick this dataset in particular? What about it appeals to you?

ADD REPLYlink modified 7 weeks ago • written 7 weeks ago by RamRS30k

I picked this dataset because I want to do some clustering related to COVID-19 and its effect on the lungs.

ADD REPLYlink written 7 weeks ago by arjuncsubra0

What can you tell me about this dataset, the attributes it contains and how it relates to Covid-19 and lungs? What does this dataset have that other Covid-19 datasets don't? What was the process by which you arrived to this dataset?

ADD REPLYlink written 7 weeks ago by RamRS30k

To my understanding (I am really new to this), this dataset seems to have mouse genes in the rows and 6 columns which seem to be the test subjects. This dataset seems to involve mice that were gene-edited so they could become susceptible to the effects of COVID-19 respiratorily. I was just looking for COVID-19 datasets on GEO, that is how I found it.

I am still wondering what the practical possibilities are when it comes to graphing this data set (does this mean clustering, etc.).

ADD REPLYlink written 7 weeks ago by arjuncsubra0

Just some training, I presume? You could try to follow the methods that my colleague and I used here:

We never published that. Publishing already-published data in this way is difficult.

ADD REPLYlink written 6 weeks ago by Kevin Blighe65k

Yes - it was for training, thank you!

ADD REPLYlink written 6 weeks ago by arjuncsubra0
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1618 users visited in the last hour