User: keith.hughitt
keith.hughitt • 280
- Reputation:
- 280
- Status:
- Trusted
- Location:
- United States
- Website:
- https://github.com/khu...
- Last seen:
- 2 weeks, 3 days ago
- Joined:
- 8 years, 2 months ago
- Email:
- k************@gmail.com
about me
Posts by keith.hughitt
<prev
• 16 results •
page 1 of 2 •
next >
1
vote
0
answers
424
views
0
answers
... Greetings!
I was wondering what people's thoughts are on performing a preranked GSEA type analysis on unsigned / one-sided gene statistics are?
[Most of the](https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0079217) [literature](https://bmcbioinformatics.biomedcentral.com/articles ...
written 14 months ago by
keith.hughitt • 280
• updated
14 months ago by
Biostar ♦♦ 20
7
votes
0
answers
1.5k
views
7 follow
0
answers
... Greetings,
I would like to share a small tool I developed during my graduate work over the past couple years. The software is called "[labnote][1]" and is a simple python-based tool for scanning a Unix-based directory structure and generating a HTML table of contents page for various research outpu ...
written 3.6 years ago by
keith.hughitt • 280
0
votes
0
answers
1.2k
views
0
answers
... I suspect you are seeing a batch effect. It is not uncommon for differences in batches to explain more of the variance in your data than the biological effects of interest. The fact that there are also differences in the total expression levels differentiating the two groups also supports that idea. ...
written 4.4 years ago by
keith.hughitt • 280
3
votes
1
answer
3.0k
views
1
answers
... From [The NCBI Handbook, ch18][1]: (via [this answer][2])
**NC** - "Complete genomic molecule, usually reference assembly"
**NZ** - "Unfinished WGS"
[1]: http://www.ncbi.nlm.nih.gov/books/NBK21091/table/ch18.T.refseq_accession_numbers_and_mole/
[2]: https://www.biostars.org/p/105529/ ...
written 4.4 years ago by
keith.hughitt • 280
0
votes
0
answers
1.6k
views
0
answers
Comment:
C: Error in SAMseq analysis with R
... Since the error appears to result from some non-numeric or non-finite values, first thing you might want to try is to check for those in your input counts, e.g.: `sapply(RNA_seq_DNAm, function(x) { sum(is.na(x)) })` for NAs. You can modify this to check for NaN's (`is.nan`) or Infs (`abs(x) == Inf`) ...
written 4.4 years ago by
keith.hughitt • 280
1
vote
2
answers
8.2k
views
2
answers
... The [Distance][1] library for Python also has an implementation of hamming distance and some other metrics which I've found useful in the past.
[1]: https://pypi.python.org/pypi/Distance/ ...
written 4.4 years ago by
keith.hughitt • 280
2
votes
1
answer
863
views
1
answers
... You might want to start by checking out the Bioconductor microarray analysis guide: https://www.bioconductor.org/help/workflows/arrays/ ...
written 4.4 years ago by
keith.hughitt • 280
0
votes
2
answers
7.0k
views
2
answers
... What is the structure of your data, e.g. #samples, #conditions, #replicates/condition? And are you attempting to find clusters of samples? Or genes? Depending on what you are interested in, using z-scores may not be necessary. ...
written 4.4 years ago by
keith.hughitt • 280
1
vote
0
answers
1.2k
views
0
answers
... Then WouterDeCoster is correct - experiment and cell type are confounded and you cannot reliably detect differences between them. This would only work if you had both cell types for both studies. ...
written 4.4 years ago by
keith.hughitt • 280
8
votes
1
answer
41k
views
1
answers
... You can use the `table` function in R to get the count of each duplicated gene.
For example, if the gene IDs are stored in a column `gene_id`, you could do:
> dat <- data.frame(gene_id=sample(1:3, 20, replace=TRUE), other_col='foo')
> table(dat$gene_id)
1 2 3
5 6 9
...
written 4.4 years ago by
keith.hughitt • 280
Latest awards to keith.hughitt
Popular Question
14 months ago,
created a question with more than 1,000 views.
For labnote: A light-weight HTML lab notebook generator
Appreciated
2.9 years ago,
created a post with more than 5 votes.
For A: Finding number of duplicates in R
Appreciated
3.0 years ago,
created a post with more than 5 votes.
For A: Finding number of duplicates in R
Voter
3.2 years ago,
voted more than 100 times.
Appreciated
3.5 years ago,
created a post with more than 5 votes.
For A: Finding number of duplicates in R
Good Answer
3.5 years ago,
created an answer that was upvoted at least 5 times.
For A: Finding number of duplicates in R
Teacher
4.4 years ago,
created an answer with at least 3 up-votes.
For A: Proper construction of data matrix for WGCNA (weighted gene coexpression network
Scholar
4.4 years ago,
created an answer that has been accepted.
For A: Proper construction of data matrix for WGCNA (weighted gene coexpression network
Teacher
4.4 years ago,
created an answer with at least 3 up-votes.
For A: Proper construction of data matrix for WGCNA (weighted gene coexpression network
Scholar
4.5 years ago,
created an answer that has been accepted.
For A: Proper construction of data matrix for WGCNA (weighted gene coexpression network
Teacher
4.5 years ago,
created an answer with at least 3 up-votes.
For A: Proper construction of data matrix for WGCNA (weighted gene coexpression network
Supporter
5.6 years ago,
voted at least 25 times.
Use of this site constitutes acceptance of our User
Agreement
and Privacy
Policy.
Powered by Biostar
version 2.3.0
Traffic: 1527 users visited in the last hour