Moderator: Obi Griffith

gravatar for Obi Griffith
Obi Griffith17k
Reputation:
16,650
Status:
Trusted
Location:
Washington University, St Louis, USA
Website:
http://www.obigriffith...
Twitter:
obigriffith
Scholar ID:
Google Scholar Page
Last seen:
1 day, 14 hours ago
Joined:
6 years, 8 months ago
Email:
o**********@gmail.com

I am an Assistant Professor of Medicine (Oncology) and Genetics at Washington University School of Medicine and Assistant Director at the McDonnell Genome Institute. I study cancer genomics, primarily through clinical statistics and bioinformatic analyses of next-generation sequence data such as whole genome (WGS), exome, and transcriptome (RNA-seq) data.

Posts by Obi Griffith

<prev • 420 results • page 1 of 42 • next >
6
votes
1
answer
174
views
1
answers
Answer: A: Paternity Testing from WGS Trio
... It is definitely *possible* to assess paternity from whole genome sequence (WGS) data. Paternity can probably be established with as little as a few dozen or maybe hundreds of well-chosen single nucleotide polymorphisms (SNPs). If you have decent WGS data you can expect to genotype millions of SNPs. ...
written 5 weeks ago by Obi Griffith17k
2
votes
1
answer
39k
views
1
answers
Comment: C: Cheat Sheet For One-Based Vs Zero-Based Coordinate Systems
... postimages.org has been having some issues with their hosting service (and business model for that matter). The domain name postimg.org was apparently locked by registry. They moved to postimg.cc. Once I updated the urls they returned. BioStars developers have been informed about this as I suspect i ...
written 5 months ago by Obi Griffith17k
0
votes
1
answer
39k
views
1
answers
Comment: C: Cheat Sheet For One-Based Vs Zero-Based Coordinate Systems
... Still there for me. This sounds like a BioStars or system-specific issue? ...
written 6 months ago by Obi Griffith17k
4
votes
9
answers
46k
views
9
answers
Comment: C: Database Of Tumor Suppressors And/Or Oncogenes
... I feel like, right now, the best answer to this question is the Cancer Gene Census. They currently provide a TSV download of their complete list of 567 genes with nearly all being indicated as oncogene and/or tumor suppressor (TSG). ...
written 14 months ago by Obi Griffith17k
0
votes
0
answers
22k
views
0
answers
Comment: C: Machine Learning For Cancer Classification - Part 1 - Preparing The Data Sets
... Please note that the trainset_gcrma.txt and testset_gcrma.txt can change over time if using different versions of the custom CDF annotations. The clindetails.txt files should be static. I don't know why that command doesn't work. It seems like there are regularly issues with GEO services failing or ...
written 18 months ago by Obi Griffith17k
0
votes
3
answers
9.7k
views
3
answers
Comment: C: Machine Learning For Cancer Classification - Part 3 - Predicting With A Random F
... This happens because over time the custom CDF files change so that hardcoded number to eliminate AFFX (control) probes becomes incorrect. It is then possible that some of these get included in the model. We added the alternative way to remove AFFX probes in some places but missed others and this lik ...
written 18 months ago by Obi Griffith17k
1
vote
3
answers
9.7k
views
3
answers
Comment: C: Machine Learning For Cancer Classification - Part 3 - Predicting With A Random F
... @debitboro - welcome to Biostars. The predict function will only use predictor variables that exist in the (previously trained) model that is provided. Running most recently, I get 1247 predictors after filtering the training data down from 12264 total predictors. The RF model is trained on, and onl ...
written 18 months ago by Obi Griffith17k
0
votes
0
answers
22k
views
0
answers
Comment: C: Machine Learning For Cancer Classification - Part 1 - Preparing The Data Sets
... All of the files can be found here: https://github.com/obigriffith/biostar-tutorials/tree/master/MachineLearning. The file you are specifically looking for could be downloaded here: https://raw.githubusercontent.com/obigriffith/biostar-tutorials/master/MachineLearning/trainset_clindetails.txt ...
written 18 months ago by Obi Griffith17k
0
votes
0
answers
22k
views
0
answers
Comment: C: Machine Learning For Cancer Classification - Part 1 - Preparing The Data Sets
... The general principles are the same but this tutorial (part 1) is really focused on preparing a datafile from microarray data. You will want to use an alternative approach to create a file that is analogous to datafile="trainset_gcrma.txt". This would be a simple matrix of gene (or transcript or oth ...
written 18 months ago by Obi Griffith17k
1
vote
2
answers
965
views
2
answers
Comment: C: Uneven coverage correlated with Alu sequences (and discordant read pairs) in NGS
... You are indeed correct that GC/AT content correlates with Alu elements. In general Alu elements seem to be correlated with more GC-rich regions, although it depends on the type/age of the Alu element (and is probably [more complicated][1] than that). Indeed the very top track in the IGV snapshot sho ...
written 20 months ago by Obi Griffith17k

Latest awards to Obi Griffith

Teacher 5 weeks ago, created an answer with at least 3 up-votes. For A: How Has Bioinformatics Improved Over Time?
Scholar 5 weeks ago, created an answer that has been accepted. For A: How to download DNA sample files in dbGap?
Good Answer 5 weeks ago, created an answer that was upvoted at least 5 times. For A: How Has Bioinformatics Improved Over Time?
Appreciated 5 weeks ago, created a post with more than 5 votes. For A: How Has Bioinformatics Improved Over Time?
Good Answer 6 months ago, created an answer that was upvoted at least 5 times. For A: How Has Bioinformatics Improved Over Time?
Teacher 9 months ago, created an answer with at least 3 up-votes. For A: How Has Bioinformatics Improved Over Time?
Commentator 9 months ago, created a comment with at least 3 up-votes. For C: Suggesting The Removal Of Post Closing
Popular Question 11 months ago, created a question with more than 1,000 views. For Bioinformatics Positions In Canada (Elsewhere)
Teacher 12 months ago, created an answer with at least 3 up-votes. For A: How Has Bioinformatics Improved Over Time?
Scholar 12 months ago, created an answer that has been accepted. For A: What read lengths are produced by modern Illumina sequencers?
Popular Question 13 months ago, created a question with more than 1,000 views. For Dgidb - Mining The Druggable Genome
Popular Question 13 months ago, created a question with more than 1,000 views. For Dgidb - Mining The Druggable Genome
Popular Question 14 months ago, created a question with more than 1,000 views. For Dgidb - Mining The Druggable Genome
Gold Standard 15 months ago, created a post with more than 25 bookmarks. For Analysing Microarray Data In Bioconductor
Great Question 16 months ago, created a question with more than 5,000 views. For Isaac Asimov Accurately Predicts Woes Of Bioinformatician
Good Answer 16 months ago, created an answer that was upvoted at least 5 times. For A: How Has Bioinformatics Improved Over Time?
Appreciated 17 months ago, created a post with more than 5 votes. For A: How Has Bioinformatics Improved Over Time?
Appreciated 19 months ago, created a post with more than 5 votes. For A: How Has Bioinformatics Improved Over Time?
Popular Question 19 months ago, created a question with more than 1,000 views. For Dgidb - Mining The Druggable Genome

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 772 users visited in the last hour