Moderator: David W
David W ♦ 4.8k
- Reputation:
- 4,780
- Status:
- Trusted
- Location:
- New Zealand
- Website:
- http://www.theatavism....
- Last seen:
- 1 year, 1 month ago
- Joined:
- 10 years, 8 months ago
- Email:
- d***********@gmail.com
Posts by David W
<prev
• 326 results •
page 1 of 33 •
next >
3
votes
2
answers
9.5k
views
2
answers
Comment:
C: Watterson's estimator "theta"
... Maybe read this, then ask a more specific question if you are still unclear? https://en.wikipedia.org/wiki/Watterson_estimator
...
written 5.6 years ago by
David W ♦ 4.8k
0
votes
3
answers
2.2k
views
3
answers
... I don't know if it can be used in the particular case you are taking about, but genome query tools (https://github.com/ryanlayer/gqt) has been written specifically to handle very large genotype files and with low memory footprint and speedy queries.
EDIT
Some of gqt's tricks are specifically for ...
written 5.6 years ago by
David W ♦ 4.8k
1
vote
3
answers
2.2k
views
3
answers
... I recently ran a similar task to the OPs with the mask saved as BED files. First time I ran it'd forgotten to tabix the files: ~7hrs to run. Once I'd tabixed them ~ 1 minute :)
...
written 5.6 years ago by
David W ♦ 4.8k
3
votes
2
answers
4.2k
views
2
answers
... This seems to be a problem with the Bioproject database not having a DTD:
http://lists.open-bio.org/pipermail/biopython/2015-May/015632.html
That thread has a work-around, but you might also want to try the development version of Biopython (https://github.com/biopython/biopython)to see if the new ...
written 5.6 years ago by
David W ♦ 4.8k
1
vote
1
answer
2.6k
views
1
answers
... bedtools intersect -a a.bed -b b.bed > intersect_ab.bed
The '>’ bit redirects the output into a (newly created/overwritten) file instead of printing to screen. Welcome to unix-like commands, once you 'get' this and piping you'll find you can do a lot from the command line :-)
...
written 5.7 years ago by
David W ♦ 4.8k
2
votes
3
answers
2.3k
views
3
answers
... If you use the link EUtil with dbfrom="protein", db="pubmed" you'll get a list of pmids associated with that protein.
You can then use esummary or efetch on those pmids.
...
written 5.7 years ago by
David W ♦ 4.8k
1
vote
1
answer
2.6k
views
1
answers
... Short "answer": to avoid overfitting: http://en.m.wikipedia.org/wiki/Overfitting ...
written 5.7 years ago by
David W ♦ 4.8k
0
votes
3
answers
1.1k
views
3
answers
... Biomart (linked to by Sean above) is pretty much your first port of call for all id-mapping problems.
In this case, ensembl also has a REST endpoint for gathering cross refs:
http://rest.ensembl.org/xrefs/id/ENST00000288602?content-type=application/json;
...
written 5.7 years ago by
David W ♦ 4.8k
4
votes
4
answers
1.6k
views
4
answers
... For anyone else interested in the different performance of these solutions, I did a crude comparison.
The TLDR would be: the "grow a for loop" approach works for short vectors, but scales terribly. The rbind method, as unintuitive as it might be, is the fastest of the lot
https://gist.github.c ...
written 5.7 years ago by
David W ♦ 4.8k
1
vote
4
answers
1.6k
views
4
answers
... This should get the job done, and for shorter vectors won't take too long. But be aware, starting an empty vector and growing it like this, though sensible in many languages, is usually very slow in r.
I've not tested it, but as.character(rbind(a,b)) should work, and starting a vector of the final ...
written 5.7 years ago by
David W ♦ 4.8k
Latest awards to David W
Good Answer
4.5 years ago,
created an answer that was upvoted at least 5 times.
For A: How Can I Compare Rates Of Evolution For Two Sets Of Genes?
Appreciated
4.5 years ago,
created a post with more than 5 votes.
For A: How Can I Compare Rates Of Evolution For Two Sets Of Genes?
Appreciated
4.5 years ago,
created a post with more than 5 votes.
For A: How Do I Quantify Divergence Between A Given Set Of Species?
Good Answer
4.5 years ago,
created an answer that was upvoted at least 5 times.
For A: How Do You Extract The Query Title From A Blast Xml Output Using Biopython
Good Answer
4.5 years ago,
created an answer that was upvoted at least 5 times.
For A: What Is A Good Measure Of Sequence Divergence?
Popular Question
4.5 years ago,
created a question with more than 1,000 views.
For Postdoc: Genomics and bioinformatics of mutation, Cartwright Lab, Arizona State University, USA
Scholar
4.5 years ago,
created an answer that has been accepted.
For A: How to know if the third codon position is evolving too fast
Appreciated
4.5 years ago,
created a post with more than 5 votes.
For C: Looking For A Taxonomy With Species And Breeds
Commentator
4.5 years ago,
created a comment with at least 3 up-votes.
For C: Looking For A Taxonomy With Species And Breeds
Appreciated
4.5 years ago,
created a post with more than 5 votes.
For A: How Do You Extract The Query Title From A Blast Xml Output Using Biopython
Teacher
4.5 years ago,
created an answer with at least 3 up-votes.
For A: How Can I Compare Rates Of Evolution For Two Sets Of Genes?
Teacher
4.5 years ago,
created an answer with at least 3 up-votes.
For A: How Do You Extract The Query Title From A Blast Xml Output Using Biopython
Teacher
4.5 years ago,
created an answer with at least 3 up-votes.
For A: How To Simulate The Null Distribution For Tajima'S D?
Teacher
4.5 years ago,
created an answer with at least 3 up-votes.
For A: Converting Genbank To Fasta In Protein Form
Teacher
4.5 years ago,
created an answer with at least 3 up-votes.
For A: How Do Biologists Use Inferred Phylogenies?
Teacher
4.5 years ago,
created an answer with at least 3 up-votes.
For A: Phylip Distance Matrix Question
Teacher
4.5 years ago,
created an answer with at least 3 up-votes.
For A: In Mrbayes, What Does It Mean For A Node To Have More Than Two Descendants?
Popular Question
4.8 years ago,
created a question with more than 1,000 views.
For Postdoc: Genomics and bioinformatics of mutation, Cartwright Lab, Arizona State University, USA
Good Answer
4.8 years ago,
created an answer that was upvoted at least 5 times.
For A: How Can I Compare Rates Of Evolution For Two Sets Of Genes?
Use of this site constitutes acceptance of our User
Agreement
and Privacy
Policy.
Powered by Biostar
version 2.3.0
Traffic: 2165 users visited in the last hour