Moderator: John

gravatar for John
John12k
Reputation:
12,030
Status:
Trusted
Location:
Germany
Website:
http://at.cg/
Last seen:
2 months, 2 weeks ago
Joined:
7 years ago
Email:
m***********@gmail.com

Mistakes will be made...

Posts by John

<prev • 1,146 results • page 1 of 115 • next >
0
votes
2
answers
1.4k
views
2
answers
Comment: C: BQSR: when it is applicable?
... 1. is difficult to answer, but essentially yes. It's not quite so straight forward, because first the tool builds rules for the entire data, and nothing based on genomic mapping location. It will not see a mismatch on chr1:5000 100 times and say "the quality scores of chr1:5000 need to be reduced", ...
written 12 months ago by John12k
1
vote
2
answers
1.4k
views
2
answers
Comment: C: BQSR: when it is applicable?
... Sure, um, well it's to do with how BQSR works. Let's start at the beginning. Base quality scores are the sequencing machines guess at how likely it is to be wrong when base calling. It is important to realise that it's not how likely the machine is to be wrong, but a *guess* at how likely the machin ...
written 12 months ago by John12k
0
votes
2
answers
1.4k
views
2
answers
Comment: C: BQSR: when it is applicable?
... Eek, you're right, i pulled the 13m number from the [NCBI's FAQ][1], but it hasn't been updated since 2008 -_-; Regarding PHRED scores, you're right, this is on the assumption Illumina is producing probabilities of error that match reality. I've always known their process as having a 1:1000 error r ...
written 12 months ago by John12k
0
votes
1
answer
617
views
1
answers
Comment: C: How to tell if a FastQ file is a concatinate of 2 seperate illumina runs?
... Hahah, hey man :) ...
written 12 months ago by John12k
0
votes
1
answer
617
views
1
answers
Comment: C: How to tell if a FastQ file is a concatinate of 2 seperate illumina runs?
... Can you run "`head my.fastq`" and "`tail my.fastq`" on your file and paste the results here? One should be able to make a fair guess based on that. Strictly speaking however, it's not possible to be able to determine this in all situations. FASTQ is a terrible file format for metadata. ...
written 12 months ago by John12k
1
vote
2
answers
1.4k
views
2
answers
Comment: C: BQSR: when it is applicable?
... I think the probability of that would be tiny. You'd have to essentially have more variations in the genome of the individual than the sequencing technology. But i can see where you're coming from - take for example [this quote][1] from the NIH about what SNPs are: > SNPs occur normally througho ...
written 12 months ago by John12k
0
votes
5
answers
662
views
5
answers
Answer: A: Is there any disease mainly caused by underexpression or overexpression of some
... Type-2 Diabetes is a neato example of something that's a combination of both. ...
written 13 months ago by John12k
2
votes
2
answers
3.8k
views
2
answers
Answer: A: Meaning of BWA-MEM MAPQ values
... MAPQ scores are not meaningful because BAM is not meaningful - or rather, the field has yet to define the difference between read-alignments (what BAM officially stores) and fragment-alignments (what most aligners produce). The issue goes far deeper than MAPQ scores, but if you want to read about M ...
written 13 months ago by John12k
0
votes
4
answers
2.6k
views
4
answers
Comment: C: Amzon EC2 for bioinformatics, genomics, NGS analysis
... The vast majority of what AWS offers is unrelated to bioinformatic analysis. I give lot of examples in the first two paragraphs of the above, like DNS, SSL, etc. These are features that businesses and SMCs will pay a premium for, because setting that stuff up on your own is a real hassle. I was also ...
written 15 months ago by John12k
0
votes
5
answers
1.9k
views
5
answers
Comment: C: Declining quality of biostars
... I think you'd need to really dramatically change the way threads are ordered on the main page, not ordered by time posted or time since last activity, but by using an algorithm much like Facebook's to push to the top of the page threads the site wishes users to see - perhaps because they are logged ...
written 16 months ago by John12k

Latest awards to John

Commentator 12 months ago, created a comment with at least 3 up-votes. For C: Cost of computing
Commentator 12 months ago, created a comment with at least 3 up-votes. For C: Create directed acyclic graph of enriched GO terms
Commentator 12 months ago, created a comment with at least 3 up-votes. For C: I have to learn another language, but which one?
Popular Question 12 months ago, created a question with more than 1,000 views. For Create your own VPN to access work resources from home
Popular Question 12 months ago, created a question with more than 1,000 views. For GC bias correction deepTools2
Popular Question 12 months ago, created a question with more than 1,000 views. For DNA composition - all k-mers and their frequency in some sequencing data
Popular Question 12 months ago, created a question with more than 1,000 views. For Antisense transcription - how to detect it?
Popular Question 12 months ago, created a question with more than 1,000 views. For In-place writable BAM files
Popular Question 12 months ago, created a question with more than 1,000 views. For How many ways a read pair can be mapped?
Popular Question 12 months ago, created a question with more than 1,000 views. For Pysam under pypy?
Popular Question 12 months ago, created a question with more than 1,000 views. For Poll: Does your filesystem support xattr?
Popular Question 12 months ago, created a question with more than 1,000 views. For Signal Distribution Charts
Popular Question 12 months ago, created a question with more than 1,000 views. For log / log.bio - keeping track of command line workflows
Popular Question 12 months ago, created a question with more than 1,000 views. For Job Manager to parallelize otherwise consecutive bash scripts..?
Popular Question 12 months ago, created a question with more than 1,000 views. For Check BAMs are complete and clean
Popular Question 12 months ago, created a question with more than 1,000 views. For Is there an authoritative source for optional BAM tags?
Popular Question 12 months ago, created a question with more than 1,000 views. For Thoughts on Bioinformatics and programming
Popular Question 12 months ago, created a question with more than 1,000 views. For Bioinformatic PhD Thesis
Good Answer 12 months ago, created an answer that was upvoted at least 5 times. For A: samtools sorting and indexing
Good Question 12 months ago, asked a question that was upvoted at least 5 times. For DNA composition - all k-mers and their frequency in some sequencing data
Great Question 12 months ago, created a question with more than 5,000 views. For Genomic Coverage - Samtool's undocumented "depth" verses the poorly documented pileup.
Great Question 12 months ago, created a question with more than 5,000 views. For pybam - 100% python BAM reader
Appreciated 12 months ago, created a post with more than 5 votes. For C: Cost of computing
Teacher 13 months ago, created an answer with at least 3 up-votes. For A: Why can't we downvote on this forum?

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1739 users visited in the last hour