Moderator: Brian Bushnell

gravatar for Brian Bushnell
Reputation:
17,380
Status:
Trusted
Location:
Walnut Creek, USA
Website:
http://sourceforge.net...
Last seen:
1 month ago
Joined:
6 years, 2 months ago
Email:
b********@lbl.gov

I like to program in Java.

Posts by Brian Bushnell

<prev • 1,931 results • page 1 of 194 • next >
3
votes
2
answers
180
views
2
answers
Answer: C: BBMap tool error - randomreads.sh
... Hi! I'm sorry about this, but BBMap (and thus RandomReads, which uses the same index) does not support chromosomes over 500Mbp in length. Wheat is the only genome I know of that has such a long chromosome. If you break it up (e.g. with shred.sh shredding into lengths of under 500Mbp max) then it ...
written 4 weeks ago by Brian Bushnell17k
0
votes
1
answer
16k
views
1
answers
Comment: C: Introducing Clumpify: Create 30% Smaller, Faster Gzipped Fastq Files. And remov
... Optical duplicates can be detected by adding the "optical" flag to Clumpify if the Illumina reads still have original headers, though the exact settings depend on the platform. PCR duplicates cannot be detected and differentiated from coincidental duplicates with high confidence in RNA-seq librarie ...
written 7 months ago by Brian Bushnell17k
4
votes
3
answers
833
views
3
answers
Answer: A: Current challenges in SARS-CoV-2 analysis?
... There are many challenges here. @[gb][1], it was not wrong of you to mention regulations; those are one of the biggest challenges with Covid-19 bioinformatics. Due to privacy regulations, people change their experimental design and processing pipelines, and restrict data-sharing, to reduce the ris ...
written 9 months ago by Brian Bushnell17k
0
votes
0
answers
6.5k
views
0
answers
Comment: C: BBSketch - A Tool for Rapid Sequence Comparison
... Yes, catting would work; in addition to catting the files, you can also stream them to stdin: cat *.fasta | sketch.sh in=stdin.fa onesketch out=xyz.sketch That said, I'm not sure why you would want to make one sketch for thousands of fasta files. If you want one sketch per input file, with al ...
written 12 months ago by Brian Bushnell17k • updated 12 months ago by GenoMax94k
0
votes
1
answer
928
views
1
answers
Comment: C: viral genome assembly
... Tadpole seems to do a better job of assembling viruses than Spades. I won't guarantee that, but it seems to be generally true. Viruses and hosts can share sequence. If you remove all sequences shared between the virus and host, it's likely that you will incur holes in your assembly, if you are tr ...
written 13 months ago by Brian Bushnell17k
4
votes
2
answers
1.0k
views
2
answers
Answer: A: rRNA decontamination of RNA-Seq reads (tool choice, introns etc)
... BBDuk is a filter. At JGI we use it to remove reads with 31-mers corresponding to common ribosomal 31-mers, because some downstream tools like Trinity fail when there is an overload of rRNA reads. Running BBDuk with common ribosomal kmers is good practice, but not ideal; the sensitivity and specif ...
written 13 months ago by Brian Bushnell17k
0
votes
1
answer
1.5k
views
1
answers
Comment: C: BBTools/BBMap/BBSuite's sketch.sh and sendsketch.sh are inconsistent
... I just revised all of the fetch scripts in 38.61b which I released today. You should be able to do this: Run fetchTaxonomy.sh Then modify fetchNt.sh to change the line TAXPATH="auto" to point to wherever you downloaded the taxonomy data. Then run fetchNt.sh. That will give you nt which is not as ...
written 17 months ago by Brian Bushnell17k
0
votes
1
answer
1.5k
views
1
answers
Comment: C: BBTools/BBMap/BBSuite's sketch.sh and sendsketch.sh are inconsistent
... OK, all servers are back up again! RefSeq is the default server. You can specify: refseq, protein, nt, or ribo (aka silva). It will only connect to one server each time you run it. ...
written 17 months ago by Brian Bushnell17k
0
votes
1
answer
1.5k
views
1
answers
Answer: C: BBTools/BBMap/BBSuite's sketch.sh and sendsketch.sh are inconsistent
... The important thing here is: Query: scaffolds.drop_bacteria.fasta DB: RefSeq **SketchLen: 52788 Seqs: 738 Bases: 82339741** In every case it used all 738 sequences and 82Mbp, and made one sketch, though the sketch size was different in the two cases since the RefSeq server uses larger-tha ...
written 17 months ago by Brian Bushnell17k • updated 17 months ago by GenoMax94k
1
vote
0
answers
6.5k
views
0
answers
Comment: C: BBSketch - A Tool for Rapid Sequence Comparison
... BBSketch servers are hosted on NERSC, which is down for ~5 days starting at 8am PST this morning for electrical work. Unfortunately, that means that they are also down. In fact, everything at JGI is down until Monday or Tuesday night. ...
written 17 months ago by Brian Bushnell17k

Latest awards to Brian Bushnell

Teacher 7 months ago, created an answer with at least 3 up-votes. For A: Local alignment linear space
Teacher 7 months ago, created an answer with at least 3 up-votes. For A: Aligner for super-short sequences
Teacher 7 months ago, created an answer with at least 3 up-votes. For A: Is it possible to convert fastq format to PacBio bax.h5 file?
Teacher 7 months ago, created an answer with at least 3 up-votes. For A: why do people filter common variants such as dbSNP and is it right thing to do?
Teacher 7 months ago, created an answer with at least 3 up-votes. For A: How to add 'placeholder' quality scores to FASTA files (for converting FASTA wit
Teacher 7 months ago, created an answer with at least 3 up-votes. For A: bwa mem multiple alignments doesn't work
Teacher 7 months ago, created an answer with at least 3 up-votes. For A: [samopen] SAM header is present: xxx sequences.
Teacher 7 months ago, created an answer with at least 3 up-votes. For A: exome sequencing, target capture
Teacher 7 months ago, created an answer with at least 3 up-votes. For A: How to calculate genome side????
Commentator 7 months ago, created a comment with at least 3 up-votes. For C: Benchmark - Comparison of Different NGS Mappers
Scholar 7 months ago, created an answer that has been accepted. For A: How to download human whole genome database?
Good Answer 7 months ago, created an answer that was upvoted at least 5 times. For A: Denovo assembly SPAdes ERROR K-mer Counting: too many k-mers to fit into availab
Great Question 7 months ago, created a question with more than 5,000 views. For BBSketch - A Tool for Rapid Sequence Comparison
Great Question 7 months ago, created a question with more than 5,000 views. For Introducing FilterByTile: Remove Low-Quality Reads Without Adding Bias
Appreciated 7 months ago, created a post with more than 5 votes. For A: What does these characters mean ?!!
Appreciated 7 months ago, created a post with more than 5 votes. For A: Filter Bacterial sequences from Metagenomics
Teacher 9 months ago, created an answer with at least 3 up-votes. For A: How to calculate genome side????
Good Answer 9 months ago, created an answer that was upvoted at least 5 times. For A: Denovo assembly SPAdes ERROR K-mer Counting: too many k-mers to fit into availab
Appreciated 9 months ago, created a post with more than 5 votes. For A: Filter Bacterial sequences from Metagenomics
Teacher 9 months ago, created an answer with at least 3 up-votes. For A: How to calculate genome side????
Teacher 12 months ago, created an answer with at least 3 up-votes. For A: [samopen] SAM header is present: xxx sequences.
Teacher 12 months ago, created an answer with at least 3 up-votes. For A: exome sequencing, target capture
Teacher 12 months ago, created an answer with at least 3 up-votes. For A: How to calculate genome side????
Scholar 12 months ago, created an answer that has been accepted. For A: How to download human whole genome database?

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2104 users visited in the last hour