User: jkbonfield

gravatar for jkbonfield
jkbonfield430
Reputation:
430
Status:
Trusted
Location:
Last seen:
5 days, 14 hours ago
Joined:
3 years, 3 months ago
Email:
j*********@gmail.com

Posts by jkbonfield

<prev • 30 results • page 1 of 3 • next >
1
vote
1
answer
167
views
1
answers
Answer: A: Read name not stored in CRAM file. Is it possible to create new read names for r
... See my reply to your question on github too. However I suspect this is a processing error somewhere else which has "baked in" the pregenerated names at some point, and then a merge or something has caused the collision. CRAM *does* track read-pairs even when the names are removed. However it's po ...
written 6 days ago by jkbonfield430
2
votes
1
answer
370
views
1
answers
Answer: A: What is the difference of output samtools depth and samtools view -c on location
... Samtools depth is using the mpileup algorithm to find overlapping data, along with all the nuances that involves. That means filtering by flags (unmapped data, secondary reads, duplicates, QC failure), limits of maximum depth, possibly some other things like removal of overlapping templates (I can ...
written 8 months ago by jkbonfield430
1
vote
1
answer
301
views
1
answers
Comment: C: file size after sorting the BAM file using samtools
... The answer about sizes has already been given so I won't repeat it. However in answer to part 2, we locally use Biobambam's bamseqchksum tool to validate that a file operation hasn't lost data in the process, or that it's lost only the bits we know will be lost. For example it can compute checksum ...
written 8 months ago by jkbonfield430
0
votes
0
answers
265
views
0
answers
Comment: C: Collapsing BAM based on seq and positions
... This sounds much like the ReducedReads format from early GATK versions. Ultimately it was retired because it wasn't sufficient to capture all the important information, but it may still be available if you can find an old enough GATK (2.8?). ...
written 9 months ago by jkbonfield430
0
votes
4
answers
1.0k
views
4
answers
Comment: C: BAM files compression
... I did the maths on how long it takes to recover AWS CPU costs (based on a spot price some arbitrary time ago) in the reduction of AWS standard S3 disk charges for a BAM to CRAM conversion. At that point it happened to be around 1 day! Obviously longer for cheaper storage tiers. I didn't do the r ...
written 9 months ago by jkbonfield430 • updated 9 months ago by RamRS30k
2
votes
4
answers
1.0k
views
4
answers
Answer: C: BAM files compression
... CRAM generation is actually faster than BAM generation in samtools, at least at the default compression levels. CRAM decoding is slower than BAM though unless you're I/O bound, in which case CRAM will be faster due to being smaller. See https://github.com/samtools/www.htslib.org/pull/23/commits/6a ...
written 9 months ago by jkbonfield430
7
votes
1
answer
951
views
1
answers
Answer: A: What is the difference between mpileup samtools and bcftools?
... `Bcftools mpileup` should be used instead of `samtools mpileup` for variant calling. That is, the VCF / BCF output mode of mpileup is better in bcftools. `Samtools mpileup` however has two different formats with the default always being a simple columnar format showing chr, pos, reference, depth, ...
written 9 months ago by jkbonfield430
0
votes
1
answer
428
views
1
answers
Comment: C: Aligning, Sorting and Converting to bam at the same command - possible?
... If you think you'll be doing markdup at some point then you may also want to add a "samtools fixmate -m" in there after the bowtie command as this way it doesn't require an additional sort later on. Also when piping it's often best to pipe uncompressed BAM. Some samtools commands have a "-u" optio ...
written 10 months ago by jkbonfield430
4
votes
2
answers
1.6k
views
2
answers
Answer: A: How does samtools markdup works?
... Samtools markdup is written to match Picard 2.10.3 (also Biobambam's bamstreamingmarkduplicates) so if you can find documentation on those then it should also apply to Samtools. It may seem like a complex dance to have both name and position sorted requirements, but this is perhaps due to a traditi ...
written 10 months ago by jkbonfield430
0
votes
1
answer
252
views
1
answers
Comment: C: Extract all BAM reads that intersect a given region using the BAI index
... Not an answer as I haven't implemented this myself so don't know all the ins and outs. However basically the BAI index maps ranges (produced as "bins") to file offsets. Given the R-Tree isn't binary, a single bin may have multiple start/stop points for data within it, hence the linear index too. ...
written 10 months ago by jkbonfield430

Latest awards to jkbonfield

Good Answer 4 weeks ago, created an answer that was upvoted at least 5 times. For A: Is it possible to directly convert fastq to CRAM ?
Teacher 6 months ago, created an answer with at least 3 up-votes. For A: Recovering bam files after unknow deletion in the storage
Scholar 9 months ago, created an answer that has been accepted. For A: How does samtools markdup works?
Appreciated 9 months ago, created a post with more than 5 votes. For A: What is the difference between mpileup samtools and bcftools?
Scholar 9 months ago, created an answer that has been accepted. For A: How does samtools markdup works?
Teacher 9 months ago, created an answer with at least 3 up-votes. For A: Recovering bam files after unknow deletion in the storage
Scholar 10 months ago, created an answer that has been accepted. For A: How does samtools markdup works?
Teacher 10 months ago, created an answer with at least 3 up-votes. For A: Recovering bam files after unknow deletion in the storage
Appreciated 16 months ago, created a post with more than 5 votes. For A: Is it possible to directly convert fastq to CRAM ?
Good Answer 16 months ago, created an answer that was upvoted at least 5 times. For A: Is it possible to directly convert fastq to CRAM ?
Teacher 16 months ago, created an answer with at least 3 up-votes. For A: Recovering bam files after unknow deletion in the storage
Teacher 16 months ago, created an answer with at least 3 up-votes. For A: what should a SAM/BAM record contain when there are no quality scores
Teacher 17 months ago, created an answer with at least 3 up-votes. For A: Recovering bam files after unknow deletion in the storage

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1620 users visited in the last hour