User: jkbonfield

gravatar for jkbonfield
jkbonfield180
Reputation:
180
Status:
Trusted
Location:
Last seen:
1 day, 8 hours ago
Joined:
2 years, 2 months ago
Email:
j*********@gmail.com

Posts by jkbonfield

<prev • 15 results • page 1 of 2 • next >
0
votes
0
answers
91
views
0
answers
Comment: C: Merge bam files
... You should be using bcftools mpileup now as the samtools one is deprecated. Also note you can specify multiple files directly with mpileup and they'll be merged in-situ as it processes them so there is no need to merge them manually. Eg: bcftools mpileup -f reference.fa -o output.vcf file_1.b ...
written 24 days ago by jkbonfield180
0
votes
0
answers
818
views
0
answers
Comment: C: Generating consensus sequence from bam file
... "samtools mpileup foo.bam" will produce the columns showing what bases are aligned at each position. The reference column will be N, but that's irrelevant if you want the consensus (unless you're planning on using imputation to use reference in zero coverage regions, which can be useful in some sit ...
written 25 days ago by jkbonfield180
0
votes
0
answers
131
views
0
answers
Comment: C: Counting reads starting at a coordinate in bam file
... I'm unsure of precisely what you want, but chr:pos-pos returns all alignments that *overlap* pos and not all alignments *starting* at pos. Your title implies you wanted the latter. If this is the case, I don't think such a tool exists but you could use mpileup to get a single coordinate and then c ...
written 25 days ago by jkbonfield180
0
votes
0
answers
818
views
0
answers
Comment: C: Generating consensus sequence from bam file
... This seems like a very long-winded solution and doesn't apply if you don't have a reference to hand. However that said, if you don't then your aligned BAM was probably aligned against a de-novo assembly and somewhere there should be a consensus lurking around. I've generally done some trivial coun ...
written 25 days ago by jkbonfield180
1
vote
1
answer
128
views
1
answers
Comment: C: samtools flagstat calculation
... I don't understand your explanations: "various step dupemerge" could hide a multitude of sins! If you're using different pipelines and comparing the stats at the end then can see why flagstats would be different and I don't see why you'd assume samtools is the cause. Flagstat is a very trivial alg ...
written 28 days ago by jkbonfield180
1
vote
1
answer
94
views
1
answers
Comment: C: .bam mpileup mapping quality
... The bit after the `^` in the sequence column is the mapping quality. You're referring to the next column which contains base qualities. Note that that the mapping quality may have been modified by the BAQ algorithm (see `mpileup -B` option). ...
written 4 weeks ago by jkbonfield180
0
votes
1
answer
94
views
1
answers
Answer: A: .bam mpileup mapping quality
... The answer to this is both yes and no depending on interpretation of the question. The mapping quality is only shown for the first base in the alignment, and if that base has been filtered out (e.g. due to use of the `-Q` option) then the corresponding `^x` part is also filtered out and it will no ...
written 4 weeks ago by jkbonfield180
0
votes
1
answer
105
views
1
answers
Answer: A: Consensus sequence from multiple BAM files on a single position
... I have some old perl one-liner kicking around to do this from the samtools mpileup output, but it doesn't look like it'll handle multiple chromosomes. Likely it was a one-off hack for some task. Anyway maybe it can be adapted. Be warned: it's hideous! :-) samtools mpileup in.bam | perl -ane ...
written 4 weeks ago by jkbonfield180
0
votes
6
answers
7.3k
views
6
answers
Comment: C: What Is Best Current Dna Compression Algorithm?
... Bzip2 is old, and besides bzip (aka bzip1) was better but was replaced because of now expired patent problems. The modern alternative to bzip2 is bsc (https://github.com/IlyaGrebnov/libbsc) and it totally owns bzip2. Very impresive work. Also for sure you do get better compression with dedicated ...
written 9 weeks ago by jkbonfield180
6
votes
3
answers
306
views
3
answers
Answer: A: Is it possible to directly convert fastq to CRAM ?
... While CRAM is indeed much smaller than BAM, its primary benefit is when using aligned data so you can use a reference sequence (whether external or embedded). It does however still work without a reference. I wouldn't recommend creating a fake reference just to get it to swallow things. You shou ...
written 3 months ago by jkbonfield180

Latest awards to jkbonfield

Appreciated 3 months ago, created a post with more than 5 votes. For A: Is it possible to directly convert fastq to CRAM ?
Good Answer 3 months ago, created an answer that was upvoted at least 5 times. For A: Is it possible to directly convert fastq to CRAM ?
Teacher 3 months ago, created an answer with at least 3 up-votes. For A: Recovering bam files after unknow deletion in the storage
Teacher 3 months ago, created an answer with at least 3 up-votes. For A: what should a SAM/BAM record contain when there are no quality scores
Teacher 4 months ago, created an answer with at least 3 up-votes. For A: Recovering bam files after unknow deletion in the storage

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1684 users visited in the last hour