User: bari.ballew

gravatar for bari.ballew
bari.ballew90
Reputation:
90
Status:
Trusted
Location:
USA/NIH
Twitter:
@BariBallew
Last seen:
1 day, 12 hours ago
Joined:
1 year, 4 months ago
Email:
b**********@gmail.com

Posts by bari.ballew

<prev • 9 results • page 1 of 1 • next >
2
votes
1
answer
162
views
1
answers
Answer: A: snakemake wildcard for fastq files
... It looks like you may have duplicated part of your path. Right now, your path to your data reads: `/t6/h7/data/expression/t6/h7/data/expression/demo_data/{sample}_R1.fastq.gz` I'm assuming you need to access the paired fastq files in tandem for alignment or something similar. Try something like t ...
written 11 days ago by bari.ballew90
1
vote
3
answers
148
views
3
answers
Answer: A: Rename FASTA headers based on filename
... Just using bash: ``` for i in *fasta; do n="${i%.fasta}"; sed -i.bak "s/>[^_]\+/>$n/" $i; done ``` This loops over all files in the current directory that end with "fasta". For each file: 1. `n="${i%.fasta}"` removes the .fasta file extension (can be generalized to any extension by using ` ...
written 27 days ago by bari.ballew90
0
votes
1
answer
3.1k
views
1
answers
Comment: C: Looking For Approaches For Finding Contamination And Sample Identity Matching Is
... Hi Sean! Did your group ever explore options for this sort of QC? We are currently using VerifyBamID for both germline and tumor/normal data, but we are concerned with the possibility that it will erroneously interpret somatic copy number changes in tumor samples as contamination. Thinking about ...
written 27 days ago by bari.ballew90
0
votes
1
answer
172
views
1
answers
Comment: C: MAF file having mutation data
... Yes, any potentially identifiable data in GDC falls under controlled access. From GDC's documentation: > Open access data generally includes high level genomic data that is > not individually identifiable, as well as most clinical and all > biospecimen data elements. > > Controlle ...
written 9 weeks ago by bari.ballew90
0
votes
1
answer
172
views
1
answers
Comment: C: MAF file having mutation data
... A MAF file is just a list of mutations found in a given sample(s). You can find mutations in either WES or WGS data, so you can have a MAF file (or a VCF file) for either type of experiment. For WGS, you are looking at mutations across the whole genome, so you will likely have more mutations liste ...
written 9 weeks ago by bari.ballew90
0
votes
1
answer
172
views
1
answers
Answer: A: MAF file having mutation data
... TCGA has both WES and WGS data. If you're looking at WES data, you'll see mainly protein-coding mutations, as this is what exome sequencing focuses on. If you're looking for non-coding regions, you may be more interested in the whole genome data. ...
written 9 weeks ago by bari.ballew90
1
vote
2
answers
538
views
2
answers
Answer: A: Downstream analysis of VCF files obtained from VarScan2
... Just realized this is an older question, but I already wrote up an answer so hopefully this is still helpful! First, regarding merging somatic, germline, and LOH files for SNPs and indels: this depends on the question you're trying to answer, but you probably don't want to merge them, at least at f ...
written 10 weeks ago by bari.ballew90
4
votes
2
answers
251
views
2
answers
Answer: A: tool to detect large deletions
... There are several major algorithms that are used to find large deletions (structural variants, or SVs). Most of them rely on a pair of sequencing reads spanning the breakpoints of the SV. If you have WGS data, the chance that you sequenced across the breakpoint is pretty good, but for WES data, it ...
written 10 weeks ago by bari.ballew90
1
vote
1
answer
615
views
1
answers
Answer: A: From pooled fastq data to SNPs
... You probably want to check out Broad's best practices (https://software.broadinstitute.org/gatk/best-practices/). Specifically, look mainly at the sections on data pre-processing and germline SNPS+indels. Alternatively, you could check out the bcbio pipeline (http://bcbio-nextgen.readthedocs.io/en ...
written 16 months ago by bari.ballew90

Latest awards to bari.ballew

Scholar 9 weeks ago, created an answer that has been accepted. For A: tool to detect large deletions
Teacher 9 weeks ago, created an answer with at least 3 up-votes. For A: tool to detect large deletions

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1699 users visited in the last hour