User: Kevin

gravatar for Kevin
Kevin560
Reputation:
560
Status:
Trusted
Location:
Website:
http://kevin-gattaca.b...
Last seen:
3 months, 3 weeks ago
Joined:
7 years, 6 months ago
Email:
a******@gmail.com

Posts by Kevin

<prev • 40 results • page 1 of 4 • next >
0
votes
1
answer
909
views
1
answers
Comment: C: randomly subsampling a bam file three times
... in case it's not apparent , PROBABILITY=Double P=Double The probability of keeping any individual read, between 0 and 1. Default value: 1.0. This option can be set to 'null' to clear the default value. use 0.25 for ~ 25% of the reads. ...
written 4 months ago by Kevin560
0
votes
2
answers
1.4k
views
2
answers
Answer: A: Where Can I Find Data Setnormal\Tumor Of Cancer?
... Other possible sources which have other kinds of datasets European Genome-phenome Archive https://www.ebi.ac.uk/ega/datasets EGAD00001000082 20 Matched Pair Breast Cancer Genomes Illumina HiSeq 2000, Illumina Genome Analyzer II 42 bam https://www.ebi.ac.uk/ega/datasets/EGAD00001000082 https://dis ...
written 13 months ago by Kevin560
1
vote
3
answers
2.4k
views
3
answers
Answer: A: 10Kb Reads From Illumina Hiseq
... The first set of public data is out already. they have 3000 8500bp reads (single end) http://kevin-gattaca.blogspot.sg/2013/07/what-would-you-do-with-3k-of-8500-bp.html ...
written 4.4 years ago by Kevin560
0
votes
1
answer
1.8k
views
1
answer
Convert Illumina Human Hapmap 550 K Chip Manifest File From B36 To B37
... Hi I have old data that was genotyped on convert Illumina Human Hapmap 550 k chip the old manifest file is in b36. What's the best way to migrate this info to b37? I know one approach was to use bwa to align the probes and annotating the sam file. But I googled and can't find a tool that does t ...
illumina convert written 5.2 years ago by Kevin560 • updated 5.2 years ago by Istvan Albert ♦♦ 75k
0
votes
2
answers
9.6k
views
2
answers
Answer: A: Best Practices / Faster Samtools Mpileup, Any Tips?
... Ok just learnt about the new calling by region option in mpileup. Calling SNPs/INDELs in small regions (see http://samtools.sourceforge.net/mpileup.shtml ) vcfutils.pl splitchr -l 500000 ref.fa.fai | xargs -i \ echo samtools mpileup -C50 -m3 -F0.0002 -DSuf ref.fa -r {} -b bam.list \| bcftoo ...
written 5.3 years ago by Kevin560
0
votes
2
answers
3.4k
views
2
answers
Comment: C: Split Bam Files By Region For Parallel Variant Calling
... Not a direct answer but GATK 2 has support for reduced bams now so they might reduce your bam sizes for multi sample calling. but of course if your issue is getting more chunks to parallelise then I think the samtools -L bedfile seems like a good idea ...
written 5.4 years ago by Kevin560
0
votes
1
answer
3.1k
views
1
answers
Comment: C: Putting Run/Lane Info Back Into Readgroup For Gatk Pipeline
... As I understand, the RG field info can be found in the read name? e.g. format of the template name (header in the bam file), is it in the format of sequencer:lane:tile:coord-x:coord-y? Machine>_Run number> : Lane> : Tile> : X coordinate of cluster> : Y coordinate of cluster> ...
written 5.4 years ago by Kevin560
2
votes
1
answer
3.1k
views
1
answer
Putting Run/Lane Info Back Into Readgroup For Gatk Pipeline
... I am looking at the read name from the archival bam from the sequencing provider. It provides machine/ run / lane info but readgroup info isn't written in it. What are my options if I want to extract the fastq from the bam to align with BWA to annotate the RG info so that it is used in downstream ...
gatk bam written 5.4 years ago by Kevin560 • updated 5.4 years ago by Sean Davis24k
0
votes
0
answers
2.4k
views
0
answers
Comment: C: Bwa Aln -Q 15 When To Use It? What Does It Do?
... Thanks for the helpful links! I must have been using wrong keywords to search ...
written 5.4 years ago by Kevin560
8
votes
0
answers
2.4k
views
0
answers
Bwa Aln -Q 15 When To Use It? What Does It Do?
... I don't quite get what the -q option does with a value if 15, is there a more descriptive explanation of the option? -q INT Parameter for read trimming. BWA trims a read down to argmax_x{\sum_{i=x+1}^l(INT-q_i)} if q_l<INT where l is the original read length. [0] Thanks! ...
trimming bwa written 5.4 years ago by Kevin560 • updated 5.4 years ago by Istvan Albert ♦♦ 75k

Latest awards to Kevin

Great Question 13 months ago, created a question with more than 5,000 views. For Where Can I Download Vcf Files For Publicly Available Data?
Great Question 13 months ago, created a question with more than 5,000 views. For Fastest Way To Rename Sample Name In 37 Gb Gzipped Vcf File Or Binary Ped File
Great Question 13 months ago, created a question with more than 5,000 views. For Best Practices / Faster Samtools Mpileup, Any Tips?
Good Question 13 months ago, asked a question that was upvoted at least 5 times. For Where Can I Download Vcf Files For Publicly Available Data?
Popular Question 19 months ago, created a question with more than 1,000 views. For How To Extract Snps From A Desired List Of Gene Names
Appreciated 4.4 years ago, created a post with more than 5 votes. For Where Can I Download Vcf Files For Publicly Available Data?
Great Question 4.4 years ago, created a question with more than 5,000 views. For Best Practices / Faster Samtools Mpileup, Any Tips?
Good Answer 4.4 years ago, created an answer that was upvoted at least 5 times. For A: Is There Any R Or R / Bioconductor Package That Can Make Circular Plots Like Per
Popular Question 4.4 years ago, created a question with more than 1,000 views. For Bwa Aln -Q 15 When To Use It? What Does It Do?
Popular Question 4.4 years ago, created a question with more than 1,000 views. For How To Extract Snps From A Desired List Of Gene Names
Popular Question 4.4 years ago, created a question with more than 1,000 views. For Convert Illumina Human Hapmap 550 K Chip Manifest File From B36 To B37
Popular Question 4.4 years ago, created a question with more than 1,000 views. For Snps In Promoter Regions In Exome Sequencing?
Popular Question 4.4 years ago, created a question with more than 1,000 views. For Where Can I Download Vcf Files For Publicly Available Data?
Popular Question 4.4 years ago, created a question with more than 1,000 views. For Fastest Way To Rename Sample Name In 37 Gb Gzipped Vcf File Or Binary Ped File
Popular Question 4.4 years ago, created a question with more than 1,000 views. For Vcftools Can'T Create Ped Map File Due To Ulimit Of Allowed Open Files.
Popular Question 4.4 years ago, created a question with more than 1,000 views. For Bigger Sample Sizes For Wgs Exome. Is Nosql The Way To Go? Or Bio Hdf
Popular Question 4.4 years ago, created a question with more than 1,000 views. For Best Practices / Faster Samtools Mpileup, Any Tips?
Popular Question 4.4 years ago, created a question with more than 1,000 views. For Is There A Tool To Combine Samtools / Sam File Bitwise Flags?
Popular Question 4.4 years ago, created a question with more than 1,000 views. For Putting Run/Lane Info Back Into Readgroup For Gatk Pipeline
Popular Question 4.4 years ago, created a question with more than 1,000 views. For How To Create Genome-Wide Estimation Of Ibd Sharing Using Plink From Tped Files?
Supporter 4.4 years ago, voted at least 25 times.
Teacher 4.4 years ago, created an answer with at least 3 up-votes. For A: List Of All Known Regulatory Regions In Human Genome
Teacher 4.4 years ago, created an answer with at least 3 up-votes. For A: Is There Any R Or R / Bioconductor Package That Can Make Circular Plots Like Per
Teacher 4.4 years ago, created an answer with at least 3 up-votes. For A: Bioinformatics "Cheat Sheet"

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 954 users visited in the last hour