Moderator: Damian Kao

gravatar for Damian Kao
Damian Kao14k
Reputation:
14,300
Status:
Trusted
Location:
USA
Website:
http://blog.nextgeneti...
Last seen:
14 hours ago
Joined:
6 years, 5 months ago
Email:
d*********@gmail.com

Bioinformatician at Janelia Research Campus.

Posts by Damian Kao

<prev • 869 results • page 1 of 87 • next >
0
votes
0
answers
54
views
0
answers
Comment: C: Hybrid assembly using MinION data + correction with Illumina. Which strategy to
... Try all of them. In my experience there isn't one method that just works all the time. Sometimes my Spades is better than my Canu. Sometimes my miniasm is better than my Spades,etc. ...
written 21 hours ago by Damian Kao14k
0
votes
0
answers
140
views
0
answers
Comment: C: using awk to extract a specific pattern
... Have you tried the -y parameter in gffread: -y write a protein fasta file with the translation of CDS for each record ...
written 22 hours ago by Damian Kao14k
0
votes
0
answers
140
views
0
answers
Comment: C: using awk to extract a specific pattern
... Your file looks like a gtf file, not gff. And you probably want to be careful with what you mean by "first" CDS. Do you mean "first" according to genomic coordinate or "first" according to the gene? If a gene is on the minus strand of the genome, the first CDS according to the gene is actually the l ...
written 23 hours ago by Damian Kao14k
1
vote
2
answers
112
views
2
answers
Answer: A: How to get the total genic and intergenic length of a chromosome?
... It looks like you have a .gtf file. That means you can extract the exon lines from the .gtf file and count and sum up the exonic intervals. You can generate a sorted .bed file of exon coordinates by: grep -P '\texon\t' your.gtf | cut -f 1,4,5 | sort -k1,1 -k2,2n > exons.bed You can merge t ...
written 1 day ago by Damian Kao14k
0
votes
1
answer
164
views
1
answers
Answer: A: long-range technologies for genome scaffolding
... I recently got back a genome from Dovetail that increased N50 from ~100kb to ~50MB. Yes, that's a 500X increase in contiguity. N50 should not be the only stat for assessing how good a genome is. But it is still pretty impressive. Gene discovery/BUSCO stats also significantly increased. A few cavea ...
written 10 days ago by Damian Kao14k
0
votes
2
answers
88
views
2
answers
Comment: C: Variant calling in genomic chunks
... Thanks for the reply. I see what you mean with the SVs and possibly even indels. I am not interested in SVs for now, but do want to preserve indel information if I can. The BAM files I am working with are low coverage. I guess I'll to write a script to chunk the BAM file based on coverage "islands ...
written 10 days ago by Damian Kao14k
5
votes
2
answers
88
views
6 follow
2
answers
Variant calling in genomic chunks
... Is variant calling done on a per-position basis? I've read recommendations to split the BAM file by chromosomes and parallel call the chromosomes for speed Could I further segment each chromosome into chunks and do calling on each chunk? For example if I: 1) Split a chromosome into 1MB segments. 2) ...
variant calling vcf written 10 days ago by Damian Kao14k • updated 10 days ago by finswimmer1.4k
0
votes
2
answers
86
views
2
answers
Answer: A: Assemble genomes from different sequencing platforms
... I've used GAM-NGS in the past with some success: https://github.com/vice87/gam-ngs ...
written 11 days ago by Damian Kao14k
0
votes
0
answers
114
views
0
answers
Comment: C: Is there a easy to use GATK pipeline for SNP calling?
... That pipeline might be a bit old. I don't think you need to do the realignment target creater/realign for indel anymore as haplotypeCaller will do that now. The general steps for me are: 1. trim reads 2. bwa mem align to genome 3. mark duplicates 4. use HaplotypeCaller to generate gvcf 5. Combine ...
written 16 days ago by Damian Kao14k
0
votes
0
answers
193
views
0
answers
Comment: C: Stringtie to DESeq2
... Can you paste a few lines of your phenotype_2.txt? **EDIT nevermind, look at Kevin's response below. You are using Python3+ and the script is meant for python2.7. ...
written 16 days ago by Damian Kao14k

Latest awards to Damian Kao

Teacher 23 days ago, created an answer with at least 3 up-votes. For A: Removing Redundant Amino Acid Sequences From Fasta - *But Also Give The Groups O
Scholar 23 days ago, created an answer that has been accepted. For C: What's the correct way to withdraw a published database/server ?
Good Answer 28 days ago, created an answer that was upvoted at least 5 times. For A: Bwa What Is In .Sai File
Appreciated 28 days ago, created a post with more than 5 votes. For A: A Farewell To Bioinformatics
Good Question 7 weeks ago, asked a question that was upvoted at least 5 times. For Given Unlimited Resources, What Would Be Your Dream Experiment?
Appreciated 7 weeks ago, created a post with more than 5 votes. For A: A Farewell To Bioinformatics
Popular Question 8 weeks ago, created a question with more than 1,000 views. For deseq2 user defined size factors
Popular Question 10 weeks ago, created a question with more than 1,000 views. For deseq2 user defined size factors
Good Answer 11 weeks ago, created an answer that was upvoted at least 5 times. For A: A Farewell To Bioinformatics
Popular Question 12 weeks ago, created a question with more than 1,000 views. For deseq2 user defined size factors
Popular Question 3 months ago, created a question with more than 1,000 views. For deseq2 user defined size factors
Appreciated 3 months ago, created a post with more than 5 votes. For A: A Farewell To Bioinformatics
Teacher 3 months ago, created an answer with at least 3 up-votes. For A: Removing Redundant Amino Acid Sequences From Fasta - *But Also Give The Groups O
Popular Question 3 months ago, created a question with more than 1,000 views. For deseq2 user defined size factors
Popular Question 3 months ago, created a question with more than 1,000 views. For deseq2 user defined size factors
Popular Question 4 months ago, created a question with more than 1,000 views. For deseq2 user defined size factors
Commentator 4 months ago, created a comment with at least 3 up-votes. For C: Shall We Go Back To Stackexchange?
Popular Question 5 months ago, created a question with more than 1,000 views. For deseq2 user defined size factors
Popular Question 5 months ago, created a question with more than 1,000 views. For deseq2 user defined size factors
Popular Question 6 months ago, created a question with more than 1,000 views. For contigs and scaffolds in abyss
Popular Question 6 months ago, created a question with more than 1,000 views. For contigs and scaffolds in abyss
Scholar 7 months ago, created an answer that has been accepted. For C: What's the correct way to withdraw a published database/server ?
Teacher 10 months ago, created an answer with at least 3 up-votes. For A: Removing Redundant Amino Acid Sequences From Fasta - *But Also Give The Groups O
Good Answer 11 months ago, created an answer that was upvoted at least 5 times. For A: Deseq Sizefactors Help
Teacher 11 months ago, created an answer with at least 3 up-votes. For A: Removing Redundant Amino Acid Sequences From Fasta - *But Also Give The Groups O

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 829 users visited in the last hour