Moderator: Damian Kao

gravatar for Damian Kao
Damian Kao15k
Reputation:
15,260
Status:
Trusted
Location:
USA
Website:
http://blog.nextgeneti...
Last seen:
22 hours ago
Joined:
7 years, 11 months ago
Email:
d*********@gmail.com

Bioinformatician at Janelia Research Campus.

Posts by Damian Kao

<prev • 912 results • page 1 of 92 • next >
0
votes
0
answers
169
views
0
answers
Comment: C: Right MongoDB approach for a multigenomic database?
... My recommendation is to not over-optimize unless this is a personal project for learning purposes. There are three main considerations here all related to scale: 1. How many concurrent users do you expect to be using this database? 2. Is this pre-dominantly a read-only database where users query ...
written 3 months ago by Damian Kao15k
0
votes
1
answer
235
views
1
answers
Comment: C: Is there a preferred RNA-seq aligner for (very) short reads and potential homolo
... Try setting --outFilterMatchNmin to 20 to see if you can get more mapping. However, that means it only requires 20 bases to map, which is pretty low. ...
written 5 months ago by Damian Kao15k
2
votes
2
answers
268
views
2
answers
Answer: A: Does Gene length corrected TMM [GeTMM] violate any assumptions of TMM normalizat
... Technically, RPK values do not violate assumptions of TMM. TMM is just a technique that tries to find the non-DE portion of the expression distribution by very liberally trimming off outliers. It doesn't matter what kind of expression units you are using. However, RPK values do violate assumptio ...
written 5 months ago by Damian Kao15k
3
votes
1
answer
309
views
1
answers
Answer: A: Trimmomatic output file Issue
... It looks like you are not specifying an output file for the -trimlog parameter. So it thinks your `output_forward_paired.fq.gz` is an input. I hope you still can still redownload `sample1_R1_001.fastq.gz`, because it might have been overwritten. ...
written 5 months ago by Damian Kao15k
1
vote
8
answers
478
views
8
answers
Answer: A: Code golf: detecting homopolymers of length N in the (human) genome
... Takes in fasta file and a second parameter for homopolymer length. It streams through each line to find homopolymers. Outputs chromosome, start, end, homopolymer base, length of homopolymer. I tested it out on this fasta file: >A AGTCAAAA GGGGTTTTCCCC >B AGTCCCCCTTTTAAA ...
written 5 months ago by Damian Kao15k
5
votes
4
answers
525
views
4
answers
Answer: C: Getting the number of SNPs in some ranges
... You have a .csv file with chromosome, start, end coordinates. You can change that into a .bed file pretty easily. Same with your SNP csv file. Convert those csv files to bed and then use bedtools intersect. ...
written 6 months ago by Damian Kao15k
0
votes
0
answers
347
views
0
answers
Comment: C: Strange Depth of Coverage distribution
... Yeah that could be it. You can check this by looking at the insert size distribution of your PE reads. See if it is smaller than 2 * average read length. The 9th column of your .sam/.bam should be the insert size. ...
written 6 months ago by Damian Kao15k
2
votes
0
answers
458
views
0
answers
Comment: C: Getting read depth for normal and tumour
... I am not sure what program you are using to generate the .vcfs. But you should look into the manual of the program and see if it outputs depths using the format fields in the vcf. For example, you have "DP" field in your vcf that shows the depth of the individual samples. Perhaps one of the other f ...
written 6 months ago by Damian Kao15k
0
votes
0
answers
458
views
0
answers
Comment: C: Getting read depth for normal and tumour
... Can you post a couple of lines of your vcf file? ...
written 6 months ago by Damian Kao15k
5
votes
1
answer
341
views
1
answers
Answer: A: Samtools segmentation fault when extracting exons from bam file.
... There is a -L flag that you can use with samtools to output alignments overlapping intervals defined by a .bed file. That's probably your best option. You'll have to convert your exons.txt to a bed file. ...
written 6 months ago by Damian Kao15k

Latest awards to Damian Kao

Popular Question 6 weeks ago, created a question with more than 1,000 views. For segment duplication vs heterozygosity
Teacher 9 weeks ago, created an answer with at least 3 up-votes. For A: Removing Redundant Amino Acid Sequences From Fasta - *But Also Give The Groups O
Teacher 3 months ago, created an answer with at least 3 up-votes. For A: Removing Redundant Amino Acid Sequences From Fasta - *But Also Give The Groups O
Appreciated 4 months ago, created a post with more than 5 votes. For A: A Farewell To Bioinformatics
Teacher 4 months ago, created an answer with at least 3 up-votes. For A: Removing Redundant Amino Acid Sequences From Fasta - *But Also Give The Groups O
Scholar 4 months ago, created an answer that has been accepted. For C: What's the correct way to withdraw a published database/server ?
Scholar 5 months ago, created an answer that has been accepted. For C: What's the correct way to withdraw a published database/server ?
Appreciated 5 months ago, created a post with more than 5 votes. For A: A Farewell To Bioinformatics
Teacher 5 months ago, created an answer with at least 3 up-votes. For A: Removing Redundant Amino Acid Sequences From Fasta - *But Also Give The Groups O
Scholar 6 months ago, created an answer that has been accepted. For C: What's the correct way to withdraw a published database/server ?
Appreciated 6 months ago, created a post with more than 5 votes. For A: A Farewell To Bioinformatics
Teacher 6 months ago, created an answer with at least 3 up-votes. For A: Removing Redundant Amino Acid Sequences From Fasta - *But Also Give The Groups O
Good Answer 8 months ago, created an answer that was upvoted at least 5 times. For A: A Farewell To Bioinformatics
Commentator 9 months ago, created a comment with at least 3 up-votes. For C: Shall We Go Back To Stackexchange?
Popular Question 9 months ago, created a question with more than 1,000 views. For deseq2 user defined size factors
Teacher 10 months ago, created an answer with at least 3 up-votes. For A: Removing Redundant Amino Acid Sequences From Fasta - *But Also Give The Groups O
Popular Question 11 months ago, created a question with more than 1,000 views. For deseq2 user defined size factors
Scholar 14 months ago, created an answer that has been accepted. For C: Does trinity discard reads shorter than 25-mer?
Scholar 14 months ago, created an answer that has been accepted. For C: What's the correct way to withdraw a published database/server ?
Good Answer 14 months ago, created an answer that was upvoted at least 5 times. For A: A Farewell To Bioinformatics
Appreciated 14 months ago, created a post with more than 5 votes. For A: A Farewell To Bioinformatics
Teacher 14 months ago, created an answer with at least 3 up-votes. For A: Removing Redundant Amino Acid Sequences From Fasta - *But Also Give The Groups O
Appreciated 14 months ago, created a post with more than 5 votes. For A: A Farewell To Bioinformatics
Teacher 14 months ago, created an answer with at least 3 up-votes. For A: Removing Redundant Amino Acid Sequences From Fasta - *But Also Give The Groups O
Popular Question 15 months ago, created a question with more than 1,000 views. For deseq2 user defined size factors

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1505 users visited in the last hour