User: toni

toni2.1k
Reputation:
2,130
Status:
Trusted
Location:
Lyon
Last seen:
3 months, 1 week ago
Joined:
9 years, 3 months ago
Email:
f*******@gmail.com

Bioinformatician. Interested in genomics, transcriptomics, NGS.

Posts by toni

0
votes
4
answers
4.3k
views
Answer: A: Calculate percentage of bases covered less than nX in targeted sequencing experi
... I know it's an old thread but as I have tested this tool recently, I would definitely use [Mosdepth][1] to answer this question. It's fast and its `--thresholds` option does exactly this. Anthony [1]: https://github.com/brentp/mosdepth ...
written 11 months ago by toni2.1k
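The quantity asked about in that thread can be sketched in plain Python, independent of any particular tool. This is only an illustration: the function name `percent_below` and the example depths are hypothetical, and the per-base depths are assumed to be already available as integers.

```python
from typing import Iterable


def percent_below(depths: Iterable[int], threshold: int) -> float:
    """Return the percentage of positions whose depth is strictly below `threshold`."""
    total = 0
    below = 0
    for depth in depths:
        total += 1
        if depth < threshold:
            below += 1
    if total == 0:
        raise ValueError("no positions supplied")
    return 100.0 * below / total


# Hypothetical per-base depths for a tiny 8 bp target (illustration only).
example_depths = [0, 5, 12, 30, 30, 18, 9, 25]
print(percent_below(example_depths, 10))  # 3 of 8 positions are below 10X
```

For real targeted-sequencing data a dedicated tool is the better choice, since iterating over every base in Python is slow.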
0
votes
6
answers
13k
views
Comment: C: Final Solution For "Mapq Should Be 0 For Unmapped Read."
... If you read carefully, this is already said in the first line of the question itself. ...
written 3.2 years ago by toni2.1k
0
votes
2
answers
3.8k
views
Comment: C: how to calculate the correlation and p values for all combination of a gene expr
... It looks like there are too many questions in one: 1 - What should I use to test my geneXgene correlations? 2 - How to adjust my p-values to get the significant ones (Multiple Hypothesis Testing) or how to rank (arbitrary threshold)? 3 - What is the general approach for that kind of problem? 4 ...
written 4.4 years ago by toni2.1k
9
votes
1
answer
1.3k
views
Answer: A: Does build 37 of the human reference genome contain 2.85Gbp?
... I have computed this recently on GRCh37 :     (Mappability was computed with GEM program for reads of length 100 and 5 mismatches authorised) EDIT : And so dividing by 2.85Gb gives you a more realistic estimate of your mean coverage since N's will never be covered by definition.   ...
written 4.5 years ago by toni2.1k
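The arithmetic behind that EDIT can be sketched as follows. Only the 2.85 Gb denominator comes from the answer; the function name `mean_coverage` and the sequencing yield are hypothetical values chosen for illustration.

```python
def mean_coverage(total_sequenced_bases: float, genome_size: float) -> float:
    """Mean coverage = total sequenced bases / number of bases that can be covered."""
    if genome_size <= 0:
        raise ValueError("genome_size must be positive")
    return total_sequenced_bases / genome_size


# Non-N size of GRCh37 as quoted in the answer.
NON_N_GENOME_SIZE = 2.85e9

# Hypothetical sequencing yield, for illustration only.
total_bases = 85.5e9

print(mean_coverage(total_bases, NON_N_GENOME_SIZE))
```

Dividing by a larger genome size that includes the N positions would lower the estimate, which is why the answer prefers the non-N figure.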
2
votes
10
answers
44k
views
Comment: C: Multiline Fasta To Single Line Fasta
... I do not think that this is good advice for several reasons (even if it can work in a few cases): - FASTA files are sometimes BIG, like 20Gb, containing millions of records. Not sure whether Notepad will survive this. - (I think) This forum/site is more about learning/developing some coding s ...
written 4.6 years ago by toni2.1k
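For the file sizes mentioned in that comment, a streaming script is one option for the thread's task. The sketch below is a hypothetical helper, not code from the thread: `linearize_fasta` reads line by line and keeps only one record in memory at a time.

```python
from typing import Iterable, Iterator


def linearize_fasta(lines: Iterable[str]) -> Iterator[str]:
    """Yield each header followed by its sequence joined onto a single line."""
    header = None
    seq_parts = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                yield header
                yield "".join(seq_parts)
            header = line
            seq_parts = []
        elif line:
            if header is None:
                raise ValueError("sequence data found before the first header")
            seq_parts.append(line.strip())
    # Flush the last record.
    if header is not None:
        yield header
        yield "".join(seq_parts)


demo = [">seq1\n", "ACGT\n", "TTGA\n", ">seq2\n", "GG\n"]
for out_line in linearize_fasta(demo):
    print(out_line)
```

Passing an open file handle instead of `demo` processes a large file without loading it whole.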
1
vote
3
answers
1.8k
views
Answer: C: Does read-depth refer to a fold?
... No. You should rather see this as a multiplicative factor. Increased by 3-fold means : NEW = 3 * OLD     ...
written 4.8 years ago by toni2.1k
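The multiplicative reading given in that answer can be written out as a minimal sketch. The function names and the depth values are hypothetical.

```python
def apply_fold(old: float, fold: float) -> float:
    """Return the new value after an increase by `fold`-fold: NEW = fold * OLD."""
    return fold * old


def fold_change(old: float, new: float) -> float:
    """Return the multiplicative factor relating `new` to `old`."""
    if old == 0:
        raise ValueError("fold change is undefined when the old value is 0")
    return new / old


# Hypothetical read depths, for illustration only.
print(apply_fold(10, 3))
print(fold_change(10, 30))
```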
0
votes
1
answer
3.3k
views
Comment: C: Markduplicates Creating A Loss Of Mate Pair
... This is the behavior I always had with MarkDuplicates when removing them. It generates orphan reads. These orphan reads are unmapped. Actually, it happens when you have a pair where only one read is mapped. When this mapped read is tagged as a duplicate, MarkDuplicates leaves its unmapped mate in t ...
written 5.7 years ago by toni2.1k
1
vote
1
answer
2.4k
views
Answer: A: Split Fastq Paired-Read Using Awk
... If you are sure that pair F3 is ALWAYS written before pair G3, you can try this : awk 'BEGIN{count=0}{count++; if(count<=4) {print $0 > "F3.fastq"} else {print $0 > "R3.fastq" } if(count==8) count=0 }' input.fastq ...
written 6.2 years ago by toni2.1k
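The same de-interleaving idea as the awk one-liner can be sketched in Python. It rests on the same assumptions as the answer: every record is exactly four lines and the two mates of a pair are always adjacent, first mate first. The function name `split_interleaved` is hypothetical.

```python
import io
from typing import TextIO


def split_interleaved(src: TextIO, first_out: TextIO, second_out: TextIO) -> int:
    """Send lines 1-4 of every 8-line block to first_out and lines 5-8 to second_out.

    Returns the number of pairs written.
    """
    pairs = 0
    while True:
        block = [src.readline() for _ in range(8)]
        if not block[0]:
            break  # clean end of input
        if not block[7]:
            raise ValueError("input does not contain a whole number of 8-line pairs")
        first_out.writelines(block[:4])
        second_out.writelines(block[4:])
        pairs += 1
    return pairs


# Tiny in-memory demonstration with made-up reads.
demo = io.StringIO("@r1_F3\nACGT\n+\nIIII\n@r1_R3\nTTGA\n+\nIIII\n")
f3, r3 = io.StringIO(), io.StringIO()
print(split_interleaved(demo, f3, r3))
```

With real files, the three arguments would be handles from `open(...)`. Unlike the awk version, this sketch raises an error on a truncated final pair instead of silently writing it.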
2
votes
1
answer
7.4k
views
Comment: C: How To Extract Consensus Sequence Of Mapped Regions From A Bam File?
... You might find this thread or this link useful for your purpose. How to generate a consensus fasta sequence from SAM tools pileup? http://chipster.csc.fi/manual/samtools-consensus.html ...
written 6.3 years ago by toni2.1k
0
votes
1
answer
9.4k
views
Comment: C: Fastq Quality Control And Reporting - Aka Fastqc Versus The New Contenders
... I do agree. How many times (I am not going to cite the tools) have we, in my team, been lost in tool "bugs". We have spent an incredible amount of time running a tool only to see it crash in the end, and for some tools it could fail after one week. You also sometimes report this to the developer and you s ...
written 6.4 years ago by toni2.1k

Latest awards to toni

Popular Question 4 months ago, created a question with more than 1,000 views. For C: How To Extract Consensus Sequence Of Mapped Regions From A Bam File?
Teacher 2.2 years ago, created an answer with at least 3 up-votes. For A: How To Parse Fastq File?
Great Question 3.2 years ago, created a question with more than 5,000 views. For Library Duplicates Vs. Optical Duplicates (Picard Markduplicates)
Great Question 3.5 years ago, created a question with more than 5,000 views. For Gff3 Format - Human Genome
Great Question 3.9 years ago, created a question with more than 5,000 views. For Gff3 Format - Human Genome
Popular Question 4.0 years ago, created a question with more than 1,000 views. For Gff3 Format - Human Genome
Scholar 4.1 years ago, created an answer that has been accepted. For A: Does build 37 of the human reference genome contain 2.85Gbp?
Teacher 4.1 years ago, created an answer with at least 3 up-votes. For A: How To Parse Fastq File?
Epic Question 4.2 years ago, created a question with more than 10,000 views. For Ngs - Huge (Fastq) File Parsing - Which Language For Good Efficiency ?
Teacher 4.2 years ago, created an answer with at least 3 up-votes. For A: How To Parse Fastq File?
Good Question 4.3 years ago, asked a question that was upvoted at least 5 times. For Gff3 Format - Human Genome
Good Answer 4.5 years ago, created an answer that was upvoted at least 5 times. For A: Sam Validation Error: Error: Record 8404072, Read Name Srx020270.6546275, Mapq S
Scholar 4.5 years ago, created an answer that has been accepted. For A: Does build 37 of the human reference genome contain 2.85Gbp?
Teacher 4.5 years ago, created an answer with at least 3 up-votes. For A: How To Parse Fastq File?
Commentator 4.6 years ago, created a comment with at least 3 up-votes. For C: Useful Bash Commands To Handle Fasta Files
Popular Question 4.8 years ago, created a question with more than 1,000 views. For Selecting Nodes Of High Correlation In A Tree
Popular Question 4.8 years ago, created a question with more than 1,000 views. For Gff3 Format - Human Genome
Good Question 4.9 years ago, asked a question that was upvoted at least 5 times. For Gff3 Format - Human Genome
Prophet 4.9 years ago, created a post with more than 20 followers. For Ngs - Huge (Fastq) File Parsing - Which Language For Good Efficiency ?
Popular Question 5.0 years ago, created a question with more than 1,000 views. For Gff3 Format - Human Genome

Powered by Biostar version 2.3.0