User: toni

toni (2.1k)
Reputation: 2,140
Status: Trusted
Location: Lyon
Last seen: 1 month, 3 weeks ago
Joined: 9 years, 9 months ago
Email: f*******@gmail.com

Bioinformatician. Interested in genomics, transcriptomics, NGS.

Posts by toni

1 vote • 4 answers • 4.5k views
Answer: A: Calculate percentage of bases covered less than nX in targeted sequencing experi
... I know it's an old thread, but as I have tested this tool recently, I would definitely use [Mosdepth](https://github.com/brentp/mosdepth) to answer this question. It's fast and its `--thresholds` option does exactly this. Anthony ...
written 17 months ago by toni (2.1k)
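To make the quantity discussed in that thread concrete, here is a minimal Python sketch of "percentage of bases covered below nX". It is not Mosdepth and does not reproduce its output format; the function name `percent_below` and the depth values are hypothetical, chosen only for illustration.

```python
def percent_below(depths, threshold):
    """Return the percentage of positions whose depth is strictly below threshold."""
    if not depths:
        raise ValueError("no positions given")
    below = sum(1 for d in depths if d < threshold)
    return 100.0 * below / len(depths)


# Hypothetical per-base depths for a 10 bp target region (illustration only).
example_depths = [0, 5, 12, 18, 20, 25, 30, 31, 40, 8]

# Five of the ten positions have depth < 20.
print(percent_below(example_depths, 20))  # prints 50.0
```

For real data, a dedicated tool working directly on the BAM file is likely to be far faster than a per-base loop like this one.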
0 votes • 6 answers • 13k views
Comment: C: Final Solution For "Mapq Should Be 0 For Unmapped Read."
... If you read carefully, this is already said in the first line of the question itself. ...
written 3.7 years ago by toni (2.1k)
0 votes • 2 answers • 4.0k views
Comment: C: how to calculate the correlation and p values for all combination of a gene expr
... Looks like there are too many questions in one: 1 - What should I use to test my gene-by-gene correlations? 2 - How do I adjust my p-values to get the significant ones (multiple hypothesis testing), or how do I rank them (arbitrary threshold)? 3 - What is the general approach for that kind of problem? 4 ...
written 4.9 years ago by toni (2.1k)
9 votes • 1 answer • 1.4k views
Answer: A: Does build 37 of the human reference genome contain 2.85Gbp?
... I have computed this recently on GRCh37: (Mappability was computed with the GEM program for reads of length 100, with up to 5 mismatches allowed.) EDIT: So dividing by 2.85 Gb gives you a more realistic estimate of your mean coverage, since N's will never be covered by definition. ...
written 5.0 years ago by toni (2.1k)
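The coverage estimate described in that answer can be sketched as a one-line calculation: total sequenced bases divided by the number of non-N bases. The 2.85 Gb figure comes from the answer; the read count and read length below are hypothetical values used only to show the arithmetic.

```python
def mean_coverage(n_reads, read_length, genome_size):
    """Mean coverage = total sequenced bases / number of bases that can be covered."""
    return n_reads * read_length / genome_size


NON_N_GRCH37 = 2.85e9  # non-N bases in GRCh37, as quoted in the answer

# Hypothetical run: 600 million reads of 100 bp (illustration only).
n_reads = 600_000_000
read_length = 100

print(round(mean_coverage(n_reads, read_length, NON_N_GRCH37), 2))  # prints 21.05
```

Dividing by the full assembly length, N's included, would give a lower figure for the same run, which is the point the answer makes.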
2 votes • 10 answers • 49k views
Comment: C: Multiline Fasta To Single Line Fasta
... I do not think that this is good advice, for several reasons (even if it can work in a few cases): - FASTA files are sometimes BIG, like 20Gb, containing millions of records. Not sure whether Notepad will survive this. - (I think) This forum/site is more about learning/developing some coding s ...
written 5.1 years ago by toni (2.1k)
1 vote • 3 answers • 1.9k views
Answer: C: Does read-depth refer to a fold?
... No. You should rather see this as a multiplicative factor. Increased by 3-fold means: NEW = 3 * OLD ...
written 5.3 years ago by toni (2.1k)
0 votes • 1 answer • 3.5k views
Comment: C: Markduplicates Creating A Loss Of Mate Pair
... This is the behavior I have always seen with MarkDuplicates when removing duplicates. It generates orphan reads. These orphan reads are unmapped. Actually, it happens when you have a pair where only one read is mapped. When this mapped read is tagged as a duplicate, MarkDuplicates leaves its unmapped mate in t ...
written 6.3 years ago by toni (2.1k)
1 vote • 1 answer • 2.6k views
Answer: A: Split Fastq Paired-Read Using Awk
... If you are sure that pair F3 is ALWAYS written before pair G3, you can try this : awk 'BEGIN{count=0}{count++; if(count<=4) {print $0 > "F3.fastq"} else {print $0 > "R3.fastq" } if(count==8) count=0 }' input.fastq ...
written 6.7 years ago by toni (2.1k)
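The awk one-liner in that answer sends the first four lines of every group of eight to one file and the next four to another. A rough Python equivalent, under the same assumption that the two mates of a pair are always written one after the other, might look like this. The function name `split_interleaved` and the read names are hypothetical, and the sketch works on lists of lines rather than files to keep it short.

```python
from itertools import islice


def split_interleaved(lines):
    """Split interleaved 4-line FASTQ records into (first_mates, second_mates)."""
    first, second = [], []
    it = iter(lines)
    record_index = 0
    while True:
        record = list(islice(it, 4))  # one FASTQ record = 4 lines
        if not record:
            break
        if len(record) != 4:
            raise ValueError("truncated FASTQ record")
        # Even-numbered records go to the first mate, odd-numbered to the second.
        (first if record_index % 2 == 0 else second).extend(record)
        record_index += 1
    if record_index % 2 != 0:
        raise ValueError("odd number of records: a mate is missing")
    return first, second


# Hypothetical interleaved input with a single pair (illustration only).
example = ["@r1_F3", "ACGT", "+", "IIII", "@r1_R3", "TTGA", "+", "IIII"]
f3, r3 = split_interleaved(example)
print(f3[0], r3[0])  # prints @r1_F3 @r1_R3
```

Unlike the awk version, this sketch raises an error on a truncated record or an unpaired read instead of silently writing it out.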
2 votes • 1 answer • 7.8k views
Comment: C: How To Extract Consensus Sequence Of Mapped Regions From A Bam File?
... You might find this thread or this link useful for your purpose: "How to generate a consensus fasta sequence from SAM tools pileup?" and http://chipster.csc.fi/manual/samtools-consensus.html ...
written 6.8 years ago by toni (2.1k)
0 votes • 1 answer • 9.7k views
Comment: C: Fastq Quality Control And Reporting - Aka Fastqc Versus The New Contenders
... I do agree. So many times (I am not going to name the tools) my team has gotten lost in tool "bugs". We have spent an incredible amount of time running a tool only to see it crash in the end, and some tools could fail after a week of running. You also sometimes report this to the developer and you s ...
written 6.9 years ago by toni (2.1k)

Latest awards to toni

Epic Question 5 months ago, created a question with more than 10,000 views. For Ngs - Huge (Fastq) File Parsing - Which Language For Good Efficiency ?
Popular Question 10 months ago, created a question with more than 1,000 views. For C: How To Extract Consensus Sequence Of Mapped Regions From A Bam File?
Teacher 2.7 years ago, created an answer with at least 3 up-votes. For A: How To Parse Fastq File?
Great Question 3.7 years ago, created a question with more than 5,000 views. For Library Duplicates Vs. Optical Duplicates (Picard Markduplicates)
Great Question 4.0 years ago, created a question with more than 5,000 views. For Gff3 Format - Human Genome
Great Question 4.4 years ago, created a question with more than 5,000 views. For Gff3 Format - Human Genome
Popular Question 4.5 years ago, created a question with more than 1,000 views. For Gff3 Format - Human Genome
Scholar 4.6 years ago, created an answer that has been accepted. For A: Does build 37 of the human reference genome contain 2.85Gbp?
Teacher 4.6 years ago, created an answer with at least 3 up-votes. For A: How To Parse Fastq File?
Epic Question 4.7 years ago, created a question with more than 10,000 views. For Ngs - Huge (Fastq) File Parsing - Which Language For Good Efficiency ?
Teacher 4.7 years ago, created an answer with at least 3 up-votes. For A: How To Parse Fastq File?
Good Question 4.8 years ago, asked a question that was upvoted at least 5 times. For Gff3 Format - Human Genome
Scholar 5.0 years ago, created an answer that has been accepted. For A: Does build 37 of the human reference genome contain 2.85Gbp?
Good Answer 5.0 years ago, created an answer that was upvoted at least 5 times. For A: Sam Validation Error: Error: Record 8404072, Read Name Srx020270.6546275, Mapq S
Teacher 5.0 years ago, created an answer with at least 3 up-votes. For A: How To Parse Fastq File?
Commentator 5.1 years ago, created a comment with at least 3 up-votes. For C: Useful Bash Commands To Handle Fasta Files
Popular Question 5.3 years ago, created a question with more than 1,000 views. For Selecting Nodes Of High Correlation In A Tree
Popular Question 5.4 years ago, created a question with more than 1,000 views. For Gff3 Format - Human Genome
Good Question 5.4 years ago, asked a question that was upvoted at least 5 times. For Gff3 Format - Human Genome
Prophet 5.4 years ago, created a post with more than 20 followers. For Ngs - Huge (Fastq) File Parsing - Which Language For Good Efficiency ?
