Moderator: Stefano Berri
Stefano Berri ♦ 4.2k
- Reputation:
- 4,150
- Status:
- Trusted
- Location:
- Cambridge, UK
- Website:
- http://www.stefanoberr...
- Twitter:
- @5tefano8erri
- Scholar ID:
- Google Scholar Page
- Last seen:
- 3 years, 5 months ago
- Joined:
- 10 years, 8 months ago
- Email:
- s*****@illumina.com
Trained in molecular biology I slowly moved to the computational side and data anlysis of genomic data. I am now a bioinformatics scientist working at Illumina within the "Population and Medical Genomics" group. My interests are in genomics, molecular biology, cancer biology, tumour evolution and reproducible science. I regularly use R/Bioconductor, Python, SQL I know a thing or two about copy number abnormalities in cancer and I develop tools to analyse Next Generation Sequencing data.
Posts by Stefano Berri
<prev
• 123 results •
page 1 of 13 •
next >
0
votes
1
answer
2.6k
views
1
answers
Comment:
C: CWL: Loops and if conditionals
... I am a beginner with cwl, but here my approach to conditional events. Make the workflow linear, like this:
Inspect data
|
Preprocess-Required?
|
Pre-process data
|
Process Data
L'et's say that Preprocess-Required spit out a file with "YES" or "NO"
Then preprocess data read that file ...
written 3.7 years ago by
Stefano Berri ♦ 4.2k
0
votes
1
answer
1.8k
views
1
answers
... I am getting this message: "CWL Directory inputs not yet supported in Toil" ...
written 3.7 years ago by
Stefano Berri ♦ 4.2k
3
votes
1
answer
1.8k
views
1
answer
... I am experimenting with CWL and cwltool. I have successfully written a workflow that can scatter jobs. However, cwltool runs them sequencially, whereas I would like to to use the Sun Grid Engine (SGE) to run jobs in parallel. I looked into Toil to run it but I have found it not capable of running th ...
written 3.7 years ago by
Stefano Berri ♦ 4.2k
• updated
3.6 years ago by
Biostar ♦♦ 20
0
votes
0
answers
3.1k
views
0
answers
Comment:
C: CNV from FFPE genomes
... Can I ask you what tools you used, what is the typical coverage, if you use a match normal (and if so, if you use same library prep) and if the tools you used perform GC correction. These are the first relevant questions that pop in my mind...
...
written 6.7 years ago by
Stefano Berri ♦ 4.2k
5
votes
12
answers
4.0k
views
12
answers
... Hi.
I think there are quite few "pure biologist" that turned to the darks side of bioinformatics, and I am one of them. I didn't have any formal training in programming/informatics till the end of my Ph.D. in molecular biology. For sure, all you knowledge in biology is not wasted. It is an asset.
...
written 6.7 years ago by
Stefano Berri ♦ 4.2k
• updated
17 months ago by
_r_am ♦ 32k
0
votes
1
answer
2.3k
views
1
answers
... 50M reads is quite a lot, actually. In CNAnorm all windows are equally sized. If you set 10Kbp windows, you would get an average of 170 reads per window. Which is plenty.
From a quick count, 85% of exons are less than 10Kbp apart, and 93% less than 25kbp apart, so most of your windows will have som ...
written 6.9 years ago by
Stefano Berri ♦ 4.2k
0
votes
1
answer
2.3k
views
1
answers
... Hi. exome is a bit more tricky becose is uneven, but as a rule of thumb, try to have, as average, 50 reads per window. In gene rich regions you will have more, in gene poor a bit less. HOw many reads do you have in total? Good luck.
Stefano
...
written 6.9 years ago by
Stefano Berri ♦ 4.2k
1
vote
1
answer
2.3k
views
1
answers
... Hi. You don't need high coverage data for copy number detection at the resolution of arrays.
You could use CNAnorm. It is designed to detect somatic CNA from low coverage genomic data (2 Million reads would be enough, very affordable if you multiplex on a run) and it does not assumes a particular r ...
written 6.9 years ago by
Stefano Berri ♦ 4.2k
1
vote
4
answers
5.8k
views
4
answers
... thanks, if that is the case, then it's not too hard. I haven't found a documents that "officially" states something like that, though...
...
written 7.0 years ago by
Stefano Berri ♦ 4.2k
0
votes
4
answers
5.8k
views
4
answers
... yes, that might work actually... column 2 and 3 are always number in the content, but never in browser or track...
...
written 7.0 years ago by
Stefano Berri ♦ 4.2k
Latest awards to Stefano Berri
Librarian
3.5 years ago,
created a post with more than 10 bookmarks.
For Read Group In Sam/Bam Files: What Do They Exactly Describe?
Appreciated
5.9 years ago,
created a post with more than 5 votes.
For A: Find A Substring And Count The Number Of Occurrences
Appreciated
5.9 years ago,
created a post with more than 5 votes.
For A: Creating A New Bioinformatics Unit
Epic Question
5.9 years ago,
created a question with more than 10,000 views.
For Read Group In Sam/Bam Files: What Do They Exactly Describe?
Popular Question
5.9 years ago,
created a question with more than 1,000 views.
For Read Group In Sam/Bam Files: What Do They Exactly Describe?
Appreciated
5.9 years ago,
created a post with more than 5 votes.
For A: Normal Distribution Of (Log) Microarray Results
Prophet
5.9 years ago,
created a post with more than 20 followers.
For Read Group In Sam/Bam Files: What Do They Exactly Describe?
Good Answer
5.9 years ago,
created an answer that was upvoted at least 5 times.
For A: Creating A New Bioinformatics Unit
Great Question
5.9 years ago,
created a question with more than 5,000 views.
For Read Group In Sam/Bam Files: What Do They Exactly Describe?
Teacher
5.9 years ago,
created an answer with at least 3 up-votes.
For A: Gatk Pipeline: Markduplicates At The End ?
Appreciated
5.9 years ago,
created a post with more than 5 votes.
For A: Genomic Read Mapping Biased Towards Coding Regions?
Teacher
5.9 years ago,
created an answer with at least 3 up-votes.
For A: How To Handle Replicates With Huge Differences In Number Of Reads?
Student
5.9 years ago,
asked a question with at least 3 up-votes.
For Read Group In Sam/Bam Files: What Do They Exactly Describe?
Great Question
5.9 years ago,
created a question with more than 5,000 views.
For How To Produce Simulated 'Synthetic' Sequences
Great Question
6.2 years ago,
created a question with more than 5,000 views.
For Read Group In Sam/Bam Files: What Do They Exactly Describe?
Commentator
6.5 years ago,
created a comment with at least 3 up-votes.
For C: What'S "Allelic Ratio"? Can It Equal 1 In Normal Cells?
Good Answer
6.5 years ago,
created an answer that was upvoted at least 5 times.
For A: Communicating With Non-Computational Biologists
Appreciated
6.7 years ago,
created a post with more than 5 votes.
For A: What'S "Allelic Ratio"? Can It Equal 1 In Normal Cells?
Teacher
6.7 years ago,
created an answer with at least 3 up-votes.
For A: Best Practices For Pipeline Versioning
Appreciated
6.8 years ago,
created a post with more than 5 votes.
For A: Best Way To Merge A Many Thousand Small Bam Files Into One Big Bam File?
Appreciated
6.8 years ago,
created a post with more than 5 votes.
For A: Communicating With Non-Computational Biologists
Appreciated
6.8 years ago,
created a post with more than 5 votes.
For A: Normal Distribution Of (Log) Microarray Results
Appreciated
6.8 years ago,
created a post with more than 5 votes.
For A: Genomic Read Mapping Biased Towards Coding Regions?
Appreciated
6.8 years ago,
created a post with more than 5 votes.
For C: Is There Interest In An Ios Bioinformatics "Toolkit"?
Appreciated
6.8 years ago,
created a post with more than 5 votes.
For A: What'S "Allelic Ratio"? Can It Equal 1 In Normal Cells?
Use of this site constitutes acceptance of our User
Agreement
and Privacy
Policy.
Powered by Biostar
version 2.3.0
Traffic: 1968 users visited in the last hour