Moderator: Stefano Berri

gravatar for Stefano Berri
Stefano Berri3.9k
Reputation:
3,870
Status:
Trusted
Location:
Cambridge, UK
Website:
http://www.stefanoberr...
Twitter:
@5tefano8erri
Scholar ID:
Google Scholar Page
Last seen:
1 month, 1 week ago
Joined:
7 years, 4 months ago
Email:
s*****@illumina.com

Trained in molecular biology I slowly moved to the computational side and data anlysis of genomic data. I am now a bioinformatics scientist working at Illumina within the "Population and Medical Genomics" group. My interests are in genomics, molecular biology, cancer biology, tumour evolution and reproducible science. I regularly use R/Bioconductor, Python, SQL I know a thing or two about copy number abnormalities in cancer and I develop tools to analyse Next Generation Sequencing data.

Posts by Stefano Berri

<prev • 123 results • page 1 of 13 • next >
0
votes
1
answer
277
views
1
answers
Comment: C: CWL: Loops and if conditionals
... I am a beginner with cwl, but here my approach to conditional events. Make the workflow linear, like this: Inspect data | Preprocess-Required? | Pre-process data | Process Data L'et's say that Preprocess-Required spit out a file with "YES" or "NO" Then preprocess data read that file ...
written 4 months ago by Stefano Berri3.9k
0
votes
1
answer
328
views
1
answers
Comment: C: cwltool SGE support for CWL parallel tasks
... I am getting this message: "CWL Directory inputs not yet supported in Toil" ...
written 4 months ago by Stefano Berri3.9k
3
votes
1
answer
328
views
5 follow
1
answer
cwltool SGE support for CWL parallel tasks
... I am experimenting with CWL and cwltool. I have successfully written a workflow that can scatter jobs. However, cwltool runs them sequencially, whereas I would like to to use the Sun Grid Engine (SGE) to run jobs in parallel. I looked into Toil to run it but I have found it not capable of running th ...
toil cwl sge written 4 months ago by Stefano Berri3.9k • updated 11 weeks ago by Biostar ♦♦ 20
0
votes
0
answers
1.2k
views
0
answers
Comment: C: CNV from FFPE genomes
... Can I ask you what tools you used, what is the typical coverage, if you use a match normal (and if so, if you use same library prep) and if the tools you used perform GC correction. These are the first relevant questions that pop in my mind... ...
written 3.4 years ago by Stefano Berri3.9k
5
votes
11
answers
2.1k
views
11
answers
Answer: A: Is there anyone who was a pure biologist?
... Hi. I think there are quite few "pure biologist" that turned to the darks side of bioinformatics, and I am one of them. I didn't have any formal training in programming/informatics till the end of my Ph.D. in molecural biology. For sure, all you knowledge in biology is not wasted. It is an asset. ...
written 3.4 years ago by Stefano Berri3.9k
0
votes
1
answer
1.4k
views
1
answers
Comment: C: Recommended Approach For Copy Number Analysis In Non-Human Organisms
... 50M reads is quite a lot, actually. In CNAnorm all windows are equally sized. If you set 10Kbp windows, you would get an average of 170 reads per window. Which is plenty. From a quick count, 85% of exons are less than 10Kbp apart, and 93% less than 25kbp apart, so most of your windows will have som ...
written 3.5 years ago by Stefano Berri3.9k
0
votes
1
answer
1.4k
views
1
answers
Comment: C: Recommended Approach For Copy Number Analysis In Non-Human Organisms
... Hi. exome is a bit more tricky becose is uneven, but as a rule of thumb, try to have, as average, 50 reads per window. In gene rich regions you will have more, in gene poor a bit less. HOw many reads do you have in total? Good luck. Stefano ...
written 3.5 years ago by Stefano Berri3.9k
1
vote
1
answer
1.4k
views
1
answers
Answer: A: Recommended Approach For Copy Number Analysis In Non-Human Organisms
... Hi. You don't need high coverage data for copy number detection at the resolution of arrays. You could use CNAnorm. It is designed to detect somatic CNA from low coverage genomic data (2 Million reads would be enough, very affordable if you multiplex on a run) and it does not assumes a particular r ...
written 3.5 years ago by Stefano Berri3.9k
1
vote
4
answers
3.0k
views
4
answers
Comment: C: What Does Distinguish Header From Content In Bed Files?
... thanks, if that is the case, then it's not too hard. I haven't found a documents that "officially" states something like that, though... ...
written 3.7 years ago by Stefano Berri3.9k
0
votes
4
answers
3.0k
views
4
answers
Comment: C: What Does Distinguish Header From Content In Bed Files?
... yes, that might work actually... column 2 and 3 are always number in the content, but never in browser or track... ...
written 3.7 years ago by Stefano Berri3.9k

Latest awards to Stefano Berri

Librarian 10 weeks ago, created a post with more than 10 bookmarks. For Read Group In Sam/Bam Files: What Do They Exactly Describe?
Popular Question 2.6 years ago, created a question with more than 1,000 views. For Read Group In Sam/Bam Files: What Do They Exactly Describe?
Teacher 2.6 years ago, created an answer with at least 3 up-votes. For A: Gatk Pipeline: Markduplicates At The End ?
Great Question 2.6 years ago, created a question with more than 5,000 views. For Read Group In Sam/Bam Files: What Do They Exactly Describe?
Appreciated 2.6 years ago, created a post with more than 5 votes. For A: Normal Distribution Of (Log) Microarray Results
Good Answer 2.6 years ago, created an answer that was upvoted at least 5 times. For A: Creating A New Bioinformatics Unit
Teacher 2.6 years ago, created an answer with at least 3 up-votes. For A: How To Handle Replicates With Huge Differences In Number Of Reads?
Student 2.6 years ago, asked a question with at least 3 up-votes. For Read Group In Sam/Bam Files: What Do They Exactly Describe?
Epic Question 2.6 years ago, created a question with more than 10,000 views. For Read Group In Sam/Bam Files: What Do They Exactly Describe?
Great Question 2.6 years ago, created a question with more than 5,000 views. For How To Produce Simulated 'Synthetic' Sequences
Appreciated 2.6 years ago, created a post with more than 5 votes. For A: Creating A New Bioinformatics Unit
Appreciated 2.6 years ago, created a post with more than 5 votes. For A: Genomic Read Mapping Biased Towards Coding Regions?
Appreciated 2.6 years ago, created a post with more than 5 votes. For A: Find A Substring And Count The Number Of Occurrences
Prophet 2.6 years ago, created a post with more than 20 followers. For Read Group In Sam/Bam Files: What Do They Exactly Describe?
Great Question 2.9 years ago, created a question with more than 5,000 views. For Read Group In Sam/Bam Files: What Do They Exactly Describe?
Good Answer 3.2 years ago, created an answer that was upvoted at least 5 times. For A: Communicating With Non-Computational Biologists
Commentator 3.2 years ago, created a comment with at least 3 up-votes. For C: What'S "Allelic Ratio"? Can It Equal 1 In Normal Cells?
Appreciated 3.3 years ago, created a post with more than 5 votes. For A: What'S "Allelic Ratio"? Can It Equal 1 In Normal Cells?
Teacher 3.4 years ago, created an answer with at least 3 up-votes. For A: Best Practices For Pipeline Versioning
Teacher 3.5 years ago, created an answer with at least 3 up-votes. For A: Determine Whether A Gene Product Is A Transcription Factor
Guru 3.5 years ago, received more than 100 upvotes.
Teacher 3.5 years ago, created an answer with at least 3 up-votes. For A: Technical/Biological Replicates In Rna-Seq For Two Cell Lines
Centurion 3.5 years ago, created 100 posts.
Voter 3.5 years ago, voted more than 100 times.
Teacher 3.5 years ago, created an answer with at least 3 up-votes. For A: List Of Freely Available Tool/Software To Generate Rooted Phylogeneitc Tree

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 1474 users visited in the last hour