User: Carlos Borroto

Carlos Borroto (1.6k)
Reputation: 1,620
Status: Trusted
Location: Washington Metropolitan Area
Scholar ID: Google Scholar Page
Last seen: 1 week, 6 days ago
Joined: 6 years, 11 months ago
Email: c*************@gmail.com

Master's in Bioinformatics with a strong focus on computational genomics and gene expression analysis. Bioinformatics Analyst with over seven years of experience in international and domestic laboratory environments. Extensive experience in computational applications for laboratory data analyses.

Specialties: Bioinformatics, UNIX/Linux administration, Molecular Biology, Cell Biology.

Posts by Carlos Borroto

83 results • page 1 of 9
1 vote • 0 answers • 127 views
Is there a good way to test CWL tools and workflows?
... I'm getting started writing CWL tools and workflows. I would like to get a good testing framework setup from the beginning. Is there something that can help testing CWL? How are you testing CWL? I saw [cwltest][1] exists. However, the documentation is very sparse. I haven't figured out what should ...
cwl • written 19 days ago by Carlos Borroto (1.6k) • updated 9 days ago by Biostar ♦♦ 20
1 vote • 1 answer • 190 views
Answer: A: VCF Normalisation is required ?
... 1) Yes, you do need to normalize the VCF; however, the tools below will help you with that. 2) The main tools I know of are Illumina's [hap.py][1] and Real Time Genomics' [vcfeval][2]. Both are tools recommended by the precisionFDA [Truth Challenge][3]. [1]: https://github.com/Illumina ...
written 11 weeks ago by Carlos Borroto (1.6k)
0 votes • 2 answers • 2.4k views
Answer: A: Problem Downloading The Nt Database From Ncbi
... In case someone finds this thread while searching for this issue, you can ask update_blastdb to use passive mode. $ update_blastdb --passive --decompress nt ...
written 5 months ago by Carlos Borroto (1.6k)
1 vote • 2 answers • 5.1k views
Answer: A: Difference Between Picards Mergebamalignment And Mergesamfiles
... Long detailed explanation from the [GATK forum][1]. > 3C. Restore altered data and apply & adjust meta information using MergeBamAlignment >MergeBamAlignment is a beast of a tool, so its introduction is longer. It does more than is implied by its name. Explaining these features requires ...
written 11 months ago by Carlos Borroto (1.6k)
0 votes • 1 answer • 4.1k views
Comment: C: Introducing Clumpify: Create 30% Smaller, Faster Gzipped Fastq Files. And remov
... How do you think this will affect reads coming from two almost identical regions but for a few bases? For example SMN1/SMN2 have regions that are only differentiated by 1 base. Would `clumpify.sh` with the recommended allowed subs parameter remove non-duplicated reads for these two regions? --Carlo ...
written 14 months ago by Carlos Borroto (1.6k)
1 vote • 0 answers • 988 views
Comment: C: Rsync and python for ftp.ncbi
... I think you are falling into the trap of the [XY problem][1]. Please tell us more about what you are trying to do. Even without the details of what you really want to do, I doubt you need to get Python involved here; most probably rsync by itself can do it. [1]: http://meta.stackexchange.com/quest ...
written 22 months ago by Carlos Borroto (1.6k)
2 votes • 3 answers • 2.0k views
Answer: A: How to calculate average coverage for all genes
... I recently found myself looking for the exact info. I too decided to go with bedtools. This is my final one-liner, assuming a pre-sorted BAM file. bedtools coverage -sorted -d -g human_g1k_v37.genome -a genes.bed -b my_sorted.bam \ | sort -k4,4 \ | bedtools groupby -g 4 -c 8 -o mean ...
written 23 months ago by Carlos Borroto (1.6k)
1 vote • 2 answers • 2.2k views
Answer: A: Human GRCh38 and dbSNP VCF and GATK
... Your problem seems to be that you are using a dbSNP VCF with GRCh37 coordinates. The FTP location Pierre links in his answer has a GRCh38 version that should solve your problem. However, you should know the current GATK version is not "alt" aware yet. While mappers like BWA are and can correctly map read ...
written 23 months ago by Carlos Borroto (1.6k)
2 votes • 1 answer • 736 views
Answer: A: How do I get the lengths of different regions targeted by the Nextera expanded r
... You can use a GTF with annotated intervals for the categories you want. For example, this [one][1] from Ensembl. Use something like Unix grep to create files for the categories you are interested in. Then use [bedtools intersect][2] to get the regions of your BED file overlapping with each category GTF ...
written 23 months ago by Carlos Borroto (1.6k)
0 votes • 1 answer • 1.0k views
Comment: C: UMI for Exome Target Sequencing
... I'm currently looking for how to do phasing with targeted sequencing data. I wonder if that's the reason you are looking into adding UMI to an exome analysis. I'll be highly interested in seeing if there are any valid answers to your question. ...
written 2.0 years ago by Carlos Borroto (1.6k)

Latest awards to Carlos Borroto

Good Answer 11 weeks ago, created an answer that was upvoted at least 5 times. For A: Scientific Names In Blast Output And Databases
Popular Question 3 months ago, created a question with more than 1,000 views. For Is There A Way To Start From 454'S ".Fna" And ".Qual" Files In Gkno?
Popular Question 4 months ago, created a question with more than 1,000 views. For Is There A Way To Start From 454'S ".Fna" And ".Qual" Files In Gkno?
Appreciated 5 months ago, created a post with more than 5 votes. For A: FreeBayes vs GATK's HaplotypeCaller
Scholar 9 months ago, created an answer that has been accepted. For A: Looking for a working copy of Picard metric definitions page
Epic Question 12 months ago, created a question with more than 10,000 views. For Scientific Names In Blast Output And Databases
Epic Question 16 months ago, created a question with more than 10,000 views. For Scientific Names In Blast Output And Databases
Gold Standard 21 months ago, created a post with more than 25 bookmarks. For Which Bioinformatic Friendly Pipeline Building Framework?
Teacher 23 months ago, created an answer with at least 3 up-votes. For A: Downloading multiple species from ftp.ncbi.nih.gov using wget and wildcards
Appreciated 23 months ago, created a post with more than 5 votes. For A: Scientific Names In Blast Output And Databases
Scholar 23 months ago, created an answer that has been accepted. For A: Looking for a working copy of Picard metric definitions page
Great Question 2.1 years ago, created a question with more than 5,000 views. For Which Bioinformatic Friendly Pipeline Building Framework?
Scholar 2.1 years ago, created an answer that has been accepted. For A: Looking for a working copy of Picard metric definitions page
Good Question 2.1 years ago, asked a question that was upvoted at least 5 times. For Scientific Names In Blast Output And Databases
Popular Question 2.2 years ago, created a question with more than 1,000 views. For Is There A Way To Start From 454'S ".Fna" And ".Qual" Files In Gkno?
Guru 2.2 years ago, received more than 100 upvotes.
Scholar 2.2 years ago, created an answer that has been accepted. For A: Looking for a working copy of Picard metric definitions page
Teacher 2.2 years ago, created an answer with at least 3 up-votes. For A: Downloading multiple species from ftp.ncbi.nih.gov using wget and wildcards
Appreciated 2.3 years ago, created a post with more than 5 votes. For A: Scientific Names In Blast Output And Databases
Teacher 2.4 years ago, created an answer with at least 3 up-votes. For A: Downloading multiple species from ftp.ncbi.nih.gov using wget and wildcards
Great Question 2.5 years ago, created a question with more than 5,000 views. For Scientific Names In Blast Output And Databases
Teacher 2.6 years ago, created an answer with at least 3 up-votes. For A: Downloading multiple species from ftp.ncbi.nih.gov using wget and wildcards
Popular Question 2.8 years ago, created a question with more than 1,000 views. For Is There A Way To Start From 454'S ".Fna" And ".Qual" Files In Gkno?
Appreciated 2.8 years ago, created a post with more than 5 votes. For A: Scientific Names In Blast Output And Databases
