Question: GATK dbSNP for Ensembl
5.8 years ago, bharata1803 wrote:


I want to call SNPs from exome sequencing data, and I found really good documentation on SEQanswers. I have one question about the quality score recalibration step in GATK, which uses dbSNP. The tutorial uses dbSNP data from UCSC. I currently use Ensembl GRCh38 as my reference genome, so my question is: can I use the UCSC dbSNP file with reads aligned to the Ensembl reference? I also checked the Ensembl FTP and found this link. So which one should I use, given that the tutorial uses a txt file from UCSC (I checked UCSC and the file type is still txt)? Thank you for your answer.

ensembl snp gatk • 2.5k views
modified 2.5 years ago by zx8754 • written 5.8 years ago by bharata1803
5.8 years ago, Max Ivon (Russian Federation) wrote:

You should use this file if your reads are aligned to the GRCh38 version of the genome. But I'm not sure you are following the right guide for score recalibration. According to this post, the CountCovariates tool is no longer supported by GATK. For base quality recalibration it is recommended to use BaseRecalibrator, and after SNP calling on exome (or WGS) data it is recommended to perform variant quality score recalibration with VariantRecalibrator (not with VariantFiltration, as SEQanswers says). You can find the documentation directly on the GATK site, and it is really good.

modified 5.8 years ago • written 5.8 years ago by Max Ivon
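One practical reason the source of the dbSNP file matters is contig naming: UCSC files name chromosomes `chr1`, `chr2`, and so on, while Ensembl references use `1`, `2`, and so on, and GATK expects the known-sites file and the reference to agree. The following is a minimal Python sketch of a sanity check, assuming the known-sites file is a VCF and the reference has a `.fai` index. The function names and the toy records are hypothetical and only illustrate the idea.

```python
def vcf_contigs(vcf_lines):
    """Collect contig names from the first (CHROM) column of VCF data lines."""
    names = set()
    for line in vcf_lines:
        # Skip header lines ("##..." and "#CHROM...") and blank lines.
        if line.startswith("#") or not line.strip():
            continue
        names.add(line.split("\t", 1)[0])
    return names


def fai_contigs(fai_lines):
    """Collect contig names from the first column of a FASTA index (.fai)."""
    return {line.split("\t", 1)[0] for line in fai_lines if line.strip()}


def missing_from_reference(vcf_lines, fai_lines):
    """Return the VCF contig names that do not exist in the reference index."""
    return sorted(vcf_contigs(vcf_lines) - fai_contigs(fai_lines))


# Toy records (hypothetical values, not real dbSNP entries).
ucsc_style_vcf = [
    "##fileformat=VCFv4.1",
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO",
    "chr1\t100\trsA\tA\tC\t.\t.\t.",
    "chr2\t200\trsB\tG\tT\t.\t.\t.",
]
ensembl_style_vcf = [
    "##fileformat=VCFv4.1",
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO",
    "1\t100\trsA\tA\tC\t.\t.\t.",
    "2\t200\trsB\tG\tT\t.\t.\t.",
]
# Toy .fai lines: name, length, offset, line bases, line width.
ensembl_style_fai = [
    "1\t1000\t3\t60\t61",
    "2\t2000\t1023\t60\t61",
]

ucsc_missing = missing_from_reference(ucsc_style_vcf, ensembl_style_fai)
ensembl_missing = missing_from_reference(ensembl_style_vcf, ensembl_style_fai)
print("UCSC-style contigs missing from reference:", ucsc_missing)
print("Ensembl-style contigs missing from reference:", ensembl_missing)
```

With real files you would pass open file handles instead of the toy lists. An empty result suggests the naming schemes agree, but it does not prove the two files describe the same assembly.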

Thank you. I'm using it right now, but I want to ask about something. The description of BaseRecalibrator says: "This tool is designed to work as the first pass in a two-pass processing step." So what is the second pass? I cannot find the second step, and I checked that the CountCovariates and TableRecalibration tools from the SEQanswers tutorial no longer exist.

written 5.8 years ago by bharata1803

This may be useful. First you use BaseRecalibrator, which generates a .grp table, and then you use PrintReads with the -BQSR argument.

written 5.8 years ago by Max Ivon
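The two passes described in the reply can be sketched as a pair of command lines. The Python below only builds and prints the commands; it does not run GATK. The flags (`-T`, `-R`, `-I`, `-knownSites`, `-BQSR`, `-o`) follow the GATK 2.x/3.x command-line style that was current for this thread, so check the documentation for your version before relying on them. The jar path and all file names are hypothetical placeholders.

```python
import shlex

# Hypothetical location of the GATK jar; adjust to your installation.
GATK_JAR = "GenomeAnalysisTK.jar"


def base_recalibrator_cmd(reference, bam, known_sites, table):
    """First pass: model the recalibration and write the .grp table."""
    return [
        "java", "-jar", GATK_JAR,
        "-T", "BaseRecalibrator",
        "-R", reference,
        "-I", bam,
        "-knownSites", known_sites,
        "-o", table,
    ]


def print_reads_cmd(reference, bam, table, out_bam):
    """Second pass: apply the table with -BQSR and write a recalibrated BAM."""
    return [
        "java", "-jar", GATK_JAR,
        "-T", "PrintReads",
        "-R", reference,
        "-I", bam,
        "-BQSR", table,
        "-o", out_bam,
    ]


def as_shell(cmd):
    """Render an argument list as a copy-pasteable shell command."""
    return " ".join(shlex.quote(arg) for arg in cmd)


# Hypothetical file names, for illustration only.
first_pass = base_recalibrator_cmd(
    "GRCh38.fa", "sample.bam", "dbsnp.vcf", "recal_data.grp"
)
second_pass = print_reads_cmd(
    "GRCh38.fa", "sample.bam", "recal_data.grp", "sample.recal.bam"
)

print(as_shell(first_pass))
print(as_shell(second_pass))
```

The table written by the first command is the only link between the two passes, which is why the same `.grp` path appears as `-o` in the first command and as `-BQSR` in the second.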