Question: Is the dbSNP ROD File Still Useful?
6.8 years ago by
mylons130
Boston, MA
mylons130 wrote:

I've been trying to find the documentation on generating a dbSNP.rod file. It used to be part of the GATK's realignment workflows and was documented on the GATK wiki, but that site has been taken down because of their new licensing.

I also stumbled upon this, http://jianyingli18.wordpress.com/2011/04/04/gatk-base-calling-score-recalibration-first-task-on-duke-campus/ , which seems to imply they're not necessary anymore.

Can someone straighten me out?

gatk dbsnp • 2.6k views
6.8 years ago by
Liye Zhang80
United States
Liye Zhang80 wrote:

Hi,

  The rod file was used in older versions of GATK. If you are using the current version (1.6; I haven't tried 2.0 yet, so I can't speak for it), I do not think you need the rod file.
  Instead, GATK now takes the VCF format directly, as long as your VCF file has the same chromosome order as your BAM file. Here is an example with the Unified Genotyper (--dbsnp dbSNP.vcf replaces the rod file).

java -jar GenomeAnalysisTK.jar \
    -R resources/Homo_sapiens_assembly18.fasta \
    -T UnifiedGenotyper \
    -I sample1.bam [-I sample2.bam ...] \
    --dbsnp dbSNP.vcf \
    -o snps.raw.vcf \
    -stand_call_conf [50.0] \
    -stand_emit_conf 10.0 \
    -dcov [50] \
    [-L targets.interval_list]
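
The chromosome-order requirement mentioned above is easy to trip over, so here is a rough Python sketch of one way to check it before running GATK. This is not part of GATK: the function names are made up for illustration, and it assumes you have the BAM header as text (for example from `samtools view -H sample1.bam`) and an uncompressed, tab-separated VCF.

```python
def sam_header_contig_order(header_lines):
    """Return contig names in the order of the @SQ lines of a SAM/BAM text header."""
    order = []
    for line in header_lines:
        if line.startswith("@SQ"):
            for field in line.rstrip("\n").split("\t")[1:]:
                if field.startswith("SN:"):
                    order.append(field[3:])
    return order


def vcf_contig_order(vcf_lines):
    """Return contig names in order of first appearance among the VCF records."""
    order = []
    for line in vcf_lines:
        # Skip meta-information lines, the #CHROM header line, and blank lines.
        if line.startswith("#") or not line.strip():
            continue
        chrom = line.split("\t", 1)[0]
        if chrom not in order:
            order.append(chrom)
    return order


def is_consistent_order(vcf_contigs, bam_contigs):
    """True if every VCF contig is in the BAM header and the relative order matches."""
    rank = {name: i for i, name in enumerate(bam_contigs)}
    if any(name not in rank for name in vcf_contigs):
        return False  # e.g. "1" in the VCF versus "chr1" in the BAM
    ranks = [rank[name] for name in vcf_contigs]
    return ranks == sorted(ranks)
```

With the header saved to a file (say `header.sam`, a hypothetical name), you would pass `open("header.sam")` and `open("dbSNP.vcf")` to the first two functions and feed the results to `is_consistent_order`. The check only looks at contig names and their order, not at positions within a contig.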

Hope this will clarify your questions.


Yes it did. I also finally found this documentation on the portion I was about to use: http://www.broadinstitute.org/gsa/gatkdocs/release/org_broadinstitute_sting_gatk_walkers_indels_RealignerTargetCreator.html

They state this in there: --known / -known (List[RodBinding[VariantContext]] with default value [])

Input VCF file with known indels. Any number of VCF files representing known SNPs and/or indels. Could be e.g. dbSNP and/or official 1000 Genomes indel calls. SNPs in these files will be ignored unless the --mismatchFraction argument is used. --known binds reference ordered data. This argument supports ROD files of the following types: VCF, VCF3
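
Since the documentation says any number of VCF files can be given, here is a small Python sketch of how the command line could be assembled with a repeated --known flag. The helper name and the file names are hypothetical; the flags (-T, -R, -I, -o, --known) are the ones from the GATK documentation quoted above and from the Unified Genotyper example, and I'm assuming the same GenomeAnalysisTK.jar invocation style.

```python
def realigner_target_creator_cmd(reference, bam, out_intervals,
                                 known_vcfs=(), jar="GenomeAnalysisTK.jar"):
    """Build an argument list for RealignerTargetCreator (illustrative helper)."""
    cmd = [
        "java", "-jar", jar,
        "-T", "RealignerTargetCreator",
        "-R", reference,
        "-I", bam,
        "-o", out_intervals,
    ]
    # --known may be repeated: one flag per VCF of known indels/SNPs.
    for vcf in known_vcfs:
        cmd += ["--known", vcf]
    return cmd


# Hypothetical file names, for illustration only.
cmd = realigner_target_creator_cmd(
    "ref.fasta", "sample1.bam", "realign.intervals",
    known_vcfs=["dbSNP.vcf", "1000G_indels.vcf"],
)
print(" ".join(cmd))
```

The list form can be handed to `subprocess.run(cmd, check=True)` as-is, which avoids shell quoting problems with paths.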

I didn't realize a VCF counts as a ROD (reference-ordered data) file. That was essentially my mistake. Thanks!

written 6.8 years ago by mylons130