Difficulty in using GATK for SNP and indel analysis
2
1
Entering edit mode
8.1 years ago

Hi,

I am new to Bioinformatics.I am trying to use GATK tool for finding SNP and indels but the problem is that the documentation seems to be complex for a beginner like me with so many tools to start with including IndelRealigner,MuTect,Haplotypecaller..The more I read the more confused I am getting.I also tried using Indel religner which is the first step in the process of identification of indels(correct me if wrong) but I got an error saying that you don't have the intervals file in the targetIntervals in the following command.

java -jar GenomeAnalysisTK.jar \ -T IndelRealigner \ -R reference.fasta \ -I input.bam \ -known indels.vcf \ -targetIntervals intervalListFromRTC.intervals \ -o realignedBam.bam

The error message is:Could not read file /home/aditya/Bioinfotools/gatk/intervalListFromRTC.intervals because The interval file does not exist. I don't know where to get the intervals file from.

But please guide me overall in understanding GATk.

SNP indel GATK • 2.2k views
ADD COMMENT
0
Entering edit mode

Thanks decosterwouter and igor.I read best practices,I am getting the confidence to proceed further.

ADD REPLY
1
Entering edit mode

Glad to help and good luck.

Upvoting our answer will make us happy ;)

ADD REPLY
2
Entering edit mode
8.1 years ago

There is a step by step guide: GATK best practices https://www.broadinstitute.org/gatk/guide/best-practices.php

ADD COMMENT
1
Entering edit mode
8.1 years ago
igor 13k

Realignment is a two-step process using RealignerTargetCreator and then IndelRealigner. See detailed instructions here: https://www.broadinstitute.org/gatk/guide/article?id=38

The rest of the best practices tutorial will be helpful too.

ADD COMMENT

Login before adding your answer.

Traffic: 2478 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6