Next Gen Sequencing Indel Realignment With The Genome Analysis Toolkit (Gatk)
1
3
Entering edit mode
12.9 years ago
Travis ★ 2.8k

As I continue to add steps to my SNP/Indel discovery workflow, the latest recommendation is local realignment around Indels using GATK following the initial alignment step. I have just commenced the step which generates the target intervals for realignment (RealignerTargetCreator) and it looks like it will take an hour to complete, with the realignment still required after that. My test data set is a single sample of approx 5 million paired end 100bp reads.

For an upcoming project, my plan is to run 150 similarly sized samples. Therefore the addition of such time-consuming steps will have a major impact on timelines. Can anyone with experience in this area comment on the time required for Indel realignment vs the benefits received? Is it worth it?

alignment indel snp gatk next-gen sequencing • 5.1k views
ADD COMMENT
3
Entering edit mode
12.9 years ago
Ian 6.0k

Read realignment certainly appears to be an essential step. Do you have access to a computer cluster? Working through the samples in parallel batches would certainly be less painful.

As an aside, I have had a good experience using Nils Homer's SRMA package. Perhaps you could do a time trial. SRMA just works over the whole batch of reads, so could be faster?

ADD COMMENT
0
Entering edit mode

I'll give SRMA a go. The GATK RealignerTargetCreator didn't generate a file in the end-up. Not sure what the problem is...

ADD REPLY
0
Entering edit mode

Also - I do have access to a cluster. I was hoping to avoid home-brew parallelization but it's looking increasingly necessary!

ADD REPLY
0
Entering edit mode

Why is it home brew :) I might misunderstand you, but running say 10 separate jobs in parallel seems efficient to me.

ADD REPLY
0
Entering edit mode

I guess I just mean that the program doesn't do it all automatically for me. Maybe I'm just lazy :)

ADD REPLY

Login before adding your answer.

Traffic: 1940 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6