Question: Data management of whole exome sequencing
9.3 years ago
mangfu100 ▴ 800

Greetings, all.

My question is about how to pre-process exome sequencing files before variant calling.

I aligned the reads with both bwa and bowtie to improve the accuracy of my results and reduce false positives.

The next major steps are duplicate read removal, indel realignment, and base quality score recalibration.

However, I have heard that there is another minor step before duplicate removal, called 'AddOrReplaceReadGroups'.

Is it okay to skip this step, or will skipping it change the variants that are called?

I think preprocessing is very important for detecting variants accurately, which is why I am asking in this forum.

next-gen-sequencing genome

9.3 years ago

Read group information is required for variant calling with GATK. You can add it either during alignment or afterwards with Picard.
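To make the two options concrete, here is a minimal Python sketch that only builds and prints the command lines; it does not run any tool. It assumes bwa mem's `-R` option for the alignment route and Picard's AddOrReplaceReadGroups with the legacy `KEY=VALUE` argument style for the post-alignment route. All file names, the `picard.jar` path, and the read group values (`rg1`, `sampleA`, `lib1`, `unit1`) are hypothetical placeholders, and the helper functions are illustrative, not part of either tool.

```python
import shlex


def read_group_line(rg_id, sample, library, platform="ILLUMINA", unit="unit1"):
    """Build an @RG header line for `bwa mem -R`.

    The separators are a literal backslash followed by 't'; bwa converts
    them to real TAB characters when it writes the SAM header.
    """
    fields = [f"ID:{rg_id}", f"SM:{sample}", f"LB:{library}",
              f"PL:{platform}", f"PU:{unit}"]
    return "@RG\\t" + "\\t".join(fields)


def bwa_mem_command(reference, fastq1, fastq2, rg_line):
    """Option 1: attach the read group while aligning."""
    return ["bwa", "mem", "-R", rg_line, reference, fastq1, fastq2]


def picard_add_rg_command(in_bam, out_bam, rg_id, sample, library,
                          platform="ILLUMINA", unit="unit1"):
    """Option 2: add or replace the read group on an existing BAM."""
    return ["java", "-jar", "picard.jar", "AddOrReplaceReadGroups",
            f"I={in_bam}", f"O={out_bam}",
            f"RGID={rg_id}", f"RGLB={library}", f"RGPL={platform}",
            f"RGPU={unit}", f"RGSM={sample}"]


def has_read_group(sam_header_text):
    """Return True if a SAM header (as text) already contains an @RG line."""
    return any(line.startswith("@RG\t") for line in sam_header_text.splitlines())


# Hypothetical inputs, for illustration only.
rg = read_group_line("rg1", "sampleA", "lib1")
print(shlex.join(bwa_mem_command("ref.fa", "reads_1.fq", "reads_2.fq", rg)))
print(shlex.join(picard_add_rg_command("aligned.bam", "aligned.rg.bam",
                                       "rg1", "sampleA", "lib1")))
```

The `has_read_group` helper can be applied to header text (for example, the output of `samtools view -H`) to check whether a BAM already carries an @RG line before deciding whether the Picard step is needed.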

This tutorial might be helpful for you: "Tutorial (How to analyze) on Whole Exome sequencing. Common Errors. Best Practices."
