Question: Questions : Data management of whole exome sequencing
gravatar for mangfu100
5.4 years ago by
Korea, Republic Of
mangfu100730 wrote:

Greeting All. 

My post is about procedure of how to pre-process exome sequence files before variant calling.

I used bwa and bowtie each for increasing accuracy of my results and reducing false positives.

And then, next major steps are read duplicate removal, indel realignment and base quality score


However, before stepping to read duplicate removal process, I heard that there is another minor step called 'AddOrReplaceReadGroup'.

Is it okay to ignore this minor preprocess ? Or, ignoring this step will be resulting different variations?

I think that preprocessing is very important in detecting variants accurately. Therefore I ask a post in this forum.



sequencing next-gen genome • 1.5k views
ADD COMMENTlink modified 5.4 years ago by geek_y11k • written 5.4 years ago by mangfu100730
gravatar for geek_y
5.4 years ago by
geek_y11k wrote:

The read group information is necessary for variant calling with GATK. Either you can append this information while aligning or later using picard tool.


This tutorial might be helpful for you.  Tutorial (How to analyze) on Whole Exome sequencing. Common Errors. Best Practices.

ADD COMMENTlink written 5.4 years ago by geek_y11k
Please log in to add an answer.


Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 911 users visited in the last hour