Question: variant calling after mapping via bwa mem
3.3 years ago by
reza (220) wrote:

After mapping to the reference with "bwa mem", the downstream analyses in my project are variant calling with samtools and CNV detection with CNV-seq. In your opinion, are the default bwa settings appropriate for this goal? Must I use -M in "bwa mem"? (I want to mark duplicates with Picard after mapping to the reference.)

This project is my first NGS analysis, and I would appreciate your kind help.

snp alignment next-gen • 1.6k views
modified 3.2 years ago by kirannbishwa01 (1.2k) • written 3.3 years ago by reza (220)

You need to learn how to use Google. This is not exactly rocket science, and you can definitely find guidance on the internet, for example, this htslib workflow page and this samtools page. If you have more specific questions, you can definitely ask them here.

written 3.3 years ago by WouterDeCoster (43k)

I can use Google, and I know where to find the program manuals. I am learning bioinformatics, and I need the experience of bioinformaticians more than the manuals. There are people here who, regardless of the question or the questioner's level, simply try to answer and help. (Forgive my weak English; it is not my native language.)

modified 3.3 years ago • written 3.3 years ago by reza (220)

So instead of going through a manual, you prefer that someone here spends time typing it out for you again. That's not how Biostars works.

written 3.3 years ago by WouterDeCoster (43k)

This is a "blinded" question: you are not telling us anything about your experiment or goals. What is the organism? What sequencing platform did you use? What depth did you sequence to? Was the library PCR-amplified? It's not really possible to help with no knowledge of the situation.

written 3.2 years ago by Brian Bushnell (17k)
3.2 years ago by
USA, California, Bay Area
dyollluap (300) wrote:

For most standard bwa alignments it is best to run with the default bwa settings, unless you have a specific requirement and understand the parameters you're tinkering with.
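A minimal sketch of what a default-settings run could look like, with `-M` as the only alignment option changed. The bwa manual describes `-M` as marking shorter split hits as secondary, for Picard compatibility. Everything else here is an assumption for illustration: the file names, the sample name, paired-end input, the `-R` read-group string, and the availability of a `picard` wrapper script on the PATH (some installs use `java -jar picard.jar` instead). The script only builds and prints the command lines; it does not run them.

```python
import shlex


def build_pipeline(ref, fq1, fq2, sample, threads=4):
    """Build shell command strings for alignment and duplicate marking.

    All file and sample names are hypothetical placeholders.
    """
    # Read group with literal "\t" separators, which bwa expands itself.
    read_group = f"@RG\\tID:{sample}\\tSM:{sample}"

    align = [
        "bwa", "mem",
        "-M",                 # mark shorter split hits as secondary
        "-t", str(threads),   # number of threads
        "-R", read_group,     # read-group header line
        ref, fq1, fq2,
    ]
    # Coordinate-sort the alignments read from stdin ("-").
    sort = ["samtools", "sort", "-o", f"{sample}.sorted.bam", "-"]

    # Assumes a `picard` wrapper script; classic KEY=value argument style.
    markdup = [
        "picard", "MarkDuplicates",
        f"I={sample}.sorted.bam",
        f"O={sample}.dedup.bam",
        f"M={sample}.dup_metrics.txt",
    ]

    return [
        shlex.join(align) + " | " + shlex.join(sort),
        shlex.join(markdup),
    ]


if __name__ == "__main__":
    for cmd in build_pipeline("ref.fa", "reads_1.fq.gz", "reads_2.fq.gz", "sample1"):
        print(cmd)
```

All other bwa parameters (seed length, scoring, band width) are left at their defaults here. Check the printed commands against the manuals for your installed versions before running them.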

written 3.2 years ago by dyollluap (300)
3.2 years ago by
United States
kirannbishwa01 (1.2k) wrote:

Hi @reza

I think it would benefit you more if you could use the GATK pipeline.

Select the best practices and then go through the workflow documentation. Remember that GATK is an empirical method designed with the human genome in mind, so you may need to deviate from the pipeline in terms of parameters and steps, depending on what your goal is. Also, remember to read the discussion, Q&A, and comments to work out what you need to do.

Using an empirical pipeline makes things easier, but everyone has different goals, so the results you get should be examined with scrutiny. I mean that you need to doubt the results you get from empirical pipelines. In biology and genetics anything can happen; that's the main point of doing biology and using bioinformatics.

modified 3.2 years ago • written 3.2 years ago by kirannbishwa01 (1.2k)