Question: How to account for multiple mapping reads while calling variants?
0
gravatar for kirannbishwa01
4.8 years ago by
kirannbishwa011.2k
United States
kirannbishwa011.2k wrote:

I am using BWA for mapping. My main objective is to call polymorphisms from my genome resequenced data for two different populations for a plant model which has its reference genome sequenced.

If there are reads that align to multiple regions mainly from the paralogs (from duplicate genes/genomic regions, not for other repetitive regions) is it possible to make the read only align to the best match rather than matching it to two different places. - I want the polymorphism information at coding region not to be ambiguous although sequence error might contribute some mis-matching which I think will be less.

If I want to call variant on the alignments (SAM/BAM)  which caller is best that will account for multiple mapped reads when scoring variants.

Thanks in advance !

sam/bam snp bwa alignment • 1.7k views
ADD COMMENTlink modified 4.8 years ago by Chris Fields2.1k • written 4.8 years ago by kirannbishwa011.2k
1
gravatar for Chris Fields
4.8 years ago by
Chris Fields2.1k
University of Illinois Urbana-Champaign
Chris Fields2.1k wrote:

In many cases there isn't a 'best match'; some reads will simply align with equivalent edit distance to separate regions. This is a reality of shorter read length data, not much you can do about it.

Variant calls are generally derived from high quality alignment data (mapping qual is above a certain threshold, I think above 0 which is generally what multimapping reads are given). You can lower this threshold with most tools, e.g. see GATK MappingQualityFilter, or allow for lower quality variant calls.  It will very likely give you tons of false positives you will have to sift through.

ADD COMMENTlink modified 11 months ago by _r_am31k • written 4.8 years ago by Chris Fields2.1k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 2.3.0
Traffic: 2083 users visited in the last hour