Hi,
I'm sorting through some bacterial VCF files generated by bowtie/samtools and a lot of variants have DP4 values indicating the alternative is only on one strand (e.g. DP4 = 5,0,7,0).
The sequences are high-coverage Illumina data.
Is this likely to be sequencing error due to phasing/prephasing rather than a genuine mutation, or could it be strand bias?
Other called variants also yield hits on both strands but do not equal DP (meaning some bases are below the quality threshold); e.g. DP=360, DP4=118,52,54,23 - are these variants likely error?
Thanks!