I'm constructing a matrix of pairwise genetic distance based on Sanger sequence data. The samples are diploid with many variable sites, so there are a bunch of legitimate ambiguous characters (R, M, W etc) in the DNA sequences. I'd like to calculate distances making use of this information, such that [for example] AAA and ARA have a pairwise distance greater than zero but less than the pairwise distance between AAA and AGA. That is, it makes sense to me that heterozygotes should have intermediate genetic distance between both types of homozygote.
I've tried dist.alignment in seqinR and dist.dna in ape, but they both seem to be dropping the ambiguous characters as missing data. Ideas on how I can fix this, or other commands/packages to try, would be so welcome!!