Hi, I have output from snp-dists (https://github.com/tseemann/snp-dists) in molten format, e.g.:
seq1 seq2 1 seq1 seq3 2 seq2 seq1 1 seq2 seq3 3 seq3 seq1 2 seq3 seq2 3
The third column gives the number of SNPs between the pair of sequences given in columns 1 and 2. As you can see, these values are duplicated, as it shows both the combination seq1 seq2 and seq2 seq1. How can I (in R or bash preferably) remove the duplicate values?