Say we have two sequences X1
and X2
that are orthologs to one another. A third sequence X3
gets found that happens to be orthologous to both X1
and X2
.
Now the species X3
comes from is phylogenetically closer to the species X2
comes from than that of X1
. However, X3
is more identical to X1
than it is to X2
.
Does this mean that X3
is more (?) orthologous to X1
than it is to X2
? If homology is a binary yes/no thing (two sequences are either homologs or they are not), then wouldn't orthology--by virtue of being a subset--also be a binary yes/no thing?
Why would it be the case that a sequence from a species that is evolutionarily closer is less identical* than one from a species that is (evolutionarily) farther away? I thought that orthologous sequences are similar, and the smaller the evolutionary distance between them, the more identical they should be.
In general, what is the right way to interpret sequence similarity/identity in the context of sequence orthology?
* - or similar, if we are looking at the similarity score.