I have noticed that pdb.org has an option "remove similar". Can someone explain how it works? My assumption until now was that it compares pairwise all against all and if they have more than X% identity it removes one of the two in the pair.
Or does it remove both?
If it removes only one then which one does it choose?
Also how does it calculate pairwise %identity? With local or global alignment?