Hierarchical sorting quickly constructs a tree-structured clustering, but one that is typically suboptimal. In particular, this control strategy suffers from ordering effects: different orderings of the same observations may yield different clusterings [Fisher, Xu & Zard, 1992]. Thus, after an initial clustering phase, a (possibly offline) process of iterative optimization seeks to uncover better clusterings.
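
Ordering effects can be illustrated with a much simpler incremental method than hierarchical sorting. The sketch below is not the hierarchical sorting procedure itself: it uses a flat "leader" clustering over one-dimensional values, and the function names, the data, and the distance threshold of 1.5 are all assumptions chosen for illustration. It shows only that an order-dependent control strategy can partition the same observations differently when they arrive in a different order.

```python
def leader_clustering(observations, threshold):
    """Incrementally cluster 1-D observations (illustrative stand-in only).

    Each observation joins the first existing cluster whose leader (the
    cluster's first member) lies within `threshold`; otherwise it starts
    a new cluster. Decisions are never revisited, so the result depends
    on the order in which observations arrive.
    """
    clusters = []  # each cluster is a list; element 0 is its leader
    for x in observations:
        for cluster in clusters:
            if abs(x - cluster[0]) <= threshold:
                cluster.append(x)
                break
        else:
            # No existing leader was close enough: start a new cluster.
            clusters.append([x])
    return clusters


def as_partition(clusters):
    """Order-independent view of a clustering, for comparison."""
    return {frozenset(c) for c in clusters}


# The same four observations, presented in two different orders.
forward = leader_clustering([1.0, 2.0, 3.0, 4.0], threshold=1.5)
reordered = leader_clustering([2.0, 1.0, 3.0, 4.0], threshold=1.5)

print(forward)    # [[1.0, 2.0], [3.0, 4.0]]
print(reordered)  # [[2.0, 1.0, 3.0], [4.0]]
print(as_partition(forward) == as_partition(reordered))  # False
```

Because neither run revisits its earlier assignments, neither can recover the other's partition; this is the kind of gap that a later optimization pass is meant to close.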