To evaluate the algorithm itself, in this section we separately evaluate the components of our method for temporal reference resolution. Sections 8.1 and 8.2 assess the key contributions of this work: the focus model (in Section 8.1) and the deictic and anaphoric relations (in Section 8.2). These evaluations required us to perform extensive additional manual annotation of the data. In order to preserve the test dialogs as unseen test data, these annotations were performed on the training data only. In Section 8.3, we isolate the architectural components of our algorithm, such as the certainty factor calculation and the critics, to assess the effects they have on performance.