Abstract:
We present experimental results in learning to classify email in this
fashion, where each class corresponds to a verb-noun pair taken from a
predefined ontology describing typical "email speech acts". We demonstrate
that, although this categorization problem is quite different from "topical"
text classification, certain categories of messages can nonetheless be
detected with high precision (above 80%) and reasonable recall (above 50%)
using existing text-classification learning methods. This result suggests
that useful task-tracking tools could be constructed based on automatic
classification into this taxonomy.
This is joint work with William Cohen and Tom Mitchell. |