Proceedings:
Proceedings of the International AAAI Conference on Web and Social Media, 3
Volume
Issue:
Vol. 3 No. 1 (2009): Third International AAAI Conference on Weblogs and Social Media
Track:
Poster Papers
Downloads:
Abstract:
In this paper we present a regression-based machine learning approach to email thread summarization. The regression model is able to take advantage of multiple gold-standard annotations for training purposes, in contrast to most work with binary classifiers. We also investigate the usefulness of novel features such as speech acts. This paper also introduces a newly created and publicly available email corpus for summarization research. We show that regression-based classifiers perform better than binary classifiers because they preserve more information about annotator judgements. In our comparison between different regression-based classifiers, we found that Bagging and Gaussian Processes have the highest weighted recall.
DOI:
10.1609/icwsm.v3i1.13980
ICWSM
Vol. 3 No. 1 (2009): Third International AAAI Conference on Weblogs and Social Media