Fine-Grained Argument Unit Recognition and Classification

Dietrich Trautmann; Johannes Daxenberger; Christian Stab; Hinrich Schütze; Iryna Gurevych

doi:10.1609/aaai.v34i05.6438

Authors

Dietrich Trautmann, LMU Munich
Johannes Daxenberger, TU Darmstadt
Christian Stab, TU Darmstadt
Hinrich Schütze, LMU Munich
Iryna Gurevych, TU Darmstadt

DOI:

https://doi.org/10.1609/aaai.v34i05.6438

Abstract

Prior work has commonly defined argument retrieval from heterogeneous document collections as a sentence-level classification task. Consequently, argument retrieval suffers both from low recall and from sentence segmentation errors, which make it difficult for humans and machines to consume the arguments. In this work, we argue that the task should instead be performed at the more fine-grained level of sequence labeling. To this end, we define the task of Argument Unit Recognition and Classification (AURC). We present a dataset of arguments from heterogeneous sources, annotated as spans of tokens within a sentence and with a corresponding stance. We show that such difficult argument annotations can be collected effectively through crowdsourcing with high inter-annotator agreement, and we describe how. The new benchmark, AURC-8, contains up to 15% more arguments per topic than sentence-level annotations. We identify a number of methods targeted at AURC sequence labeling that achieve close to human performance on known domains. Further analysis reveals that, in contrast to previous approaches, our methods are more robust against sentence segmentation errors. We publicly release our code and the AURC-8 dataset.
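To make the difference between sentence-level classification and token-level sequence labeling concrete, the following is a minimal sketch, not the authors' released code. It assumes a hypothetical tag set of `PRO`, `CON`, and `NON` (one stance label per token), and the example sentence and its labels are invented for illustration. The helper `extract_units` is likewise a hypothetical name; it groups maximal runs of identically labeled argumentative tokens into units.

```python
from typing import List, Tuple


def extract_units(
    tokens: List[str], labels: List[str], non_label: str = "NON"
) -> List[Tuple[str, str]]:
    """Group maximal runs of identical argumentative labels into (stance, text) units.

    The label names are an assumption made for this sketch.
    """
    if len(tokens) != len(labels):
        raise ValueError("tokens and labels must have the same length")

    units: List[Tuple[str, str]] = []
    start = None  # index where the currently open unit begins, if any
    # A trailing sentinel label closes a unit that runs to the end of the sentence.
    for i, label in enumerate(labels + [non_label]):
        if start is not None and label != labels[start]:
            units.append((labels[start], " ".join(tokens[start:i])))
            start = None
        if start is None and label != non_label:
            start = i
    return units


# Invented example: one sentence that contains two units with opposite stance.
tokens = "Nuclear power is low-carbon , but the waste remains dangerous .".split()
labels = ["PRO"] * 4 + ["NON"] * 2 + ["CON"] * 4 + ["NON"]

for stance, text in extract_units(tokens, labels):
    print(stance, "|", text)
```

A sentence-level classifier would have to assign this sentence a single label, whereas the token-level view recovers two separate units with different stances. This is one way in which finer-grained annotation can yield more arguments per topic.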
