A Benchmark Dataset of Check-Worthy Factual Claims

  • Fatma Arslan, University of Texas at Arlington
  • Naeemul Hassan, University of Maryland
  • Chengkai Li, University of Texas at Arlington
  • Mark Tremayne, University of Texas at Arlington

Abstract

In this paper, we present the ClaimBuster dataset of 23,533 statements extracted from all U.S. general election presidential debates and annotated by human coders. The dataset can be used to build computational methods that identify claims worth fact-checking among the many sources of digital or traditional media. It is publicly available to the research community at http://doi.org/10.5281/zenodo.3609356.

Published
2020-05-26
How to Cite
Arslan, F., Hassan, N., Li, C., & Tremayne, M. (2020). A Benchmark Dataset of Check-Worthy Factual Claims. Proceedings of the International AAAI Conference on Web and Social Media, 14(1), 821-829. Retrieved from https://aaai.org/ojs/index.php/ICWSM/article/view/7346