Platform-Related Factors in Repeatability and Reproducibility of Crowdsourcing Tasks

Authors

Rehab Qarout,Alessandro Checco,Gianluca Demartini,Kalina Bontcheva

The University of Sheffield,The University of Sheffield,The University of Queensland,The University of Sheffield

Published:

2019-10-21

Proceedings:

Proceedings of the AAAI Conference on Human Computation and Crowdsourcing, 7

Volume

Issue:

Vol. 7 (2019): Proceedings of the Seventh AAAI Conference on Human Computation and Crowdsourcing

Track:

Technical Papers

Downloads:

Download PDF

Abstract:

Crowdsourcing platforms provide a convenient and scalable way to collect human-generated labels on-demand. This data can be used to train Artificial Intelligence (AI) systems or to evaluate the effectiveness of algorithms. The datasets generated by means of crowdsourcing are, however, dependent on many factors that affect their quality. These include, among others, the population sample bias introduced by aspects like task reward, requester reputation, and other filters introduced by the task design.In this paper, we analyse platform-related factors and study how they affect dataset characteristics by running a longitudinal study where we compare the reliability of results collected with repeated experiments over time and across crowdsourcing platforms. Results show that, under certain conditions: 1) experiments replicated across different platforms result in significantly different data quality levels while 2) the quality of data from repeated experiments over time is stable within the same platform. We identify some key task design variables that cause such variations and propose an experimentally validated set of actions to counteract these effects thus achieving reliable and repeatable crowdsourced data collection experiments.

DOI:

10.1609/hcomp.v7i1.5264

HCOMP

Vol. 7 (2019): Proceedings of the Seventh AAAI Conference on Human Computation and Crowdsourcing

ISBN 978-1-57735-820-6

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.