Anatomy of Online Hate: Developing a Taxonomy and Machine Learning Models for Identifying and Classifying Hate in Online News Media

Authors

Joni Salminen,Hind Almerekhi,Milica Milenković,Soon-gyo Jung,Jisun An,Haewoon Kwak,Bernard Jansen

Qatar Computing Research Institute, Hamad Bin Khalifa University,Hamad Bin Khalifa University,Independent Researcher,Qatar Computing Research Institute, Hamad Bin Khalifa University,Qatar Computing Research Institute, Hamad Bin Khalifa University,Qatar Computing Research Institute, Hamad Bin Khalifa University,Qatar Computing Research Institute, Hamad Bin Khalifa University

Proceedings:

Vol. 12 No. 1 (2018): Twelfth International AAAI Conference on Web and Social Media

Volume

Issue:

Vol. 12 No. 1 (2018): Twelfth International AAAI Conference on Web and Social Media

Track:

Full Papers

Downloads:

Download PDF

Abstract:

Online social media platforms generally attempt to mitigate hateful expressions, as these comments can be detrimental to the health of the community. However, automatically identifying hateful comments can be challenging. We manually label 5,143 hateful expressions posted to YouTube and Facebook videos among a dataset of 137,098 comments from an online news media. We then create a granular taxonomy of different types and targets of online hate and train machine learning models to automatically detect and classify the hateful comments in the full dataset. Our contribution is twofold: 1) creating a granular taxonomy for hateful online comments that includes both types and targets of hateful comments, and 2) experimenting with machine learning, including Logistic Regression, Decision Tree, Random Forest, Adaboost, and Linear SVM, to generate a multiclass, multilabel classification model that automatically detects and categorizes hateful comments in the context of online news media. We find that the best performing model is Linear SVM, with an average F1 score of 0.79 using TF-IDF features. We validate the model by testing its predictive ability, and, relatedly, provide insights on distinct types of hate speech taking place on social media.

DOI:

10.1609/icwsm.v12i1.15028

ICWSM

Vol. 12 No. 1 (2018): Twelfth International AAAI Conference on Web and Social Media

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.