Proceedings:
Vol. 16 (2022): Proceedings of the Sixteenth International AAAI Conference on Web and Social Media
Track:
Dataset Papers
Abstract:
New social networks and platforms such as Telegram, Gab and Parler offer a stage for extremist, racist and aggressive content, but also provide a safe space for freedom fighters in authoritarian regimes. Data from such platforms offer excellent opportunities for research on issues such as linguistic bias and toxic language detection. However, only a few, mostly unannotated, English-only corpora from such platforms exist. This article presents a new Telegram corpus in the Russian and Belarusian languages, tailored for research on linguistic bias in political news. In addition, we created a repository to make all currently available corpora from so-called "dark" platforms accessible in one place.
DOI:
10.1609/icwsm.v16i1.19378