Transfer learning for hate speech detection in social media

Yuan, Lanqin; Wang, Tianyu; Ferraro, Gabriela; Suominen, Hanna; Rizoiu, Marian-Andrei

Transfer learning for hate speech detection in social media

Lanqin Yuan (), Tianyu Wang (), Gabriela Ferraro (), Hanna Suominen () and Marian-Andrei Rizoiu ()
Additional contact information
Lanqin Yuan: University of Technology Sydney
Tianyu Wang: The Australian National University
Gabriela Ferraro: The Australian National University
Hanna Suominen: The Australian National University
Marian-Andrei Rizoiu: University of Technology Sydney

Journal of Computational Social Science, 2023, vol. 6, issue 2, No 24, 1101 pages

Abstract: Abstract Today, the internet is an integral part of our daily lives, enabling people to be more connected than ever before. However, this greater connectivity and access to information increase exposure to harmful content, such as cyber-bullying and cyber-hatred. Models based on machine learning and natural language offer a way to make online platforms safer by identifying hate speech in web text autonomously. However, the main difficulty is annotating a sufficiently large number of examples to train these models. This paper uses a transfer learning technique to leverage two independent datasets jointly and builds a single representation of hate speech. We build an interpretable two-dimensional visualization tool of the constructed hate speech representation—dubbed the Map of Hate—in which multiple datasets can be projected and comparatively analyzed. The hateful content is annotated differently across the two datasets (racist and sexist in one dataset, hateful and offensive in another). However, the common representation successfully projects the harmless class of both datasets into the same space and can be used to uncover labeling errors (false positives). We also show that the joint representation boosts prediction performances when only a limited amount of supervision is available. These methods and insights hold the potential for safer social media and reduce the need to expose human moderators and annotators to distressing online messaging.

Keywords: Hate speech; Transfer learning; Visualization; Twitter; Domain adaptation; Offensive speech (search for similar items in EconPapers)
Date: 2023
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
http://link.springer.com/10.1007/s42001-023-00224-9 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:jcsosc:v:6:y:2023:i:2:d:10.1007_s42001-023-00224-9

Ordering information: This journal article can be ordered from
http://www.springer. ... iences/journal/42001

DOI: 10.1007/s42001-023-00224-9

Access Statistics for this article

Journal of Computational Social Science is currently edited by Takashi Kamihigashi

More articles in Journal of Computational Social Science from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().