Systematic literature review on hate speech detection in Indian low-resource languages

Pannerselvam, Kathiravan; Rajiakodi, Saranya

Systematic literature review on hate speech detection in Indian low-resource languages

Kathiravan Pannerselvam and Saranya Rajiakodi ()
Additional contact information
Kathiravan Pannerselvam: Central University of Tamil Nadu
Saranya Rajiakodi: Central University of Tamil Nadu

Journal of Computational Social Science, 2026, vol. 9, issue 1, No 5, 47 pages

Abstract: Abstract Hate speech detection is a growing area of research driven by the increasing spread of harmful content on social media platforms. Automated systems are needed to identify and manage such content, particularly in low-resource Indian languages such as Hindi, Tamil, Malayalam, Telugu, and others, which remain underrepresented in Natural Language Processing (NLP) research. We conducted a Systematic Literature Review (SLR) focused on hate speech detection in these languages to assess recent developments and challenges. We retrieved 2,372 records from databases including ScienceDirect, IEEE Xplore, ACM Digital Library, Google Scholar, and arXiv, and selected 61 studies based on predefined inclusion criteria. The review highlights significant challenges such as limited annotated datasets, class imbalance, and the difficulty of modeling socio-linguistic diversity. Although multilingual and code-mixed models show promise, they often fail to capture regional language variations and emerging forms of online hate speech. Ethical concerns about bias, fairness, and transparency in automated systems are also addressed. Additionally, the influence of social norms, language variation, and platform-specific policies on model performance is discussed. This review offers valuable directions for future research, emphasizing the need for culturally informed models, better data resources, and methods to address linguistic diversity. The goal is to support the development of fair and inclusive hate speech detection systems that protect vulnerable communities while upholding freedom of expression.

Keywords: Hate speech detection; Low resource language; Machine learning; Natural language processing; Offensive language detection; Social media post analysis (search for similar items in EconPapers)
Date: 2026
References: Add references at CitEc
Citations:

Downloads: (external link)
http://link.springer.com/10.1007/s42001-025-00432-5 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:jcsosc:v:9:y:2026:i:1:d:10.1007_s42001-025-00432-5

Ordering information: This journal article can be ordered from
http://www.springer. ... iences/journal/42001

DOI: 10.1007/s42001-025-00432-5

Access Statistics for this article

Journal of Computational Social Science is currently edited by Takashi Kamihigashi

More articles in Journal of Computational Social Science from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().