EconPapers    
Economics at your fingertips  
 

Audio Driven Detection of Hate Speech in Telugu: Toward Ethical and Secure CPS

M. Santhosh Kumar, P. Sai Ravula, M. Prasanna Teja, J. Ajay Surya, V. Mohitha and G. Jyothish Lal ()
Additional contact information
M. Santhosh Kumar: Amrita Vishwa Vidyapeetham, School of Artificial Intelligence
P. Sai Ravula: Amrita Vishwa Vidyapeetham, School of Artificial Intelligence
M. Prasanna Teja: Amrita Vishwa Vidyapeetham, School of Artificial Intelligence
J. Ajay Surya: Amrita Vishwa Vidyapeetham, School of Artificial Intelligence
V. Mohitha: Amrita Vishwa Vidyapeetham, School of Artificial Intelligence
G. Jyothish Lal: Amrita Vishwa Vidyapeetham, School of Artificial Intelligence

A chapter in Reliability in Cyber-Physical Systems: The Human Factor Perspective, 2026, pp 51-63 from Springer

Abstract: Abstract The rapid integration of social media platforms into cyber-physical systems has introduced new challenges in ensuring human-centric reliability and safety. This is mainly due to the widespread dissemination of hate speech and the inability of online systems to effectively moderate offensive content. While significant advances have been made toward hate speech detection in high-resource languages such as English, low-resource languages such as Telugu do not have the annotated datasets and tools to properly detect it. This project addresses this gap by creating a complete annotated multimodal hate speech dataset in the Telugu language, consisting of 2 h of audio-text pairs from YouTube. The dataset enables the exploration of hate speech detection in individual modalities, speech and text, as well as in a combined multimodal setting. The work presented in this paper is focused on the detection of hate speech based on audio data with text-based analysis incorporated as an ablation study to better understand the modality-specific contributions. Our classification results demonstrate that the combination of OpenSMILE acoustic features and an SVM classifier yields the highest performance in speech classification, achieving an F1 score of 0.89. In contrast, the best-performing text model, using LaBSE embeddings, attained an F1 score of 0.88.

Keywords: Hate speech detection; Dravidian languages; Telugu hatespeech; Secure cyber physical systems (search for similar items in EconPapers)
Date: 2026
References: Add references at CitEc
Citations:

There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:ssrchp:978-3-032-09917-4_3

Ordering information: This item can be ordered from
http://www.springer.com/9783032099174

DOI: 10.1007/978-3-032-09917-4_3

Access Statistics for this chapter

More chapters in Springer Series in Reliability Engineering from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2026-01-23
Handle: RePEc:spr:ssrchp:978-3-032-09917-4_3