A Novel Trustworthy Toxic Text Detection Method with Entropy-Oriented Invariant Representation Learning for Portuguese Community

Fan, Wenting; Song, Haoyan; Zhang, Jun

Wenting Fan, Haoyan Song and Jun Zhang
Additional contact information
Wenting Fan: School of European Language and Culture Studies, Dalian University of Foreign Languages, Dalian 116044, China
Haoyan Song: University International College, Macau University of Science and Technology, Macau 999078, China
Jun Zhang: Graduate School of Education, Dalian University of Technology, Dalian 116024, China

Mathematics, 2025, vol. 13, issue 13, 1-16

Abstract: With the rapid development of digital technologies, data-driven methods have achieved strong performance on the toxic text detection task. However, several challenges remain unresolved: existing methods do not fully capture the nuanced semantic information embedded in text, lack robust mechanisms for handling the inherent uncertainty of language, and rely on static fusion strategies for multi-view information. To address these issues, this paper proposes a comprehensive and dynamic toxic text detection method. Specifically, we design a multi-view feature augmentation module that combines bidirectional long short-term memory (BiLSTM) and BERT in a dual-stream framework. This module captures a more holistic representation of semantic information by learning both local and global features of texts. Next, we introduce an entropy-oriented invariant learning module that minimizes the conditional entropy between view-specific representations to align their consistent information, thereby improving the generalization of the learned representations. We also devise a trustworthy text recognition module that uses the Dirichlet distribution to model the uncertainty of text predictions. We then apply an evidence-based information fusion strategy that dynamically aggregates decision information across views with the help of the Dirichlet distribution. Through these components, the proposed method aims to overcome the limitations of traditional methods and provide a more accurate and reliable solution for toxic language detection. Finally, extensive experiments on two real-world datasets show the effectiveness and superiority of the proposed method in comparison with seven methods.
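
The abstract does not give the formulas behind the Dirichlet-based uncertainty estimate or the evidence-based fusion. As a rough illustration only, the sketch below uses a formulation that is common in evidential deep learning: non-negative per-class evidence e_k gives Dirichlet parameters alpha_k = e_k + 1, belief masses b_k = e_k / S and an uncertainty mass u = K / S with S = sum(alpha), and two views are merged with a reduced Dempster-Shafer combination rule. Whether the paper uses exactly these definitions is an assumption, and the function names and example evidence values are hypothetical.

```python
from typing import List, Tuple

Opinion = Tuple[List[float], float]  # (belief masses per class, uncertainty mass)


def opinion_from_evidence(evidence: List[float]) -> Opinion:
    """Turn non-negative class evidence into belief masses and uncertainty.

    Assumed formulation: alpha_k = e_k + 1, S = sum(alpha),
    b_k = e_k / S, u = K / S, so that sum(b) + u == 1.
    """
    num_classes = len(evidence)
    strength = sum(e + 1.0 for e in evidence)  # Dirichlet strength S
    belief = [e / strength for e in evidence]
    uncertainty = num_classes / strength
    return belief, uncertainty


def fuse(first: Opinion, second: Opinion) -> Opinion:
    """Combine two views with a reduced Dempster-Shafer rule (assumed here)."""
    b1, u1 = first
    b2, u2 = second
    # Conflict: belief mass the two views assign to different classes.
    conflict = sum(
        b1[i] * b2[j]
        for i in range(len(b1))
        for j in range(len(b2))
        if i != j
    )
    scale = 1.0 / (1.0 - conflict)
    belief = [
        scale * (b1[k] * b2[k] + b1[k] * u2 + b2[k] * u1)
        for k in range(len(b1))
    ]
    uncertainty = scale * u1 * u2
    return belief, uncertainty


# Hypothetical evidence for the classes [toxic, non-toxic] from two views,
# e.g. one confident stream and one uncertain stream.
view_a = opinion_from_evidence([8.0, 0.0])
view_b = opinion_from_evidence([0.5, 0.5])
fused_belief, fused_uncertainty = fuse(view_a, view_b)
print(fused_belief, fused_uncertainty)
```

Under this rule the confident view dominates the fused decision, and the fused uncertainty is lower than that of either view alone, which is one way a fusion step can weight views dynamically rather than with fixed coefficients.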

Keywords: toxic language detection; invariant representation learning; trustworthy-driven adaptive fusion
JEL-codes: C
Date: 2025

Downloads: (external link)
https://www.mdpi.com/2227-7390/13/13/2136/pdf (application/pdf)
https://www.mdpi.com/2227-7390/13/13/2136/ (text/html)

Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:13:y:2025:i:13:p:2136-:d:1690890

Mathematics is currently edited by Ms. Emma He

Bibliographic data for series maintained by MDPI Indexing Manager.