Addressing the label dilemma: A self-semi-supervised step-wise complementary label boosting strategy for industrial anomaly detection

Yang, Jiayang; Zhao, Chunhui

Reliability Engineering and System Safety, 2025, vol. 264, issue PB

Abstract: Artificial Intelligence (AI) technology has recently been widely employed in data-driven industrial anomaly detection. However, because the operating status of industrial processes is difficult to acquire reliably, most process data may be collected without rigorous examination, which leaves their exact statuses uncertain and limits their safe use in AI-powered anomaly detection modeling. Additionally, samples with definite annotations may still carry misjudged statuses due to manual error, exposing anomaly detection modeling to a significant risk of being misled. In this work, we treat anomaly detection as a binary classification task and frame the aforementioned challenges as a modeling dilemma involving sample labels (annotations indicating operating status, i.e., normal/abnormal), in which the available labels are simultaneously insufficient and unreliable. To address that dilemma, a self-semi-supervised step-wise complementary label boosting (S4CLB) strategy is proposed. The S4CLB strategy consists of two main stages. In the first stage, a self-supervised contrastive autoencoding Gaussian mixture model (CAGMM) is developed to provide representations of all process samples for the subsequent anomaly detection by describing their data distribution information with low-dimensional features. In the second stage, a semi-supervised label boosting strategy is designed in a step-wise manner. Specifically, noisy label filtering and adaptive label enrichment are conducted alternately to iteratively boost the sufficiency and reliability of the available labels. Meanwhile, a robust dual complementary classifier (RDCC) model, comprising two robust peer classifiers with different views, is developed to provide prompt feedback for label boosting, which further guarantees the reliability of label adjustment. Finally, the anomaly detection results are obtained by the RDCC model. The effectiveness of the proposed method is verified on a real industrial process.
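
The second stage described in the abstract, alternating noisy label filtering with label enrichment under feedback from two peer classifiers, can be illustrated with a minimal Python sketch. This is not the authors' S4CLB implementation: the abstract does not specify the CAGMM or RDCC models, so two simple stand-ins (a nearest-centroid scorer and a k-nearest-neighbour scorer) play the role of the peer classifiers. The `UNLABELED` marker, the `conf` threshold, and all function names are assumptions made only for this example.

```python
import numpy as np

UNLABELED = -1  # hypothetical marker for samples without a usable label


def centroid_proba(x_train, y_train, x_query):
    """Stand-in peer A: P(abnormal) from distances to the two class centroids."""
    d0 = np.linalg.norm(x_query - x_train[y_train == 0].mean(axis=0), axis=1)
    d1 = np.linalg.norm(x_query - x_train[y_train == 1].mean(axis=0), axis=1)
    return d0 / (d0 + d1 + 1e-12)


def knn_proba(x_train, y_train, x_query, k=5):
    """Stand-in peer B: fraction of abnormal labels among the k nearest neighbours."""
    dists = np.linalg.norm(x_query[:, None, :] - x_train[None, :, :], axis=2)
    nearest = np.argsort(dists, axis=1)[:, :k]
    return y_train[nearest].mean(axis=1)


def peer_vote(x_train, y_train, x_query, conf, k):
    """Return 0/1 where both peers agree confidently, UNLABELED elsewhere."""
    pa = centroid_proba(x_train, y_train, x_query)
    pb = knn_proba(x_train, y_train, x_query, k)
    vote = np.full(len(x_query), UNLABELED)
    vote[(pa >= conf) & (pb >= conf)] = 1
    vote[(pa <= 1 - conf) & (pb <= 1 - conf)] = 0
    return vote


def boost_labels(x, y, n_steps=3, conf=0.75, k=5):
    """Alternate label filtering and label enrichment for n_steps rounds.

    Assumes both classes keep at least one labeled sample throughout.
    """
    y = y.copy()
    for _ in range(n_steps):
        # Filtering: drop labels that both peers confidently contradict.
        labeled = y != UNLABELED
        vote = peer_vote(x[labeled], y[labeled], x, conf, k)
        noisy = labeled & (vote != UNLABELED) & (vote != y)
        y[noisy] = UNLABELED

        # Enrichment: label unlabeled samples on which both peers agree.
        labeled = y != UNLABELED
        vote = peer_vote(x[labeled], y[labeled], x, conf, k)
        fill = ~labeled & (vote != UNLABELED)
        y[fill] = vote[fill]
    return y
```

In this sketch, `x` would hold low-dimensional features (the role the abstract assigns to the CAGMM representations) and `y` would hold 0 for normal, 1 for abnormal, and `UNLABELED` for samples of unknown status. Requiring agreement between two different scorers before a label is changed is one simple way to mimic the "prompt feedback" role of the peer classifiers; the paper's actual criteria may differ.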

Keywords: Self-supervised learning; Semi-supervised learning; Noisy label learning; Symmetric loss function; Anomaly detection
Date: 2025

Downloads: (external link)
http://www.sciencedirect.com/science/article/pii/S0951832025005708
Full text for ScienceDirect subscribers only

Persistent link: https://EconPapers.repec.org/RePEc:eee:reensy:v:264:y:2025:i:pb:s0951832025005708

DOI: 10.1016/j.ress.2025.111369

Reliability Engineering and System Safety is currently edited by Carlos Guedes Soares

More articles in Reliability Engineering and System Safety from Elsevier
Bibliographic data for series maintained by Catherine Liu.