Big GCVAE: decision-making with adaptive transformer model for failure root cause analysis in semiconductor industry
Kenneth Ezukwoke (),
Anis Hoayek,
Mireille Batton-Hubert,
Xavier Boucher,
Pascal Gounet and
Jérôme Adrian
Additional contact information
Kenneth Ezukwoke: Univ. Clermont Auvergne, CNRS UMR 6158, LIMOS
Anis Hoayek: Univ. Clermont Auvergne, CNRS UMR 6158, LIMOS
Mireille Batton-Hubert: Univ. Clermont Auvergne, CNRS UMR 6158, LIMOS
Xavier Boucher: Univ. Clermont Auvergne, CNRS UMR 6158, LIMOS
Pascal Gounet: STMicroelectronics
Jérôme Adrian: STMicroelectronics
Journal of Intelligent Manufacturing, 2025, vol. 36, issue 4, No 10, 2423-2438
Abstract:
Abstract Pre-trained large language models (LLMs) have gained significant attention in the field of natural language processing (NLP), especially for the task of text summarization, generation, and question answering. The success of LMs can be attributed to the attention mechanism introduced in Transformer models, which have outperformed traditional recurrent neural network models (e.g., LSTM) in modeling sequential data. In this paper, we leverage pre-trained causal language models for the downstream task of failure analysis triplet generation (FATG), which involves generating a sequence of failure analysis decision steps for identifying failure root causes in the semiconductor industry. In particular, we conduct extensive comparative analysis of various transformer models for the FATG task and find that the BERT-GPT-2 Transformer (Big GCVAE), fine-tuned on a proposed Generalized-Controllable Variational AutoEncoder loss (GCVAE), exhibits superior performance in generating informative latent space by promoting disentanglement of latent factors. Specifically, we observe that fine-tuning the Transformer style BERT-GPT2 on the GCVAE loss yields optimal representation by reducing the trade-off between reconstruction loss and KL-divergence, promoting meaningful, diverse and coherent FATs similar to expert expectations.
Keywords: Failure root cause analysis; Large language model; Decision-making; Generalized latent space modelling (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s10845-024-02346-x Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:joinma:v:36:y:2025:i:4:d:10.1007_s10845-024-02346-x
Ordering information: This journal article can be ordered from
http://www.springer.com/journal/10845
DOI: 10.1007/s10845-024-02346-x
Access Statistics for this article
Journal of Intelligent Manufacturing is currently edited by Andrew Kusiak
More articles in Journal of Intelligent Manufacturing from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().