Predicting Component Failures Using Latent Dirichlet Allocation
Hailin Liu,
Ling Xu,
Mengning Yang,
Meng Yan and
Xiaohong Zhang
Mathematical Problems in Engineering, 2015, vol. 2015, 1-15
Abstract:
Latent Dirichlet Allocation (LDA) is a statistical topic model that has been widely used to abstract semantic information from software source code. Failure refers to an observable error in the program behavior. This work investigates whether semantic information and failures recorded in the history can be used to predict component failures. We use LDA to abstract topics from source code and a new metric (topic failure density) is proposed by mapping failures to these topics. Exploring the basic information of topics from neighboring versions of a system, we obtain a similarity matrix. Multiply the Topic Failure Density (TFD) by the similarity matrix to get the TFD of the next version. The prediction results achieve an average 77.8% agreement with the real failures by considering the top 3 and last 3 components descending ordered by the number of failures. We use the Spearman coefficient to measure the statistical correlation between the actual and estimated failure rate. The validation results range from 0.5342 to 0.8337 which beats the similar method. It suggests that our predictor based on similarity of topics does a fine job of component failure prediction.
Date: 2015
References: Add references at CitEc
Citations: View citations in EconPapers (2)
Downloads: (external link)
http://downloads.hindawi.com/journals/MPE/2015/562716.pdf (application/pdf)
http://downloads.hindawi.com/journals/MPE/2015/562716.xml (text/xml)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:hin:jnlmpe:562716
DOI: 10.1155/2015/562716
Access Statistics for this article
More articles in Mathematical Problems in Engineering from Hindawi
Bibliographic data for series maintained by Mohamed Abdelhakeem ().