Software fault proneness prediction: a comparative study between bagging, boosting, and stacking ensemble and base learner methods
Mohammed Akour,
Izzat Alsmadi and
Iyad Alazzam
International Journal of Data Analysis Techniques and Strategies, 2017, vol. 9, issue 1, 1-16
Abstract:
Modules with defects might be the prime reason for decreasing the software quality and increasing the cost of maintenance. Therefore, the prediction of faulty modules of systems under test at early stages contributes to the overall quality of software products. In this research three symmetric ensemble methods: bagging, boosting and stacking are used to predict faulty modules based on evaluating the performance of 11 base learners. The results reveal that the defect prediction performance of the base learner classifier and ensemble learner classifiers is the same for naïve Bayes, Bayes net, PART, random forest, IB1, VFI, decision table, and NB tree base learners, the case was different for boosted SMO, bagged J48 and boosted and bagged random tree. In addition the results showed that the random forest classifier is one of the most significant classifiers that should be stacked with other classifiers to gain the better fault prediction.
Keywords: software defect prediction; bagging; boosting; stacking; data mining; software defects; software faults; software testing; software development; software quality; ensemble classifiers; base learner classifier; random forest. (search for similar items in EconPapers)
Date: 2017
References: Add references at CitEc
Citations: View citations in EconPapers (4)
Downloads: (external link)
http://www.inderscience.com/link.php?id=83058 (text/html)
Access to full text is restricted to subscribers.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:ids:injdan:v:9:y:2017:i:1:p:1-16
Access Statistics for this article
More articles in International Journal of Data Analysis Techniques and Strategies from Inderscience Enterprises Ltd
Bibliographic data for series maintained by Sarah Parker ().