Nonlinear Geometric Framework for Software Defect Prediction
Misha Kakkar,
Sarika Jain,
Abhay Bansal and
P. S. Grover
Additional contact information
Misha Kakkar: Amity University, India
Sarika Jain: Amity Institute of Information Technology, Amity University, India
Abhay Bansal: Department of Computer Science and Engineering, Amity University, India
P. S. Grover: KIIT Group of Colleges, Gurgaon, India
International Journal of Decision Support System Technology (IJDSST), 2020, vol. 12, issue 3, 85-100
Abstract:
Humans use the software in every walk of life thus it is essential to have the best quality software. Software defect prediction models assist in identifying defect prone modules with the help of historical data, which in turn improves software quality. Historical data consists of data related to modules /files/classes which are labeled as buggy or clean. As the number of buggy artifacts as less as compared to clean artifacts, the nature of historical data becomes imbalance. Due to this uneven distribution of the data, it difficult for classification algorithms to build highly effective SDP models. The objective of this study is to propose a new nonlinear geometric framework based on SMOTE and ensemble learning to improve the performance of SDP models. The study combines the traditional SMOTE algorithm and the novel ensemble Support Vector Machine (SVM) is used to develop the proposed framework called SMEnsemble. SMOTE algorithm handles the class imbalance problem by generating synthetic instances of the minority class. Ensemble learning generates multiple classification models to select the best performing SDP model. For experimentation, datasets from three different software repositories that contain both open source as well as proprietary projects are used in the study. The results show that SMEnsemble performs better than traditional methods for identifying the minority class i.e. buggy artifacts. Also, the proposed model performance is better than the latest state of Art SDP model- SMOTUNED. The proposed model is capable of handling imbalance classes when compared with traditional methods. Also, by carefully selecting the number of ensembles high performance can be achieved in less time.
Date: 2020
References: Add references at CitEc
Citations:
Downloads: (external link)
http://services.igi-global.com/resolvedoi/resolve. ... 18/IJDSST.2020070105 (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:igg:jdsst0:v:12:y:2020:i:3:p:85-100
Access Statistics for this article
International Journal of Decision Support System Technology (IJDSST) is currently edited by Shaofeng Liu
More articles in International Journal of Decision Support System Technology (IJDSST) from IGI Global
Bibliographic data for series maintained by Journal Editor ().