Cross-Project Defect Prediction Method Based on Manifold Feature Transformation

Zhao, Yu; Zhu, Yi; Yu, Qiao; Chen, Xiaoying

Cross-Project Defect Prediction Method Based on Manifold Feature Transformation

Yu Zhao, Yi Zhu, Qiao Yu and Xiaoying Chen
Additional contact information
Yu Zhao: School of Computer Science and Technology, Jiangsu Normal University, Xuzhou 221116, China
Yi Zhu: School of Computer Science and Technology, Jiangsu Normal University, Xuzhou 221116, China
Qiao Yu: School of Computer Science and Technology, Jiangsu Normal University, Xuzhou 221116, China
Xiaoying Chen: School of Computer Science and Technology, Jiangsu Normal University, Xuzhou 221116, China

Future Internet, 2021, vol. 13, issue 8, 1-18

Abstract: Traditional research methods in software defect prediction use part of the data in the same project to train the defect prediction model and predict the defect label of the remaining part of the data. However, in the practical realm of software development, the software project that needs to be predicted is generally a brand new software project, and there is not enough labeled data to build a defect prediction model; therefore, traditional methods are no longer applicable. Cross-project defect prediction uses the labeled data of the same type of project similar to the target project to build the defect prediction model, so as to solve the problem of data loss in traditional methods. However, the difference in data distribution between the same type of project and the target project reduces the performance of defect prediction. To solve this problem, this paper proposes a cross-project defect prediction method based on manifold feature transformation. This method transforms the original feature space of the project into a manifold space, then reduces the difference in data distribution of the transformed source project and the transformed target project in the manifold space, and finally uses the transformed source project to train a naive Bayes prediction model with better performance. A comparative experiment was carried out using the Relink dataset and the AEEEM dataset. The experimental results show that compared with the benchmark method and several cross-project defect prediction methods, the proposed method effectively reduces the difference in data distribution between the source project and the target project, and obtains a higher F1 value, which is an indicator commonly used to measure the performance of the two-class model.

Keywords: cross-project defect prediction; manifold feature transformation; naive Bayes prediction model; F1 (search for similar items in EconPapers)
JEL-codes: O3 (search for similar items in EconPapers)
Date: 2021
References: View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/1999-5903/13/8/216/pdf (application/pdf)
https://www.mdpi.com/1999-5903/13/8/216/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jftint:v:13:y:2021:i:8:p:216-:d:618376

Access Statistics for this article

Future Internet is currently edited by Ms. Grace You

More articles in Future Internet from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().