EconPapers    
Economics at your fingertips  
 

Investigating Bad Smells with Feature Selection and Machine Learning Approaches

Aakanshi Gupta (), Rashmi Gandhi () and Vijay Kumar ()
Additional contact information
Aakanshi Gupta: Amity University Uttar Pradesh
Rashmi Gandhi: Amity University Uttar Pradesh
Vijay Kumar: Amity Institute of Applied Sciences, Amity University Uttar Pradesh

A chapter in Predictive Analytics in System Reliability, 2023, pp 53-65 from Springer

Abstract: Abstract Code Smell is a piece of code that is designed and implemented poorly and it gives adverse effect on the software quality and maintenance. Now, a day’s machine learning based techniques have been extensively used towards code smell research. The main objective of this research is to optimise the features of Android code smells in terms of software metrics using feature selection technique based on Correlation on 2896 instances of open-source projects which are extracted from GitHub. Further, we have examined the performance measures like accuracy, precision, F-measure and execution time etc. with the reduced features data set of Android code smells. This paper also discussed about implementation of correlation-based feature selection algorithm to reduce the features of code smells. Then, the data has been analyzed with 4 machine learning algorithms that are Logistic Regression, Stochastic Gradient Descent (SGD), Simple Logistic and Sequential minimal optimization (SMO). The performance metrics for the above-mentioned machine learning algorithms with and without performing the feature selection have been compared. The computed outcome shows that the best accuracy and lesser execution time for all 3 considered Android code smells have been achieved using Logistic Regression algorithm. After feature selection the accuracy has increased up to 16%, 25% and 4.7% for NLMR, MIM and DTWC code smells respectively. Meanwhile, the other performance measures have also been increased.

Keywords: Feature selection; Android code smell; Logistic regression; GITHUB (search for similar items in EconPapers)
Date: 2023
References: Add references at CitEc
Citations:

There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:ssrchp:978-3-031-05347-4_4

Ordering information: This item can be ordered from
http://www.springer.com/9783031053474

DOI: 10.1007/978-3-031-05347-4_4

Access Statistics for this chapter

More chapters in Springer Series in Reliability Engineering from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-04-01
Handle: RePEc:spr:ssrchp:978-3-031-05347-4_4