Open Access Article

Performance Analysis of Feature Selection Methods in Software Defect Prediction: A Search Method Approach

1 Department of Computer and Information Sciences, Universiti Teknologi PETRONAS, Perak 32610, Malaysia
2 Department of Computer Science, University of Ilorin, Ilorin 240103, Nigeria
* Author to whom correspondence should be addressed.
Appl. Sci. 2019, 9(13), 2764; https://doi.org/10.3390/app9132764
Received: 26 April 2019 / Revised: 10 May 2019 / Accepted: 14 May 2019 / Published: 9 July 2019

Abstract

Software Defect Prediction (SDP) models are built using software metrics derived from software systems. The quality of an SDP model depends largely on the quality of the software metrics (the dataset) used to build it. High dimensionality is one of the data quality problems that affect the performance of SDP models. Feature selection (FS) is a proven method for addressing the dimensionality problem. However, the choice of FS method for SDP remains an open problem, as most empirical studies on FS methods for SDP report contradictory and inconsistent outcomes. FS methods behave differently because of their different underlying computational characteristics. This could be due to the choice of search method used in FS, because the impact of FS depends on the choice of search method. It is hence imperative to comparatively analyze the performance of FS methods based on different search methods in SDP. In this paper, four filter feature ranking (FFR) and fourteen filter feature subset selection (FSS) methods were evaluated using four different classifiers over five software defect datasets obtained from the National Aeronautics and Space Administration (NASA) repository. The experimental analysis showed that applying FS improves the predictive performance of classifiers and that the performance of FS methods can vary across datasets and classifiers. Among the FFR methods, Information Gain demonstrated the greatest improvements in the performance of the prediction models. Among the FSS methods, Consistency Feature Subset Selection based on Best First Search had the best influence on the prediction models. However, prediction models based on FFR proved to be more stable than those based on FSS methods. Hence, we conclude that FS methods improve the performance of SDP models, and that there is no single best FS method, as their performance varied according to the dataset and the choice of prediction model. However, we recommend the use of FFR methods, as the prediction models based on FFR are more stable in terms of predictive performance.
Keywords: software defect prediction; feature selection; high dimensionality; search methods
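
To make the idea of filter feature ranking by Information Gain concrete, the following is a minimal sketch in Python. It is not the authors' implementation or tooling: the function names (`entropy`, `information_gain`, `rank_features`), the toy metric names, and the toy data are all hypothetical, and the features are assumed to be already discretized. Each feature is scored by how much it reduces the entropy of the defect label, and features are then sorted by that score.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(feature_values, labels):
    """Reduction in label entropy after splitting on a discrete feature."""
    total = len(labels)
    groups = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    # Weighted average entropy of the labels within each feature value.
    conditional = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


def rank_features(columns, labels):
    """Return (name, gain) pairs sorted from most to least informative."""
    scores = {name: information_gain(vals, labels) for name, vals in columns.items()}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical toy data: two discretized metrics for six software modules.
toy_metrics = {
    "high_complexity": [1, 1, 1, 0, 0, 0],
    "long_file": [1, 0, 1, 0, 1, 0],
}
defective = [1, 1, 1, 0, 0, 0]  # 1 = defective module, 0 = clean module

ranking = rank_features(toy_metrics, defective)
for name, gain in ranking:
    print(f"{name}: {gain:.3f}")
```

A ranking method of this kind scores each feature independently of the others, which is what distinguishes it from subset selection methods that search over combinations of features.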

This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).

Share & Cite This Article

MDPI and ACS Style

Balogun, A.O.; Basri, S.; Abdulkadir, S.J.; Hashim, A.S. Performance Analysis of Feature Selection Methods in Software Defect Prediction: A Search Method Approach. Appl. Sci. 2019, 9, 2764.

Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers.

Appl. Sci. EISSN 2076-3417. Published by MDPI AG, Basel, Switzerland.