Movatterモバイル変換

Regular Section

A Hybrid Feature Selection Method for Software Fault Prediction

Yiheng JIAN, Xiao YU, Zhou XU, Ziyi MA

Author information

Yiheng JIAN
School of Information and Electronics, Beijing Institute of Technology
Xiao YU
School of Computer Science, Wuhan University
Zhou XU
School of Computer Science, Wuhan University
Ziyi MA
School of Computer Science, Huazhong University of Science and Technology

Corresponding author

Keywords:fault prediction,feature selection,hierarchical agglomerative clustering

JOURNALFREE ACCESS

2019 Volume E102.DIssue 10Pages 1966-1975

DOIhttps://doi.org/10.1587/transinf.2019EDP7033

Details

Published: October 01, 2019Manuscript Received: February 01, 2019Released on J-STAGE: October 01, 2019Accepted: -Advance online publication: -Manuscript Revised: May 25, 2019

Download PDF(473K)

Download citationRIS

(compatible with EndNote, Reference Manager, ProCite, RefWorks)

BIB TEX

(compatible with BibDesk, LaTeX)

Text

How to download citation

Article overview

Abstract

Fault prediction aims to identify whether a software module is defect-prone or not according to metrics that are mined from software projects. These metric values, also known as features, may involve irrelevance and redundancy, which hurt the performance of fault prediction models. In order to filter out irrelevant and redundant features, a Hybrid Feature Selection (abbreviated as HFS) method for software fault prediction is proposed. The proposed HFS method consists of two major stages. First, HFS groups features with hierarchical agglomerative clustering; second, HFS selects the most valuable features from each cluster to remove irrelevant and redundant ones based on two wrapper based strategies. The empirical evaluation was conducted on 11 widely-studied NASA projects, using three different classifiers with four performance metrics (precision, recall, F-measure, and AUC). Comparison with six filter-based feature selection methods demonstrates that HFS achieves higher average F-measure and AUC values. Compared with two classic wrapper feature selection methods, HFS can obtain a competitive prediction performance in terms of average AUC while significantly reducing the computation cost of the wrapper process.

References (29)

Movatterモバイル変換

Corresponding author

Register with J-STAGE for free!