Open Access Article

An Improved CNN Model for Within-Project Software Defect Prediction

by Cong Pan 1,2,*, Minyan Lu 1,2,*, Biao Xu 1,2 and Houleng Gao 1,2
1 The Key Laboratory on Reliability and Environmental Engineering Technology, Beihang University, Beijing 100191, China
2 School of Reliability and System Engineering, Beihang University, Beijing 100191, China
* Authors to whom correspondence should be addressed.
Appl. Sci. 2019, 9(10), 2138
Received: 18 March 2019 / Revised: 13 May 2019 / Accepted: 22 May 2019 / Published: 24 May 2019
(This article belongs to the Section Computing and Artificial Intelligence)
To improve software reliability, software defect prediction is used to find software bugs and prioritize testing efforts. Recently, some researchers have introduced deep learning models, such as the deep belief network (DBN) and the state-of-the-art convolutional neural network (CNN), which use features generated automatically from abstract syntax trees (ASTs) to improve defect prediction performance. However, the research on the CNN model failed to reach clear conclusions because of its limited dataset size, insufficiently repeated experiments, and outdated baseline selection. To address these problems, we built the PROMISE Source Code (PSC) dataset, which enlarges the original dataset used in the CNN research; we named that original dataset the Simplified PROMISE Source Code (SPSC) dataset. We then proposed an improved CNN model for within-project defect prediction (WPDP) and compared our results with the existing CNN results and with an empirical study. Our experiments were based on a 30-repetition holdout validation and a 10 × 10 cross-validation. The results showed that our improved CNN model was comparable to the existing CNN model and significantly outperformed the state-of-the-art machine learning models for WPDP. Furthermore, we defined hyperparameter instability and examined the threat and opportunity it presents for deep learning models in defect prediction.
Keywords: CNN model; within-project defect prediction; abstract syntax tree; deep learning; hyperparameter instability
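
As an illustration of the 10 × 10 cross-validation mentioned in the abstract, the following minimal sketch generates the 100 train/test index splits that such a protocol produces (10 repetitions of 10-fold cross-validation). The function name `repeated_kfold`, the seed, and the plain (unstratified) shuffling are assumptions made for this example; they are not details taken from the article.

```python
import random

def repeated_kfold(n_samples, n_splits=10, n_repeats=10, seed=0):
    """Yield (train_idx, test_idx) pairs for repeated k-fold cross-validation.

    Hypothetical helper: each repetition reshuffles the sample indices,
    cuts them into n_splits folds, and uses every fold once as the test set.
    """
    rng = random.Random(seed)
    for _ in range(n_repeats):
        idx = list(range(n_samples))
        rng.shuffle(idx)
        # Fold i takes every n_splits-th index starting at position i,
        # so fold sizes differ by at most one.
        folds = [idx[i::n_splits] for i in range(n_splits)]
        for k in range(n_splits):
            train = [j for f, fold in enumerate(folds) if f != k for j in fold]
            yield train, folds[k]

# Example: 95 source files give 10 x 10 = 100 train/test splits.
splits = list(repeated_kfold(95))
print(len(splits))
```

In a setup like this, a model would be trained and scored once per split and the scores averaged, so that a single lucky or unlucky partition has less influence on the reported performance.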
MDPI and ACS Style

Pan, C.; Lu, M.; Xu, B.; Gao, H. An Improved CNN Model for Within-Project Software Defect Prediction. Appl. Sci. 2019, 9, 2138.

