Your browser doesn't support javascript.
loading
Semantic and traditional feature fusion for software defect prediction using hybrid deep learning model.
Abdu, Ahmed; Zhai, Zhengjun; Abdo, Hakim A; Algabri, Redhwan; Al-Masni, Mohammed A; Muhammad, Mannan Saeed; Gu, Yeong Hyeon.
Afiliação
  • Abdu A; School of Software, Northwestern Polytechnical University, Xi'an, 710072, China.
  • Zhai Z; School of Software, Northwestern Polytechnical University, Xi'an, 710072, China. zhaizjun@nwpu.edu.cn.
  • Abdo HA; School of Computer Science, Northwestern Polytechnical University, Xi'an, 710072, China. zhaizjun@nwpu.edu.cn.
  • Algabri R; School of Computer Science, Dr.Babasaheb Ambedkar Marathwada University, Aurangabad, India.
  • Al-Masni MA; Research Institute of Engineering and Technology, Hanyang University, Ansan, 15588, Korea. redhwan@hanyang.ac.kr.
  • Muhammad MS; Department of Artificial Intelligence and Data Science, College of Software and Convergence Technology, Sejong University, Seoul, 05006, Republic of Korea.
  • Gu YH; Department of AI and Robotics, Sejong University, Seoul, 05006, South Korea.
Sci Rep ; 14(1): 14771, 2024 Jul 01.
Article em En | MEDLINE | ID: mdl-38951608
ABSTRACT
Software defect prediction aims to find a reliable method for predicting defects in a particular software project and assisting software engineers in allocating limited resources to release high-quality software products. While most earlier research has concentrated on employing traditional features, current methodologies are increasingly directed toward extracting semantic features from source code. Traditional features often fall short in identifying semantic differences within programs, differences that are essential for the development of reliable and effective prediction models. In contrast, semantic features cannot present statistical metrics about the source code, such as the code size and complexity. Thus, using only one kind of feature negatively affects prediction performance. To bridge the gap between the traditional and semantic features, we propose a novel defect prediction model that integrates traditional and semantic features using a hybrid deep learning approach to address this limitation. Specifically, our model employs a hybrid CNN-MLP classifier the convolutional neural network (CNN) processes semantic features extracted from projects' abstract syntax trees (ASTs) using Word2vec. In contrast, the traditional features extracted from the dataset repository are processed by a multilayer perceptron (MLP). Outputs of CNN and MLP are then integrated and fed into a fully connected layer for defect prediction. Extensive experiments are conducted on various open-source projects to validate CNN-MLP's effectiveness. Experimental results indicate that CNN-MLP can significantly enhance defect prediction performance. Furthermore, CNN-MLP's improvements outperform existing methods in non-effort-aware and effort-aware cases.

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Revista: Sci Rep Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Revista: Sci Rep Ano de publicação: 2024 Tipo de documento: Article