Your browser doesn't support javascript.
loading
Development of partial least squares regression with discriminant analysis for software bug prediction.
Rajkó, Róbert; Siket, István; Hegedus, Péter; Ferenc, Rudolf.
Afiliação
  • Rajkó R; E-Dimension Ltd., Szeged, Hungary.
  • Siket I; Academic staff of Doctoral School of Computer Science, University of Szeged, Szeged, Hungary.
  • Hegedus P; Department of Software Engineering, University of Szeged, Szeged, Hungary.
  • Ferenc R; Academic staff of Doctoral School of Computer Science, University of Szeged, Szeged, Hungary.
Heliyon ; 10(15): e35045, 2024 Aug 15.
Article em En | MEDLINE | ID: mdl-39166017
ABSTRACT
Many prediction models and approaches have been introduced during the past decades that try to forecast bugged code elements based on static source code metrics, change and history metrics, or both. However, there is still no universal best solution to this problem, as most suitable features and models vary from dataset to dataset and depend on the context in which we use them. Therefore, novel approaches and further studies on this topic are highly necessary. In this paper, we employ a chemometric approach - Partial Least Squares with Discriminant Analysis (PLS-DA) - for predicting bug prone Classes in Java programs using static source code metrics. PLS-DA is successfully applied within the field of chemometrics, but to our best knowledge, it has never been used before in the software maintenance domain for predicting bugs. In addition, we have used rigorous statistical treatments and evaluation for representing the software engineering results. We show that our PLS-DA based prediction model achieves superior performances compared to the state-of-the-art approaches (i.e. F-measure of 0.44-0.47 at 90% confidence level) when no data re-sampling applied and comparable to others when applying up-sampling on the largest open bug dataset, while training the model is significantly faster, thus finding optimal parameters is much easier. In terms of completeness, which measures the amount of bugs contained in the Java Classes predicted to be defective, PLS-DA outperforms every other algorithm it found 69.3% and 79.4% of the total bugs with no re-sampling and up-sampling, respectively.
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Revista: Heliyon Ano de publicação: 2024 Tipo de documento: Article País de afiliação: Hungria

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Revista: Heliyon Ano de publicação: 2024 Tipo de documento: Article País de afiliação: Hungria