Software reusability dataset based on static analysis metrics and reuse rate information.
Papamichail, Michail D; Diamantopoulos, Themistoklis; Symeonidis, Andreas L.
Affiliation
  • Papamichail MD; Intelligent Systems and Software Engineering Laboratory, Electrical and Computer Engineering Dept., Aristotle University of Thessaloniki, Greece.
  • Diamantopoulos T; Intelligent Systems and Software Engineering Laboratory, Electrical and Computer Engineering Dept., Aristotle University of Thessaloniki, Greece.
  • Symeonidis AL; Intelligent Systems and Software Engineering Laboratory, Electrical and Computer Engineering Dept., Aristotle University of Thessaloniki, Greece.
Data Brief ; 27: 104687, 2019 Dec.
Article in English | MEDLINE | ID: mdl-31720337
ABSTRACT
The widely adopted component-based development paradigm considers the reuse of suitable software components a primary criterion for successful software development. As a result, various research efforts are directed towards evaluating the extent to which a software component is reusable. Prior efforts follow expert-based approaches; however, the continuous growth of open-source software allows the introduction of data-driven alternatives. In this context, we have generated a dataset that harnesses information residing in online code hosting facilities and introduces the actual reuse rate of software components as a measure of their reusability. To do so, we analyzed the most popular projects included in the Maven registry and computed a large number of static analysis metrics at both class and package levels using the SourceMeter tool [2]. These metrics quantify six major source code properties: complexity, cohesion, coupling, inheritance, documentation, and size. For these projects, we additionally computed their reuse rate using our self-developed code search engine, AGORA [5]. The generated dataset contains analysis information on more than 24,000 classes and 2,000 packages and can thus serve as the information basis for the design and development of data-driven reusability evaluation methodologies. The dataset is related to the research article entitled "Measuring the Reusability of Software Components using Static Analysis Metrics and Reuse Rate Information" [1].

Full text: 1 Collections: 01-international Database: MEDLINE Language: En Journal: Data Brief Publication year: 2019 Document type: Article
