Your browser doesn't support javascript.
loading
Sustained software development, not number of citations or journal choice, is indicative of accurate bioinformatic software.
Gardner, Paul P; Paterson, James M; McGimpsey, Stephanie; Ashari-Ghomi, Fatemeh; Umu, Sinan U; Pawlik, Aleksandra; Gavryushkin, Alex; Black, Michael A.
Afiliação
  • Gardner PP; Department of Biochemistry, University of Otago, Dunedin, New Zealand. paul.gardner@otago.ac.nz.
  • Paterson JM; Biomolecular Interaction Centre, University of Canterbury, Christchurch, New Zealand. paul.gardner@otago.ac.nz.
  • McGimpsey S; Department of Civil and Natural Resources Engineering, University of Canterbury, Christchurch, New Zealand.
  • Ashari-Ghomi F; Parasites and Microbes, Wellcome Sanger Institute, Hinxton, UK.
  • Umu SU; Research Group for Genomic Epidemiology, Technical University of Denmark, Kongens Lyngby, Denmark.
  • Pawlik A; Department of Research, Cancer Registry of Norway, Oslo, Norway.
  • Gavryushkin A; Manaaki Whenua - Landcare Research, Lincoln, New Zealand.
  • Black MA; Department of Computer Science, University of Otago, Dunedin, New Zealand.
Genome Biol ; 23(1): 56, 2022 02 16.
Article em En | MEDLINE | ID: mdl-35172880
BACKGROUND: Computational biology provides software tools for testing and making inferences about biological data. In the face of increasing volumes of data, heuristic methods that trade software speed for accuracy may be employed. We have studied these trade-offs using the results of a large number of independent software benchmarks, and evaluated whether external factors, including speed, author reputation, journal impact, recency and developer efforts, are indicative of accurate software. RESULTS: We find that software speed, author reputation, journal impact, number of citations and age are unreliable predictors of software accuracy. This is unfortunate because these are frequently cited reasons for selecting software tools. However, GitHub-derived statistics and high version numbers show that accurate bioinformatic software tools are generally the product of many improvements over time. We also find an excess of slow and inaccurate bioinformatic software tools, and this is consistent across many sub-disciplines. There are few tools that are middle-of-road in terms of accuracy and speed trade-offs. CONCLUSIONS: Our findings indicate that accurate bioinformatic software is primarily the product of long-term commitments to software development. In addition, we hypothesise that bioinformatics software suffers from publication bias. Software that is intermediate in terms of both speed and accuracy may be difficult to publish-possibly due to author, editor and reviewer practises. This leaves an unfortunate hole in the literature, as ideal tools may fall into this gap. High accuracy tools are not always useful if they are slow, while high speed is not useful if the results are also inaccurate.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Software / Biologia Computacional Tipo de estudo: Prognostic_studies Idioma: En Ano de publicação: 2022 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Software / Biologia Computacional Tipo de estudo: Prognostic_studies Idioma: En Ano de publicação: 2022 Tipo de documento: Article