Evaluation guidelines for machine learning tools in the chemical sciences.
Nat Rev Chem; 6(6): 428-442, 2022 Jun.
Article in English | MEDLINE | ID: mdl-37117429
Machine learning (ML) promises to tackle the grand challenges in chemistry and to speed up the generation, improvement and/or ordering of research hypotheses. Despite the overarching applicability of ML workflows, one usually finds diverse evaluation study designs. The current heterogeneity in evaluation techniques and metrics makes it difficult, or even impossible, to compare new algorithms and assess their relevance. Ultimately, this may delay the digitalization of chemistry at scale and confuse method developers, experimentalists, reviewers and journal editors. In this Perspective, we critically discuss a set of method development and evaluation guidelines for different types of ML-based publications, emphasizing supervised learning. We provide a diverse collection of examples from various authors and disciplines in chemistry. While taking into account varying accessibility across research groups, our recommendations focus on reporting completeness and on standardizing comparisons between tools. We aim to further contribute to improved ML transparency and credibility by suggesting a checklist of retrospective/prospective tests and dissecting their importance. We envisage that the wide adoption and continuous updating of best practices will encourage an informed use of ML on real-world problems in the chemical sciences.
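The abstract's call for reporting completeness and standardized comparisons can be illustrated with a minimal sketch, not taken from the paper itself: a supervised-learning evaluation that fixes the random seed, repeats the train/test split, and reports more than one error metric with its spread. All names here (`evaluate`, the toy dataset, the mean-predictor baseline) are hypothetical placeholders standing in for a real chemical-property model.

```python
# Hedged sketch of a reproducible evaluation protocol: fixed seed,
# repeated random splits, and multiple metrics reported with spread.
# The "model" is a trivial training-set-mean baseline for illustration.
import random
import statistics

def evaluate(X, y, n_repeats=5, test_frac=0.2, seed=0):
    """Repeat random splits under a fixed seed; report MAE/RMSE mean and stdev."""
    rng = random.Random(seed)
    maes, rmses = [], []
    for _ in range(n_repeats):
        idx = list(range(len(y)))
        rng.shuffle(idx)
        n_test = max(1, int(test_frac * len(y)))
        test, train = idx[:n_test], idx[n_test:]
        # Baseline "model": predict the training-set mean.
        pred = statistics.mean(y[i] for i in train)
        errs = [y[i] - pred for i in test]
        maes.append(statistics.mean(abs(e) for e in errs))
        rmses.append(statistics.mean(e * e for e in errs) ** 0.5)
    return {
        "MAE": (statistics.mean(maes), statistics.stdev(maes)),
        "RMSE": (statistics.mean(rmses), statistics.stdev(rmses)),
    }

# Toy data standing in for a chemical property dataset.
X = list(range(20))
y = [0.5 * x + 1.0 for x in X]
print(evaluate(X, y))
```

Reporting the seed, split protocol, and metric spread in this way is what allows two tools to be compared on equal footing, which is the gap in current practice the abstract describes.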
Full text:
1
Collections:
01-international
Database:
MEDLINE
Study type:
Guideline
Language:
En
Journal:
Nat Rev Chem
Publication year:
2022
Document type:
Article