Your browser doesn't support javascript.
loading
Comprehensive Prediction of Molecular Recognition in a Combinatorial Chemical Space Using Machine Learning.
Taguchi, Alexander T; Boyd, James; Diehnelt, Chris W; Legutki, Joseph B; Zhao, Zhan-Gong; Woodbury, Neal W.
Afiliação
  • Taguchi AT; RubrYc, Inc., 733 Industrial Road, San Carlos, California 94403, United States.
  • Boyd J; HealthTell, Inc., 145 S 79th Street, Chandler, Arizona 85226, United States.
  • Diehnelt CW; Center for Innovations in Medicine at the Biodesign Institute, Arizona State University, Tempe, Arizona 85287, United States.
  • Legutki JB; HealthTell, Inc., 145 S 79th Street, Chandler, Arizona 85226, United States.
  • Zhao ZG; Center for Innovations in Medicine at the Biodesign Institute, Arizona State University, Tempe, Arizona 85287, United States.
  • Woodbury NW; Center for Innovations in Medicine at the Biodesign Institute, Arizona State University, Tempe, Arizona 85287, United States.
ACS Comb Sci ; 22(10): 500-508, 2020 10 12.
Article em En | MEDLINE | ID: mdl-32786325
ABSTRACT
In combinatorial chemical approaches, optimizing the composition and arrangement of building blocks toward a particular function has been done using a number of methods, including high throughput molecular screening, molecular evolution, and computational prescreening. Here, a different approach is considered that uses sparse measurements of library molecules as the input to a machine learning algorithm which generates a comprehensive, quantitative relationship between covalent molecular structure and function that can then be used to predict the function of any molecule in the possible combinatorial space. To test the feasibility of the approach, a defined combinatorial chemical space consisting of ∼1012 possible linear combinations of 16 different amino acids was used. The binding of a very sparse, but nearly random, sampling of this amino acid sequence space to 9 different protein targets is measured and used to generate a general relationship between peptide sequence and binding for each target. Surprisingly, measuring as little as a few hundred to a few thousand of the ∼1012 possible molecules provides sufficient training to be highly predictive of the binding of the remaining molecules in the combinatorial space. Furthermore, measuring only amino acid sequences that bind weakly to a target allows the accurate prediction of which sequences will bind 10-100 times more strongly. Thus, the molecular recognition information contained in a tiny fraction of molecules in this combinatorial space is sufficient to characterize any set of molecules randomly selected from the entire space, a fact that potentially has significant implications for the design of new chemical function using combinatorial chemical libraries.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Peptídeos / Aprendizado de Máquina Idioma: En Ano de publicação: 2020 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Peptídeos / Aprendizado de Máquina Idioma: En Ano de publicação: 2020 Tipo de documento: Article