Your browser doesn't support javascript.
loading
Deep Dive into Machine Learning Models for Protein Engineering.
Xu, Yuting; Verma, Deeptak; Sheridan, Robert P; Liaw, Andy; Ma, Junshui; Marshall, Nicholas M; McIntosh, John; Sherer, Edward C; Svetnik, Vladimir; Johnston, Jennifer M.
Afiliação
  • Xu Y; Biometrics Research, Merck & Co., Inc., Rahway, New Jersey 07065, United States.
  • Verma D; Computational and Structural Chemistry, Merck & Co., Inc., Kenilworth, New Jersey 07033, United States.
  • Sheridan RP; Computational and Structural Chemistry, Merck & Co., Inc., Kenilworth, New Jersey 07033, United States.
  • Liaw A; Biometrics Research, Merck & Co., Inc., Rahway, New Jersey 07065, United States.
  • Ma J; Early Oncology Statistics, Merck & Co., Inc., Rahway, New Jersey 07065, United States.
  • Marshall NM; Invenra, Inc., 505 South Rosa Road, Madison, Wisconsin 53719, United States.
  • McIntosh J; Process Research & Development, Merck & Co., Inc., Rahway, New Jersey 07065, United States.
  • Sherer EC; Computational and Structural Chemistry, Merck & Co., Inc., Kenilworth, New Jersey 07033, United States.
  • Svetnik V; Biometrics Research, Merck & Co., Inc., Rahway, New Jersey 07065, United States.
  • Johnston JM; Computational and Structural Chemistry, Merck & Co., Inc., Kenilworth, New Jersey 07033, United States.
J Chem Inf Model ; 60(6): 2773-2790, 2020 06 22.
Article em En | MEDLINE | ID: mdl-32250622
ABSTRACT
Protein redesign and engineering has become an important task in pharmaceutical research and development. Recent advances in technology have enabled efficient protein redesign by mimicking natural evolutionary mutation, selection, and amplification steps in the laboratory environment. For any given protein, the number of possible mutations is astronomical. It is impractical to synthesize all sequences or even to investigate all functionally interesting variants. Recently, there has been an increased interest in using machine learning to assist protein redesign, since prediction models can be used to virtually screen a large number of novel sequences. However, many state-of-the-art machine learning models, especially deep learning models, have not been extensively explored. Moreover, only a small selection of protein sequence descriptors has been considered. In this work, the performance of prediction models built using an array of machine learning methods and protein descriptor types, including two novel, single amino acid descriptors and one structure-based three-dimensional descriptor, is benchmarked. The predictions were evaluated on a diverse collection of public and proprietary data sets, using a variety of evaluation metrics. The results of this comparison suggest that Convolution Neural Network models built with amino acid property descriptors are the most widely applicable to the types of protein redesign problems faced in the pharmaceutical industry.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Redes Neurais de Computação / Aprendizado de Máquina Tipo de estudo: Prognostic_studies Idioma: En Revista: J Chem Inf Model Assunto da revista: INFORMATICA MEDICA / QUIMICA Ano de publicação: 2020 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Redes Neurais de Computação / Aprendizado de Máquina Tipo de estudo: Prognostic_studies Idioma: En Revista: J Chem Inf Model Assunto da revista: INFORMATICA MEDICA / QUIMICA Ano de publicação: 2020 Tipo de documento: Article País de afiliação: Estados Unidos