Your browser doesn't support javascript.
loading
Rank-ordering of known enzymes as starting points for re-engineering novel substrate activity using a convolutional neural network.
Upadhyay, Vikas; Boorla, Veda Sheersh; Maranas, Costas D.
Afiliación
  • Upadhyay V; Department of Chemical Engineering, The Pennsylvania State University, University Park, PA, 16802, USA.
  • Boorla VS; Department of Chemical Engineering, The Pennsylvania State University, University Park, PA, 16802, USA.
  • Maranas CD; Department of Chemical Engineering, The Pennsylvania State University, University Park, PA, 16802, USA. Electronic address: costas@psu.edu.
Metab Eng ; 78: 171-182, 2023 07.
Article en En | MEDLINE | ID: mdl-37301359
Retro-biosynthetic approaches have made significant advances in predicting synthesis routes of target biofuel, bio-renewable or bio-active molecules. The use of only cataloged enzymatic activities limits the discovery of new production routes. Recent retro-biosynthetic algorithms increasingly use novel conversions that require altering the substrate or cofactor specificities of existing enzymes while connecting pathways leading to a target metabolite. However, identifying and re-engineering enzymes for desired novel conversions are currently the bottlenecks in implementing such designed pathways. Herein, we present EnzRank, a convolutional neural network (CNN) based approach, to rank-order existing enzymes in terms of their suitability to undergo successful protein engineering through directed evolution or de novo design towards a desired specific substrate activity. We train the CNN model on 11,800 known active enzyme-substrate pairs from the BRENDA database as positive samples and data generated by scrambling these pairs as negative samples using substrate dissimilarity between an enzyme's native substrate and all other molecules present in the dataset using Tanimoto similarity score. EnzRank achieves an average recovery rate of 80.72% and 73.08% for positive and negative pairs on test data after using a 10-fold holdout method for training and cross-validation. We further developed a web-based user interface (available at https://huggingface.co/spaces/vuu10/EnzRank) to predict enzyme-substrate activity using SMILES strings of substrates and enzyme sequence as input to allow convenient and easy-to-use access to EnzRank. In summary, this effort can aid de novo pathway design tools to prioritize starting enzyme re-engineering candidates for novel reactions as well as in predicting the potential secondary activity of enzymes in cell metabolism.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Algoritmos / Redes Neurales de la Computación Tipo de estudio: Prognostic_studies Idioma: En Revista: Metab Eng Asunto de la revista: ENGENHARIA BIOMEDICA / METABOLISMO Año: 2023 Tipo del documento: Article País de afiliación: Estados Unidos Pais de publicación: Bélgica

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Algoritmos / Redes Neurales de la Computación Tipo de estudio: Prognostic_studies Idioma: En Revista: Metab Eng Asunto de la revista: ENGENHARIA BIOMEDICA / METABOLISMO Año: 2023 Tipo del documento: Article País de afiliación: Estados Unidos Pais de publicación: Bélgica