Computational scoring and experimental evaluation of enzymes generated by neural networks.

Johnson, Sean R; Fu, Xiaozhi; Viknander, Sandra; Goldin, Clara; Monaco, Sarah; Zelezniak, Aleksej; Yang, Kevin K

Johnson, Sean R; Fu, Xiaozhi; Viknander, Sandra; Goldin, Clara; Monaco, Sarah; Zelezniak, Aleksej; Yang, Kevin K.

Afiliação

Johnson SR; New England Biolabs, Ipswich, MA, USA.
Fu X; Department of Life Sciences, Chalmers University of Technology, Gothenburg, Sweden.
Viknander S; Department of Life Sciences, Chalmers University of Technology, Gothenburg, Sweden.
Goldin C; Department of Life Sciences, Chalmers University of Technology, Gothenburg, Sweden.
Zelezniak A; Department of Life Sciences, Chalmers University of Technology, Gothenburg, Sweden. aleksej.zelezniak@chalmers.se.
Yang KK; Institute of Biotechnology, Life Sciences Centre, Vilnius University, Vilnius, Lithuania. aleksej.zelezniak@chalmers.se.

Nat Biotechnol ; 2024 Apr 23.

Article em En | MEDLINE | ID: mdl-38653796

ABSTRACT

ABSTRACT

In recent years, generative protein sequence models have been developed to sample novel sequences. However, predicting whether generated proteins will fold and function remains challenging. We evaluate a set of 20 diverse computational metrics to assess the quality of enzyme sequences produced by three contrasting generative models ancestral sequence reconstruction, a generative adversarial network and a protein language model. Focusing on two enzyme families, we expressed and purified over 500 natural and generated sequences with 70-90% identity to the most similar natural sequences to benchmark computational metrics for predicting in vitro enzyme activity. Over three rounds of experiments, we developed a computational filter that improved the rate of experimental success by 50-150%. The proposed metrics and models will drive protein engineering research by serving as a benchmark for generative protein sequence models and helping to select active variants for experimental testing.

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Revista: Nat Biotechnol Assunto da revista: BIOTECNOLOGIA Ano de publicação: 2024 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google