Your browser doesn't support javascript.
loading
Illuminating protein space with a programmable generative model.
Ingraham, John B; Baranov, Max; Costello, Zak; Barber, Karl W; Wang, Wujie; Ismail, Ahmed; Frappier, Vincent; Lord, Dana M; Ng-Thow-Hing, Christopher; Van Vlack, Erik R; Tie, Shan; Xue, Vincent; Cowles, Sarah C; Leung, Alan; Rodrigues, João V; Morales-Perez, Claudio L; Ayoub, Alex M; Green, Robin; Puentes, Katherine; Oplinger, Frank; Panwar, Nishant V; Obermeyer, Fritz; Root, Adam R; Beam, Andrew L; Poelwijk, Frank J; Grigoryan, Gevorg.
Affiliation
  • Ingraham JB; Generate Biomedicines, Somerville, MA, USA.
  • Baranov M; Generate Biomedicines, Somerville, MA, USA.
  • Costello Z; Generate Biomedicines, Somerville, MA, USA.
  • Barber KW; Generate Biomedicines, Somerville, MA, USA.
  • Wang W; Generate Biomedicines, Somerville, MA, USA.
  • Ismail A; Generate Biomedicines, Somerville, MA, USA.
  • Frappier V; Generate Biomedicines, Somerville, MA, USA.
  • Lord DM; Generate Biomedicines, Somerville, MA, USA.
  • Ng-Thow-Hing C; Generate Biomedicines, Somerville, MA, USA.
  • Van Vlack ER; Generate Biomedicines, Somerville, MA, USA.
  • Tie S; Generate Biomedicines, Somerville, MA, USA.
  • Xue V; Generate Biomedicines, Somerville, MA, USA.
  • Cowles SC; Generate Biomedicines, Somerville, MA, USA.
  • Leung A; Generate Biomedicines, Somerville, MA, USA.
  • Rodrigues JV; Generate Biomedicines, Somerville, MA, USA.
  • Morales-Perez CL; Generate Biomedicines, Somerville, MA, USA.
  • Ayoub AM; Generate Biomedicines, Somerville, MA, USA.
  • Green R; Generate Biomedicines, Somerville, MA, USA.
  • Puentes K; Generate Biomedicines, Somerville, MA, USA.
  • Oplinger F; Generate Biomedicines, Somerville, MA, USA.
  • Panwar NV; Generate Biomedicines, Somerville, MA, USA.
  • Obermeyer F; Generate Biomedicines, Somerville, MA, USA.
  • Root AR; Generate Biomedicines, Somerville, MA, USA.
  • Beam AL; Generate Biomedicines, Somerville, MA, USA.
  • Poelwijk FJ; Generate Biomedicines, Somerville, MA, USA.
  • Grigoryan G; Generate Biomedicines, Somerville, MA, USA. ggrigoryan@generatebiomedicines.com.
Nature ; 623(7989): 1070-1078, 2023 Nov.
Article in En | MEDLINE | ID: mdl-37968394
ABSTRACT
Three billion years of evolution has produced a tremendous diversity of protein molecules1, but the full potential of proteins is likely to be much greater. Accessing this potential has been challenging for both computation and experiments because the space of possible protein molecules is much larger than the space of those likely to have functions. Here we introduce Chroma, a generative model for proteins and protein complexes that can directly sample novel protein structures and sequences, and that can be conditioned to steer the generative process towards desired properties and functions. To enable this, we introduce a diffusion process that respects the conformational statistics of polymer ensembles, an efficient neural architecture for molecular systems that enables long-range reasoning with sub-quadratic scaling, layers for efficiently synthesizing three-dimensional structures of proteins from predicted inter-residue geometries and a general low-temperature sampling algorithm for diffusion models. Chroma achieves protein design as Bayesian inference under external constraints, which can involve symmetries, substructure, shape, semantics and even natural-language prompts. The experimental characterization of 310 proteins shows that sampling from Chroma results in proteins that are highly expressed, fold and have favourable biophysical properties. The crystal structures of two designed proteins exhibit atomistic agreement with Chroma samples (a backbone root-mean-square deviation of around 1.0 Å). With this unified approach to protein design, we hope to accelerate the programming of protein matter to benefit human health, materials science and synthetic biology.
Subject(s)

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Protein Conformation / Algorithms / Computer Simulation / Proteins Limits: Humans Language: En Journal: Nature Year: 2023 Document type: Article Affiliation country: United States

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Protein Conformation / Algorithms / Computer Simulation / Proteins Limits: Humans Language: En Journal: Nature Year: 2023 Document type: Article Affiliation country: United States