RESUMEN
Protein denoising diffusion probabilistic models are used for the de novo generation of protein backbones but are limited in their ability to guide generation of proteins with sequence-specific attributes and functional properties. To overcome this limitation, we developed ProteinGenerator (PG), a sequence space diffusion model based on RoseTTAFold that simultaneously generates protein sequences and structures. Beginning from a noised sequence representation, PG generates sequence and structure pairs by iterative denoising, guided by desired sequence and structural protein attributes. We designed thermostable proteins with varying amino acid compositions and internal sequence repeats and cage bioactive peptides, such as melittin. By averaging sequence logits between diffusion trajectories with distinct structural constraints, we designed multistate parent-child protein triples in which the same sequence folds to different supersecondary structures when intact in the parent versus split into two child domains. PG design trajectories can be guided by experimental sequence-activity data, providing a general approach for integrated computational and experimental optimization of protein function.
RESUMEN
Designing proteins with tailored structures and functions is a long-standing goal in bioengineering. Recently, deep learning advances have enabled protein structure prediction at near-experimental accuracy, which has catalyzed progress in protein design as well. We review recent studies that use structure-prediction neural networks to design proteins, via approaches such as activation maximization, inpainting, or denoising diffusion. These methods have led to major improvements over previous methods in wet-lab success rates for designing protein binders, metalloproteins, enzymes, and oligomeric assemblies. These results show that structure-prediction models are a powerful foundation for developing protein-design tools and suggest that continued improvement of their accuracy and generality will be key to unlocking the full potential of protein design.
Asunto(s)
Proteínas , Proteínas/química , Conformación Proteica , Redes Neurales de la Computación , Modelos Moleculares , Ingeniería de Proteínas , Pliegue de ProteínaRESUMEN
The binding and catalytic functions of proteins are generally mediated by a small number of functional residues held in place by the overall protein structure. Here, we describe deep learning approaches for scaffolding such functional sites without needing to prespecify the fold or secondary structure of the scaffold. The first approach, "constrained hallucination," optimizes sequences such that their predicted structures contain the desired functional site. The second approach, "inpainting," starts from the functional site and fills in additional sequence and structure to create a viable protein scaffold in a single forward pass through a specifically trained RoseTTAFold network. We use these two methods to design candidate immunogens, receptor traps, metalloproteins, enzymes, and protein-binding proteins and validate the designs using a combination of in silico and experimental tests.
Asunto(s)
Aprendizaje Profundo , Ingeniería de Proteínas , Proteínas , Sitios de Unión , Catálisis , Unión Proteica , Ingeniería de Proteínas/métodos , Pliegue de Proteína , Estructura Secundaria de Proteína , Proteínas/químicaRESUMEN
Rho family GTPases are activated with precise spatiotemporal control by guanine nucleotide exchange factors (GEFs). Guanine exchange factor H1 (GEF-H1), a RhoA activator, is thought to act as an integrator of microtubule (MT) and actin dynamics in diverse cell functions. Here we identify a GEF-H1 autoinhibitory sequence and exploit it to produce an activation biosensor to quantitatively probe the relationship between GEF-H1 conformational change, RhoA activity, and edge motion in migrating cells with micrometer- and second-scale resolution. Simultaneous imaging of MT dynamics and GEF-H1 activity revealed that autoinhibited GEF-H1 is localized to MTs, while MT depolymerization subadjacent to the cell cortex promotes GEF-H1 activation in an ~5-µm-wide peripheral band. GEF-H1 is further regulated by Src phosphorylation, activating GEF-H1 in a narrower band ~0-2 µm from the cell edge, in coordination with cell protrusions. This indicates a synergistic intersection between MT dynamics and Src signaling in RhoA activation through GEF-H1.