ABSTRACT
Protein structures are essential to understanding cellular processes in molecular detail. While advances in artificial intelligence revealed the tertiary structure of proteins at scale, their quaternary structure remains mostly unknown. We devise a scalable strategy based on AlphaFold2 to predict homo-oligomeric assemblies across four proteomes spanning the tree of life. Our results suggest that approximately 45% of an archaeal proteome and a bacterial proteome and 20% of two eukaryotic proteomes form homomers. Our predictions accurately capture protein homo-oligomerization, recapitulate megadalton complexes, and unveil hundreds of homo-oligomer types, including three confirmed experimentally by structure determination. Integrating these datasets with omics information suggests that a majority of known protein complexes are symmetric. Finally, these datasets provide a structural context for interpreting disease mutations and reveal coiled-coil regions as major enablers of quaternary structure evolution in human. Our strategy is applicable to any organism and provides a comprehensive view of homo-oligomerization in proteomes.
Subject(s)
Artificial Intelligence , Proteins , Proteome , Humans , Proteins/chemistry , Proteins/genetics , Archaea/chemistry , Archaea/genetics , Eukaryota/chemistry , Eukaryota/genetics , Bacteria/chemistry , Bacteria/geneticsABSTRACT
The field of protein design has made remarkable progress over the past decade. Historically, the low reliability of purely structure-based design methods limited their application, but recent strategies that combine structure-based and sequence-based calculations, as well as machine learning tools, have dramatically improved protein engineering and design. In this Review, we discuss how these methods have enabled the design of increasingly complex structures and therapeutically relevant activities. Additionally, protein optimization methods have improved the stability and activity of complex eukaryotic proteins. Thanks to their increased reliability, computational design methods have been applied to improve therapeutics and enzymes for green chemistry and have generated vaccine antigens, antivirals and drug-delivery nano-vehicles. Moreover, the high success of design methods reflects an increased understanding of basic rules that govern the relationships among protein sequence, structure and function. However, de novo design is still limited mostly to α-helix bundles, restricting its potential to generate sophisticated enzymes and diverse protein and small-molecule binders. Designing complex protein structures is a challenging but necessary next step if we are to realize our objective of generating new-to-nature activities.
Subject(s)
Protein Engineering , Proteins , Protein Engineering/methods , Humans , Proteins/chemistry , Proteins/metabolism , Animals , Models, Molecular , Protein ConformationABSTRACT
Advances in the synthesis and screening of small-molecule libraries have accelerated the discovery of chemical probes for studying biological processes. Still, only a small fraction of the human proteome has chemical ligands. Here, we describe a platform that marries fragment-based ligand discovery with quantitative chemical proteomics to map thousands of reversible small molecule-protein interactions directly in human cells, many of which can be site-specifically determined. We show that fragment hits can be advanced to furnish selective ligands that affect the activity of proteins heretofore lacking chemical probes. We further combine fragment-based chemical proteomics with phenotypic screening to identify small molecules that promote adipocyte differentiation by engaging the poorly characterized membrane protein PGRMC2. Fragment-based screening in human cells thus provides an extensive proteome-wide map of protein ligandability and facilitates the coordinated discovery of bioactive small molecules and their molecular targets.
Subject(s)
Drug Discovery/methods , Proteomics/methods , Adipocytes/cytology , Cell Differentiation , Crystallography, X-Ray , High-Throughput Screening Assays , Humans , Hydrolases/chemistry , Ligands , Membrane Proteins/antagonists & inhibitors , Oxidoreductases/chemistry , Protein Binding , Receptors, Progesterone/antagonists & inhibitors , Small Molecule LibrariesABSTRACT
Lipids play central roles in physiology and disease, where their structural, metabolic, and signaling functions often arise from interactions with proteins. Here, we describe a set of lipid-based chemical proteomic probes and their global interaction map in mammalian cells. These interactions involve hundreds of proteins from diverse functional classes and frequently occur at sites of drug action. We determine the target profiles for several drugs across the lipid-interaction proteome, revealing that its ligandable content extends far beyond traditionally defined categories of druggable proteins. In further support of this finding, we describe a selective ligand for the lipid-binding protein nucleobindin-1 (NUCB1) and show that this compound perturbs the hydrolytic and oxidative metabolism of endocannabinoids in cells. The described chemical proteomic platform thus provides an integrated path to both discover and pharmacologically characterize a wide range of proteins that participate in lipid pathways in cells.
Subject(s)
Lipid Metabolism , Proteins/analysis , Proteins/metabolism , Animals , Calcium-Binding Proteins/analysis , Cell Line, Tumor , DNA-Binding Proteins/analysis , Drug Evaluation, Preclinical , Eicosanoids/metabolism , Endocannabinoids/metabolism , HEK293 Cells , Humans , Lipid Metabolism/drug effects , Mice , Nerve Tissue Proteins/analysis , Nucleobindins , Proteome/analysis , Proteome/metabolism , Small Molecule Libraries/pharmacologyABSTRACT
De novo design of complex protein folds using solely computational means remains a substantial challenge1. Here we use a robust deep learning pipeline to design complex folds and soluble analogues of integral membrane proteins. Unique membrane topologies, such as those from G-protein-coupled receptors2, are not found in the soluble proteome, and we demonstrate that their structural features can be recapitulated in solution. Biophysical analyses demonstrate the high thermal stability of the designs, and experimental structures show remarkable design accuracy. The soluble analogues were functionalized with native structural motifs, as a proof of concept for bringing membrane protein functions to the soluble proteome, potentially enabling new approaches in drug discovery. In summary, we have designed complex protein topologies and enriched them with functionalities from membrane proteins, with high experimental success rates, leading to a de facto expansion of the functional soluble fold space.
Subject(s)
Computer-Aided Design , Deep Learning , Membrane Proteins , Protein Folding , Solubility , Humans , Membrane Proteins/chemistry , Membrane Proteins/metabolism , Models, Molecular , Protein Stability , Proteome/chemistry , Receptors, G-Protein-Coupled/chemistry , Receptors, G-Protein-Coupled/metabolism , Amino Acid Motifs , Proof of Concept StudyABSTRACT
Because apoptosis of infected cells can limit virus production and spread, some viruses have co-opted prosurvival genes from the host. This includes the Epstein-Barr virus (EBV) gene BHRF1, a homolog of human Bcl-2 proteins that block apoptosis and are associated with cancer. Computational design and experimental optimization were used to generate a novel protein called BINDI that binds BHRF1 with picomolar affinity. BINDI recognizes the hydrophobic cleft of BHRF1 in a manner similar to other Bcl-2 protein interactions but makes many additional contacts to achieve exceptional affinity and specificity. BINDI induces apoptosis in EBV-infected cancer lines, and when delivered with an antibody-targeted intracellular delivery carrier, BINDI suppressed tumor growth and extended survival in a xenograft disease model of EBV-positive human lymphoma. High-specificity-designed proteins that selectively kill target cells may provide an advantage over the toxic compounds used in current generation antibody-drug conjugates.
Subject(s)
Herpesvirus 4, Human/chemistry , Protein Engineering , Proteins/pharmacology , Viral Proteins/antagonists & inhibitors , Amino Acid Sequence , Animals , Apoptosis/drug effects , Computational Biology , Crystallography, X-Ray , Epstein-Barr Virus Infections/drug therapy , Herpesvirus 4, Human/physiology , Heterografts , Humans , Lymphoma, B-Cell/drug therapy , Mice , Models, Molecular , Molecular Sequence Data , Neoplasm Transplantation , Proteins/chemistry , Proteins/metabolism , Sequence Alignment , Viral Proteins/chemistryABSTRACT
Physical interactions between proteins are essential for most biological processes governing life1. However, the molecular determinants of such interactions have been challenging to understand, even as genomic, proteomic and structural data increase. This knowledge gap has been a major obstacle for the comprehensive understanding of cellular protein-protein interaction networks and for the de novo design of protein binders that are crucial for synthetic biology and translational applications2-9. Here we use a geometric deep-learning framework operating on protein surfaces that generates fingerprints to describe geometric and chemical features that are critical to drive protein-protein interactions10. We hypothesized that these fingerprints capture the key aspects of molecular recognition that represent a new paradigm in the computational design of novel protein interactions. As a proof of principle, we computationally designed several de novo protein binders to engage four protein targets: SARS-CoV-2 spike, PD-1, PD-L1 and CTLA-4. Several designs were experimentally optimized, whereas others were generated purely in silico, reaching nanomolar affinity with structural and mutational characterization showing highly accurate predictions. Overall, our surface-centric approach captures the physical and chemical determinants of molecular recognition, enabling an approach for the de novo design of protein interactions and, more broadly, of artificial proteins with function.
Subject(s)
Computer Simulation , Deep Learning , Protein Binding , Proteins , Humans , Proteins/chemistry , Proteins/metabolism , Proteomics , Protein Interaction Maps , Binding Sites , Synthetic BiologyABSTRACT
The ability to remotely control the activity of chimeric antigen receptors (CARs) with small molecules can improve the safety and efficacy of gene-modified T cells. Split ON- or OFF-switch CARs involve the dissociation of tumor-antigen binding from T cell activation (i.e., CD3ζ) on the receptor (R-) and signaling (S-) chains, respectively, that either associate or are disrupted in the presence of a small molecule. Here, we have developed an inducible (i)ON-CAR comprising the anti-apoptotic B cell lymphoma protein 2 protein in the ectodomain of both chains which associate in the presence of venetoclax. We showed that inducible ON (iON)-CAR T cells respond to target tumors cells in the presence of venetoclax or the BH3 mimetic navitoclax in a dose-dependent manner, while there is no impact of the drugs on equivalent second generation-CAR T cells. Within 48 h of venetoclax withdrawal, iON-CAR T cells lose the ability to respond to target tumor cells in vitro as evaluated by Interferon-gamma (IFNγ) production, and they are reliant upon the presence of venetoclax for in vivo activity. Finally, by fusing a degron sequence to the endodomain of the iON-CAR S-chain we generated an all-in-one ON/OFF-switch CAR, the iONØ-CAR, down-regulated by lenalidomide within 4 to 6 for functionally inactive T cells (no IFNγ production) within 24 h. We propose that our remote-control CAR designs can reduce toxicity in the clinic. Moreover, the periodic rest of iON and iONØ-CAR T cells may alleviate exhaustion and hence augment persistence and long-term tumor control in patients.
Subject(s)
Bridged Bicyclo Compounds, Heterocyclic , Receptors, Chimeric Antigen , Sulfonamides , T-Lymphocytes , Receptors, Chimeric Antigen/immunology , Receptors, Chimeric Antigen/metabolism , Humans , Sulfonamides/pharmacology , Animals , T-Lymphocytes/immunology , T-Lymphocytes/drug effects , Mice , Bridged Bicyclo Compounds, Heterocyclic/pharmacology , Cell Line, Tumor , Antineoplastic Agents/pharmacology , Immunotherapy, Adoptive/methods , Proto-Oncogene Proteins c-bcl-2/metabolism , Xenograft Model Antitumor Assays , Receptors, Antigen, T-Cell/metabolism , Receptors, Antigen, T-Cell/immunology , Lymphocyte Activation/drug effects , Interferon-gamma/metabolism , Interferon-gamma/immunology , Aniline CompoundsABSTRACT
Cysteine cathepsins are a family of proteases that are relevant therapeutic targets for the treatment of different cancers and other diseases. However, no clinically approved drugs for these proteins exist, as their systemic inhibition can induce deleterious side effects. To address this problem, we developed a modular antibody-based platform for targeted drug delivery by conjugating non-natural peptide inhibitors (NNPIs) to antibodies. NNPIs were functionalized with reactive warheads for covalent inhibition, optimized with deep saturation mutagenesis and conjugated to antibodies to enable cell-type-specific delivery. Our antibody-peptide inhibitor conjugates specifically blocked the activity of cathepsins in different cancer cells, as well as osteoclasts, and showed therapeutic efficacy in vitro and in vivo. Overall, our approach allows for the rapid design of selective cathepsin inhibitors and can be generalized to inhibit a broad class of proteases in cancer and other diseases.
Subject(s)
Cathepsins , Peptides , Humans , Cathepsins/antagonists & inhibitors , Cathepsins/metabolism , Peptides/chemistry , Peptides/pharmacology , Animals , Mice , Cell Line, Tumor , Drug Delivery Systems/methods , Immunoconjugates/pharmacology , Immunoconjugates/chemistry , Neoplasms/drug therapy , Antineoplastic Agents/pharmacology , Antineoplastic Agents/chemistryABSTRACT
Biological signal processing is vital for cellular function. Similar to electronic circuits, cells process signals via integrated mechanisms. In electronics, bandpass filters transmit frequencies with defined ranges, but protein-based counterparts for controlled responses are lacking in engineered biological systems. Here, we rationally design protein-based, chemically responsive bandpass filters (CBPs) showing OFF-ON-OFF patterns that respond to chemical concentrations within a specific range and reject concentrations outside that range. Employing structure-based strategies, we designed a heterodimeric construct that dimerizes in response to low concentrations of a small molecule (ON), and dissociates at high concentrations of the same molecule (OFF). The CBPs have a multidomain architecture in which we used known drug receptors, a computationally designed protein binder and small-molecule inhibitors. This modular system allows fine-tuning for optimal performance in terms of bandwidth, response, cutoff and fold changes. The CBPs were used to regulate cell surface receptor signaling pathways to control cellular activities in engineered cells.
ABSTRACT
De novo protein design enables the exploration of novel sequences and structures absent from the natural protein universe. De novo design also stands as a stringent test for our understanding of the underlying physical principles of protein folding and may lead to the development of proteins with unmatched functional characteristics. The first fundamental challenge of de novo design is to devise "designable" structural templates leading to sequences that will adopt the predicted fold. Here, we built on the TopoBuilder (TB) de novo design method, to automatically assemble structural templates with native-like features starting from string descriptors that capture the overall topology of proteins. Our framework eliminates the dependency of hand-crafted and fold-specific rules through an iterative, data-driven approach that extracts geometrical parameters from structural tertiary motifs. We evaluated the TopoBuilder framework by designing sequences for a set of five protein folds and experimental characterization revealed that several sequences were folded and stable in solution. The TopoBuilder de novo design framework will be broadly useful to guide the generation of artificial proteins with customized geometries, enabling the exploration of the protein universe.
Subject(s)
Protein Folding , Proteins , Models, Molecular , Protein Engineering/methods , Proteins/chemistryABSTRACT
Reliably scoring and ranking candidate models of protein complexes and assigning their oligomeric state from the structure of the crystal lattice represent outstanding challenges. A community-wide effort was launched to tackle these challenges. The latest resources on protein complexes and interfaces were exploited to derive a benchmark dataset consisting of 1677 homodimer protein crystal structures, including a balanced mix of physiological and non-physiological complexes. The non-physiological complexes in the benchmark were selected to bury a similar or larger interface area than their physiological counterparts, making it more difficult for scoring functions to differentiate between them. Next, 252 functions for scoring protein-protein interfaces previously developed by 13 groups were collected and evaluated for their ability to discriminate between physiological and non-physiological complexes. A simple consensus score generated using the best performing score of each of the 13 groups, and a cross-validated Random Forest (RF) classifier were created. Both approaches showed excellent performance, with an area under the Receiver Operating Characteristic (ROC) curve of 0.93 and 0.94, respectively, outperforming individual scores developed by different groups. Additionally, AlphaFold2 engines recalled the physiological dimers with significantly higher accuracy than the non-physiological set, lending support to the reliability of our benchmark dataset annotations. Optimizing the combined power of interface scoring functions and evaluating it on challenging benchmark datasets appears to be a promising strategy.
Subject(s)
Proteins , Reproducibility of Results , Proteins/metabolism , Protein BindingABSTRACT
The Rosetta software for macromolecular modeling, docking and design is extensively used in laboratories worldwide. During two decades of development by a community of laboratories at more than 60 institutions, Rosetta has been continuously refactored and extended. Its advantages are its performance and interoperability between broad modeling capabilities. Here we review tools developed in the last 5 years, including over 80 methods. We discuss improvements to the score function, user interfaces and usability. Rosetta is available at http://www.rosettacommons.org.
Subject(s)
Macromolecular Substances/chemistry , Models, Molecular , Proteins/chemistry , Software , Molecular Docking Simulation , Peptidomimetics/chemistry , Protein ConformationABSTRACT
De novo protein design has enabled the creation of new protein structures. However, the design of functional proteins has proved challenging, in part due to the difficulty of transplanting structurally complex functional sites to available protein structures. Here, we used a bottom-up approach to build de novo proteins tailored to accommodate structurally complex functional motifs. We applied the bottom-up strategy to successfully design five folds for four distinct binding motifs, including a bifunctionalized protein with two motifs. Crystal structures confirmed the atomic-level accuracy of the computational designs. These de novo proteins were functional as components of biosensors to monitor antibody responses and as orthogonal ligands to modulate synthetic signaling receptors in engineered mammalian cells. Our work demonstrates the potential of bottom-up approaches to accommodate complex structural motifs, which will be essential to endow de novo proteins with elaborate biochemical functions, such as molecular recognition or catalysis.
Subject(s)
Protein Engineering/methods , Amino Acid Motifs/genetics , Binding Sites/genetics , Catalysis , Ligands , Models, Molecular , Protein Binding/genetics , Protein Folding , Proteins/chemistryABSTRACT
Proteins are typically represented by discrete atomic coordinates providing an accessible framework to describe different conformations. However, in some fields proteins are more accurately represented as near-continuous surfaces, as these are imprinted with geometric (shape) and chemical (electrostatics) features of the underlying protein structure. Protein surfaces are dependent on their chemical composition and, ultimately determine protein function, acting as the interface that engages in interactions with other molecules. In the past, such representations were utilized to compare protein structures on global and local scales and have shed light on functional properties of proteins. Here we describe RosettaSurf, a surface-centric computational design protocol, that focuses on the molecular surface shape and electrostatic properties as means for protein engineering, offering a unique approach for the design of proteins and their functions. The RosettaSurf protocol combines the explicit optimization of molecular surface features with a global scoring function during the sequence design process, diverging from the typical design approaches that rely solely on an energy scoring function. With this computational approach, we attempt to address a fundamental problem in protein design related to the design of functional sites in proteins, even when structurally similar templates are absent in the characterized structural repertoire. Surface-centric design exploits the premise that molecular surfaces are, to a certain extent, independent of the underlying sequence and backbone configuration, meaning that different sequences in different proteins may present similar surfaces. We benchmarked RosettaSurf on various sequence recovery datasets and showcased its design capabilities by generating epitope mimics that were biochemically validated. Overall, our results indicate that the explicit optimization of surface features may lead to new routes for the design of functional proteins.
Subject(s)
Protein Engineering , Proteins , Algorithms , Computational Biology/methods , Protein Conformation , Protein Engineering/methods , Proteins/chemistry , Static ElectricityABSTRACT
Optogenetic control of CRISPR-Cas9 systems has significantly improved our ability to perform genome perturbations in living cells with high precision in time and space. As new Cas orthologues with advantageous properties are rapidly being discovered and engineered, the need for straightforward strategies to control their activity via exogenous stimuli persists. The Cas9 from Neisseria meningitidis (Nme) is a particularly small and target-specific Cas9 orthologue, and thus of high interest for in vivo genome editing applications. Here, we report the first optogenetic tool to control NmeCas9 activity in mammalian cells via an engineered, light-dependent anti-CRISPR (Acr) protein. Building on our previous Acr engineering work, we created hybrids between the NmeCas9 inhibitor AcrIIC3 and the LOV2 blue light sensory domain from Avena sativa. Two AcrIIC3-LOV2 hybrids from our collection potently blocked NmeCas9 activity in the dark, while permitting robust genome editing at various endogenous loci upon blue light irradiation. Structural analysis revealed that, within these hybrids, the LOV2 domain is located in striking proximity to the Cas9 binding surface. Together, our work demonstrates optogenetic regulation of a type II-C CRISPR effector and might suggest a new route for the design of optogenetic Acrs.
Subject(s)
CRISPR-Associated Protein 9/antagonists & inhibitors , CRISPR-Associated Protein 9/chemistry , CRISPR-Cas Systems , Gene Editing/methods , Neisseria meningitidis/enzymology , Optogenetics/methods , Cell Line , HEK293 Cells , Humans , Light , Models, Molecular , Protein Engineering , Proteins/chemistry , Proteins/radiation effectsABSTRACT
Anti-CRISPR (Acr) proteins are powerful tools to control CRISPR-Cas technologies. However, the available Acr repertoire is limited to naturally occurring variants. Here, we applied structure-based design on AcrIIC1, a broad-spectrum CRISPR-Cas9 inhibitor, to improve its efficacy on different targets. We first show that inserting exogenous protein domains into a selected AcrIIC1 surface site dramatically enhances inhibition of Neisseria meningitidis (Nme)Cas9. Then, applying structure-guided design to the Cas9-binding surface, we converted AcrIIC1 into AcrIIC1X, a potent inhibitor of the Staphylococcus aureus (Sau)Cas9, an orthologue widely applied for in vivo genome editing. Finally, to demonstrate the utility of AcrIIC1X for genome engineering applications, we implemented a hepatocyte-specific SauCas9 ON-switch by placing AcrIIC1X expression under regulation of microRNA-122. Our work introduces designer Acrs as important biotechnological tools and provides an innovative strategy to safeguard CRISPR technologies.
Subject(s)
CRISPR-Associated Protein 9/genetics , CRISPR-Cas Systems , Clustered Regularly Interspaced Short Palindromic Repeats , Gene Editing/methods , MicroRNAs/genetics , Protein Engineering/methods , Amino Acid Sequence , CRISPR-Associated Protein 9/metabolism , Cell Line, Tumor , Genome, Human , HEK293 Cells , Hepatocytes/cytology , Hepatocytes/metabolism , Humans , MicroRNAs/metabolism , Models, Molecular , Mutagenesis, Insertional , Neisseria meningitidis/enzymology , Neisseria meningitidis/genetics , Plasmids/chemistry , Plasmids/metabolism , Protein Domains , Protein Structure, Secondary , RNA, Guide, Kinetoplastida/genetics , RNA, Guide, Kinetoplastida/metabolism , Staphylococcus aureus/enzymology , Staphylococcus aureus/geneticsABSTRACT
Throughout the last several decades, vaccination has been key to prevent and eradicate infectious diseases. However, many pathogens (e.g., respiratory syncytial virus [RSV], influenza, dengue, and others) have resisted vaccine development efforts, largely because of the failure to induce potent antibody responses targeting conserved epitopes. Deep profiling of human B cells often reveals potent neutralizing antibodies that emerge from natural infection, but these specificities are generally subdominant (i.e., are present in low titers). A major challenge for next-generation vaccines is to overcome established immunodominance hierarchies and focus antibody responses on crucial neutralization epitopes. Here, we show that a computationally designed epitope-focused immunogen presenting a single RSV neutralization epitope elicits superior epitope-specific responses compared to the viral fusion protein. In addition, the epitope-focused immunogen efficiently boosts antibodies targeting the palivizumab epitope, resulting in enhanced neutralization. Overall, we show that epitope-focused immunogens can boost subdominant neutralizing antibody responses in vivo and reshape established antibody hierarchies.
Subject(s)
Antibodies, Neutralizing/biosynthesis , Antibodies, Viral/biosynthesis , Epitopes/chemistry , Receptors, Antigen, B-Cell/immunology , Recombinant Fusion Proteins/chemistry , Respiratory Syncytial Viruses/immunology , Viral Fusion Proteins/chemistry , Animals , Antibodies, Monoclonal, Humanized/chemistry , Antibodies, Monoclonal, Humanized/immunology , Antibodies, Neutralizing/genetics , Antibodies, Viral/genetics , Cloning, Molecular , Computer-Aided Design , Epitopes/immunology , Escherichia coli/genetics , Escherichia coli/metabolism , Female , Gene Expression , Genetic Vectors/chemistry , Genetic Vectors/metabolism , Immunization/methods , Immunogenicity, Vaccine , Mice , Mice, Inbred BALB C , Nanoparticles/administration & dosage , Nanoparticles/chemistry , Palivizumab/chemistry , Palivizumab/immunology , Receptors, Antigen, B-Cell/chemistry , Receptors, Antigen, B-Cell/genetics , Recombinant Fusion Proteins/administration & dosage , Recombinant Fusion Proteins/genetics , Recombinant Fusion Proteins/immunology , Respiratory Syncytial Virus Vaccines/administration & dosage , Respiratory Syncytial Virus Vaccines/biosynthesis , Respiratory Syncytial Virus Vaccines/genetics , Structural Homology, Protein , Viral Fusion Proteins/administration & dosage , Viral Fusion Proteins/genetics , Viral Fusion Proteins/immunologyABSTRACT
Small molecules are powerful tools for investigating protein function and can serve as leads for new therapeutics. Most human proteins, however, lack small-molecule ligands, and entire protein classes are considered 'undruggable'. Fragment-based ligand discovery can identify small-molecule probes for proteins that have proven difficult to target using high-throughput screening of complex compound libraries. Although reversibly binding ligands are commonly pursued, covalent fragments provide an alternative route to small-molecule probes, including those that can access regions of proteins that are difficult to target through binding affinity alone. Here we report a quantitative analysis of cysteine-reactive small-molecule fragments screened against thousands of proteins in human proteomes and cells. Covalent ligands were identified for >700 cysteines found in both druggable proteins and proteins deficient in chemical probes, including transcription factors, adaptor/scaffolding proteins, and uncharacterized proteins. Among the atypical ligand-protein interactions discovered were compounds that react preferentially with pro- (inactive) caspases. We used these ligands to distinguish extrinsic apoptosis pathways in human cell lines versus primary human T cells, showing that the former is largely mediated by caspase-8 while the latter depends on both caspase-8 and -10. Fragment-based covalent ligand discovery provides a greatly expanded portrait of the ligandable proteome and furnishes compounds that can illuminate protein functions in native biological systems.
Subject(s)
Cysteine/metabolism , Drug Evaluation, Preclinical/methods , Proteome/chemistry , Proteome/metabolism , Small Molecule Libraries/metabolism , Small Molecule Libraries/pharmacology , T-Lymphocytes/metabolism , Apoptosis , Caspase 10/chemistry , Caspase 10/metabolism , Caspase 8/chemistry , Caspase 8/metabolism , Cells, Cultured , Enzyme Precursors/chemistry , Enzyme Precursors/metabolism , Humans , Ligands , Peptide Fragments/chemistry , Peptide Fragments/metabolism , T-Lymphocytes/chemistry , Transcription Factors/chemistry , Transcription Factors/metabolismABSTRACT
Structure-based antibody and antigen design has advanced greatly in recent years, due not only to the increasing availability of experimentally determined structures but also to improved computational methods for both prediction and design. Constant improvements in performance within the Rosetta software suite for biomolecular modeling have given rise to a greater breadth of structure prediction, including docking and design application cases for antibody and antigen modeling. Here, we present an overview of current protocols for antibody and antigen modeling using Rosetta and exemplify those by detailed tutorials originally developed for a Rosetta workshop at Vanderbilt University. These tutorials cover antibody structure prediction, docking, and design and antigen design strategies, including the addition of glycans in Rosetta. We expect that these materials will allow novice users to apply Rosetta in their own projects for modeling antibodies and antigens.