Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 296
Filter
1.
Angew Chem Int Ed Engl ; 63(24): e202405767, 2024 Jun 10.
Article in English | MEDLINE | ID: mdl-38588243

ABSTRACT

Identifying the interactome for a protein of interest is challenging due to the large number of possible binders. High-throughput experimental approaches narrow down possible binding partners but often include false positives. Furthermore, they provide no information about what the binding region is (e.g., the binding epitope). We introduce a novel computational pipeline based on an AlphaFold2 (AF) Competitive Binding Assay (AF-CBA) to identify proteins that bind a target of interest from a pull-down experiment and the binding epitope. Our focus is on proteins that bind the Extraterminal (ET) domain of Bromo and Extraterminal domain (BET) proteins, but we also introduce nine additional systems to show transferability to other peptide-protein systems. We describe a series of limitations to the methodology based on intrinsic deficiencies of AF and AF-CBA to help users identify scenarios where the approach will be most useful. Given the method's speed and accuracy, we anticipate its broad applicability to identify binding epitope regions among potential partners, setting the stage for experimental verification.


Subject(s)
Protein Binding , Proteins , Proteins/chemistry , Proteins/metabolism , Peptide Library , High-Throughput Screening Assays
2.
Structure ; 32(6): 824-837.e1, 2024 Jun 06.
Article in English | MEDLINE | ID: mdl-38490206

ABSTRACT

Biomolecular structure analysis from experimental NMR studies generally relies on restraints derived from a combination of experimental and knowledge-based data. A challenge for the structural biology community has been a lack of standards for representing these restraints, preventing the establishment of uniform methods of model-vs-data structure validation against restraints and limiting interoperability between restraint-based structure modeling programs. The NEF and NMR-STAR formats provide a standardized approach for representing commonly used NMR restraints. Using these restraint formats, a standardized validation system for assessing structural models of biopolymers against restraints has been developed and implemented in the wwPDB OneDep data deposition-validation-biocuration system. The resulting wwPDB restraint violation report provides a model vs. data assessment of biomolecule structures determined using distance and dihedral restraints, with extensions to other restraint types currently being implemented. These tools are useful for assessing NMR models, as well as for assessing biomolecular structure predictions based on distance restraints.


Subject(s)
Databases, Protein , Models, Molecular , Nuclear Magnetic Resonance, Biomolecular , Protein Conformation , Proteins , Nuclear Magnetic Resonance, Biomolecular/methods , Proteins/chemistry , Software
3.
bioRxiv ; 2024 Jan 22.
Article in English | MEDLINE | ID: mdl-38328042

ABSTRACT

Biomolecular structure analysis from experimental NMR studies generally relies on restraints derived from a combination of experimental and knowledge-based data. A challenge for the structural biology community has been a lack of standards for representing these restraints, preventing the establishment of uniform methods of model-vs-data structure validation against restraints and limiting interoperability between restraint-based structure modeling programs. The NMR exchange (NEF) and NMR-STAR formats provide a standardized approach for representing commonly used NMR restraints. Using these restraint formats, a standardized validation system for assessing structural models of biopolymers against restraints has been developed and implemented in the wwPDB OneDep data deposition-validation-biocuration system. The resulting wwPDB Restraint Violation Report provides a model vs. data assessment of biomolecule structures determined using distance and dihedral restraints, with extensions to other restraint types currently being implemented. These tools are useful for assessing NMR models, as well as for assessing biomolecular structure predictions based on distance restraints.

4.
bioRxiv ; 2024 Jan 23.
Article in English | MEDLINE | ID: mdl-38328039

ABSTRACT

Identifying the interactome for a protein of interest is challenging due to the large number of possible binders. High-throughput experimental approaches narrow down possible binding partners, but often include false positives. Furthermore, they provide no information about what the binding region is (e.g. the binding epitope). We introduce a novel computational pipeline based on an AlphaFold2 (AF) Competition Assay (AF-CBA) to identify proteins that bind a target of interest from a pull-down experiment, along with the binding epitope. Our focus is on proteins that bind the Extraterminal (ET) domain of Bromo and Extraterminal domain (BET) proteins, but we also introduce nine additional systems to show transferability to other peptide-protein systems. We describe a series of limitations to the methodology based on intrinsic deficiencies to AF and AF-CBA, to help users identify scenarios where the approach will be most useful. Given the speed and accuracy of the methodology, we expect it to be generally applicable to facilitate target selection for experimental verification starting from high-throughput protein libraries.

5.
bioRxiv ; 2024 Jan 31.
Article in English | MEDLINE | ID: mdl-38352570

ABSTRACT

This manuscript describes the application of Isothermal Titration Calorimetry (ITC) to characterize the kinetics of 3CL pro from the Severe Acute Respiratory Syndrome CoronaVirus-2 (SARS-CoV-2) and its inhibition by Ensitrelvir, a known non-covalent inhibitor. 3CL pro is the main protease that plays a crucial role of producing the whole array of proteins necessary for the viral infection that caused the spread of COVID-19, responsible for millions of deaths worldwide as well as global economic and healthcare crises in recent years. The proposed calorimetric method proved to have several advantages over the two types of enzymatic assays so far applied to this system, namely Förster Resonance Energy Transfer (FRET) and Liquid Chromatography-Mass Spectrometry (LC-MS). The developed ITC-based assay provided a rapid response to 3CL pro activity, which was used to directly derive the kinetic enzymatic constants K M and k cat reliably and reproducibly, as well as their temperature dependence, from which the activation energy of the reaction was obtained for the first time. The assay further revealed the existence of two modes of inhibition of 3CL pro by Ensitrelvir, namely a competitive mode as previously inferred by crystallography as well as an unprecedented uncompetitive mode, further yielding the respective inhibition constants with high precision. The calorimetric method described in this paper is thus proposed to be generally and widely used in the discovery and development of drugs targeting 3CL pro .

6.
bioRxiv ; 2024 Jan 24.
Article in English | MEDLINE | ID: mdl-38328244

ABSTRACT

Influenza A and B viruses overcome the host antiviral response to cause a contagious and often severe human respiratory disease. Here, integrative structural biology and biochemistry studies on non-structural protein 1 of influenza B virus (NS1B) reveal a previously unrecognized viral mechanism for innate immune evasion. Conserved basic groups of its C-terminal domain (NS1B-CTD) bind 5'triphosphorylated double-stranded RNA (5'-ppp-dsRNA), the primary pathogen-associated feature that activates the host retinoic acid-inducible gene I protein (RIG-I) to initiate interferon synthesis and the cellular antiviral response. Like RIG-I, NS1B-CTD preferentially binds blunt-end 5'ppp-dsRNA. NS1B-CTD also competes with RIG-I for binding 5'ppp-dsRNA, and thus suppresses activation of RIG-I's ATPase activity. Although the NS1B N-terminal domain also binds dsRNA, it utilizes a different binding mode and lacks 5'ppp-dsRNA end preferences. In cells infected with wild-type influenza B virus, RIG-I activation is inhibited. In contrast, RIG-I activation and the resulting phosphorylation of transcription factor IRF-3 are not inhibited in cells infected with a mutant virus encoding NS1B with a R208A substitution it its CTD that eliminates its 5'ppp-dsRNA binding activity. These results reveal a novel mechanism in which NS1B binds 5'ppp-dsRNA to inhibit the RIG-I antiviral response during influenza B virus infection, and open the door to new avenues for antiviral drug discovery.

8.
Sci Data ; 11(1): 30, 2024 Jan 04.
Article in English | MEDLINE | ID: mdl-38177162

ABSTRACT

Multidimensional NMR spectra are the basis for studying proteins by NMR spectroscopy and crucial for the development and evaluation of methods for biomolecular NMR data analysis. Nevertheless, in contrast to derived data such as chemical shift assignments in the BMRB and protein structures in the PDB databases, this primary data is in general not publicly archived. To change this unsatisfactory situation, we present a standardized set of solution NMR data comprising 1329 2-4-dimensional NMR spectra and associated reference (chemical shift assignments, structures) and derived (peak lists, restraints for structure calculation, etc.) annotations. With the 100-protein NMR spectra dataset that was originally compiled for the development of the ARTINA deep learning-based spectra analysis method, 100 protein structures can be reproduced from their original experimental data. The 100-protein NMR spectra dataset is expected to help the development of computational methods for NMR spectroscopy, in particular machine learning approaches, and enable consistent and objective comparisons of these methods.


Subject(s)
Magnetic Resonance Imaging , Proteins , Algorithms , Magnetic Resonance Spectroscopy , Nuclear Magnetic Resonance, Biomolecular/methods , Proteins/chemistry
9.
Proteins ; 91(12): 1903-1911, 2023 Dec.
Article in English | MEDLINE | ID: mdl-37872703

ABSTRACT

For the first time, the 2022 CASP (Critical Assessment of Structure Prediction) community experiment included a section on computing multiple conformations for protein and RNA structures. There was full or partial success in reproducing the ensembles for four of the nine targets, an encouraging result. For protein structures, enhanced sampling with variations of the AlphaFold2 deep learning method was by far the most effective approach. One substantial conformational change caused by a single mutation across a complex interface was accurately reproduced. In two other assembly modeling cases, methods succeeded in sampling conformations near to the experimental ones even though environmental factors were not included in the calculations. An experimentally derived flexibility ensemble allowed a single accurate RNA structure model to be identified. Difficulties included how to handle sparse or low-resolution experimental data and the current lack of effective methods for modeling RNA/protein complexes. However, these and other obstacles appear addressable.


Subject(s)
Proteins , RNA , Protein Conformation , Proteins/chemistry , Mutation
10.
Curr Opin Struct Biol ; 83: 102703, 2023 Dec.
Article in English | MEDLINE | ID: mdl-37776602

ABSTRACT

Biomolecules exhibit dynamic behavior that single-state models of their structures cannot fully capture. We review some recent advances for investigating multiple conformations of biomolecules, including experimental methods, molecular dynamics simulations, and machine learning. We also address the challenges associated with representing single- and multiple-state models in data archives, with a particular focus on NMR structures. Establishing standardized representations and annotations will facilitate effective communication and understanding of these complex models to the broader scientific community.


Subject(s)
Molecular Dynamics Simulation , Proteins , Proteins/chemistry , Molecular Conformation , Magnetic Resonance Spectroscopy , Protein Conformation
11.
STAR Protoc ; 4(2): 102326, 2023 May 05.
Article in English | MEDLINE | ID: mdl-37235475

ABSTRACT

3CLpro protease from SARS-CoV-2 is a primary target for COVID-19 antiviral drug development. Here, we present a protocol for 3CLpro production in Escherichia coli. We describe steps to purify 3CLpro, expressed as a fusion with the Saccharomyces cerevisiae SUMO protein, with yields up to 120 mg L-1 following cleavage. The protocol also provides isotope-enriched samples suitable for nuclear magnetic resonance (NMR) studies. We also present methods to characterize 3CLpro by mass spectrometry, X-ray crystallography, heteronuclear NMR, and a Förster-resonance-energy-transfer-based enzyme assay. For complete details on the use and execution of this protocol, please refer to Bafna et al.1.

12.
Curr Opin Struct Biol ; 80: 102603, 2023 06.
Article in English | MEDLINE | ID: mdl-37178478

ABSTRACT

Membrane-traversing peptides offer opportunities for targeting intracellular proteins and oral delivery. Despite progress in understanding the mechanisms underlying membrane traversal in natural cell-permeable peptides, there are still several challenges to designing membrane-traversing peptides with diverse shapes and sizes. Conformational flexibility appears to be a key determinant of membrane permeability of large macrocycles. We review recent developments in the design and validation of chameleonic cyclic peptides, which can switch between alternative conformations to enable improved permeability through cell membranes, while still maintaining reasonable solubility and exposed polar functional groups for target protein binding. Finally, we discuss the principles, strategies, and practical considerations for rational design, discovery, and validation of permeable chameleonic peptides.


Subject(s)
Lizards , Peptides, Cyclic , Animals , Peptides, Cyclic/metabolism , Lizards/metabolism , Peptides/chemistry , Molecular Conformation , Cell Membrane Permeability
13.
J Magn Reson ; 352: 107481, 2023 07.
Article in English | MEDLINE | ID: mdl-37257257

ABSTRACT

Recent advances in molecular modeling of protein structures are changing the field of structural biology. AlphaFold-2 (AF2), an AI system developed by DeepMind, Inc., utilizes attention-based deep learning to predict models of protein structures with high accuracy relative to structures determined by X-ray crystallography and cryo-electron microscopy (cryoEM). Comparing AF2 models to structures determined using solution NMR data, both high similarities and distinct differences have been observed. Since AF2 was trained on X-ray crystal and cryoEM structures, we assessed how accurately AF2 can model small, monomeric, solution protein NMR structures which (i) were not used in the AF2 training data set, and (ii) did not have homologous structures in the Protein Data Bank at the time of AF2 training. We identified nine open-source protein NMR data sets for such "blind" targets, including chemical shift, raw NMR FID data, NOESY peak lists, and (for 1 case) 15N-1H residual dipolar coupling data. For these nine small (70-108 residues) monomeric proteins, we generated AF2 prediction models and assessed how well these models fit to these experimental NMR data, using several well-established NMR structure validation tools. In most of these cases, the AF2 models fit the NMR data nearly as well, or sometimes better than, the corresponding NMR structure models previously deposited in the Protein Data Bank. These results provide benchmark NMR data for assessing new NMR data analysis and protein structure prediction methods. They also document the potential for using AF2 as a guiding tool in protein NMR data analysis, and more generally for hypothesis generation in structural biology research.


Subject(s)
Furylfuramide , Proteins , Protein Conformation , Cryoelectron Microscopy , Nuclear Magnetic Resonance, Biomolecular/methods , Proteins/chemistry
15.
J Chem Inf Model ; 63(7): 2058-2072, 2023 04 10.
Article in English | MEDLINE | ID: mdl-36988562

ABSTRACT

Intrinsically disordered regions of proteins often mediate important protein-protein interactions. However, the folding-upon-binding nature of many polypeptide-protein interactions limits the ability of modeling tools to predict the three-dimensional structures of such complexes. To address this problem, we have taken a tandem approach combining NMR chemical shift data and molecular simulations to determine the structures of peptide-protein complexes. Here, we use the MELD (Modeling Employing Limited Data) technique applied to polypeptide complexes formed with the extraterminal domain (ET) of bromo and extraterminal domain (BET) proteins, which exhibit a high degree of binding plasticity. This system is particularly challenging as the binding process includes allosteric changes across the ET receptor upon binding, and the polypeptide binding partners can adopt different conformations (e.g., helices and hairpins) in the complex. In a blind study, the new approach successfully modeled bound-state conformations and binding poses, using only protein receptor backbone chemical shift data, in excellent agreement with experimentally determined structures for moderately tight (Kd ∼100 nM) binders. The hybrid MELD + NMR approach required additional peptide ligand chemical shift data for weaker (Kd ∼250 µM) peptide binding partners. AlphaFold also successfully predicts the structures of some of these peptide-protein complexes. However, whereas AlphaFold can provide qualitative peptide rankings, MELD can directly estimate relative binding affinities. The hybrid MELD + NMR approach offers a powerful new tool for structural analysis of protein-polypeptide complexes involving disorder-to-order transitions upon complex formation, which are not successfully modeled with most other complex prediction methods, providing both the 3D structures of peptide-protein complexes and their relative binding affinities.


Subject(s)
Molecular Dynamics Simulation , Peptides , Protein Binding , Proteins/chemistry , Protein Structure, Secondary , Protein Conformation
16.
bioRxiv ; 2023 Jan 22.
Article in English | MEDLINE | ID: mdl-36712039

ABSTRACT

Recent advances in molecular modeling of protein structures are changing the field of structural biology. AlphaFold-2 (AF2), an AI system developed by DeepMind, Inc., utilizes attention-based deep learning to predict models of protein structures with high accuracy relative to structures determined by X-ray crystallography and cryo-electron microscopy (cryoEM). Comparing AF2 models to structures determined using solution NMR data, both high similarities and distinct differences have been observed. Since AF2 was trained on X-ray crystal and cryoEM structures, we assessed how accurately AF2 can model small, monomeric, solution protein NMR structures which (i) were not used in the AF2 training data set, and (ii) did not have homologous structures in the Protein Data Bank at the time of AF2 training. We identified nine open source protein NMR data sets for such "blind" targets, including chemical shift, raw NMR FID data, NOESY peak lists, and (for 1 case) 15 N- 1 H residual dipolar coupling data. For these nine small (70 - 108 residues) monomeric proteins, we generated AF2 prediction models and assessed how well these models fit to these experimental NMR data, using several well-established NMR structure validation tools. In most of these cases, the AF2 models fit the NMR data nearly as well, or sometimes better than, the corresponding NMR structure models previously deposited in the Protein Data Bank. These results provide benchmark NMR data for assessing new NMR data analysis and protein structure prediction methods. They also document the potential for using AF2 as a guiding tool in protein NMR data analysis, and more generally for hypothesis generation in structural biology research. Highlights: AF2 models assessed against NMR data for 9 monomeric proteins not used in training.AF2 models fit NMR data almost as well as the experimentally-determined structures. RPF-DP, PSVS , and PDBStat software provide structure quality and RDC assessment. RPF-DP analysis using AF2 models suggests multiple conformational states.

17.
Front Chem ; 10: 948553, 2022.
Article in English | MEDLINE | ID: mdl-36353143

ABSTRACT

Considering the significant impact of the recent COVID-19 outbreak, development of broad-spectrum antivirals is a high priority goal to prevent future global pandemics. Antiviral development processes generally emphasize targeting a specific protein from a particular virus. However, some antiviral agents developed for specific viral protein targets may exhibit broad spectrum antiviral activity, or at least provide useful lead molecules for broad spectrum drug development. There is significant potential for repurposing a wide range of existing viral protease inhibitors to inhibit the SARS-CoV2 3C-like protease (3CLpro). If effective even as relatively weak inhibitors of 3CLpro, these molecules can provide a diverse and novel set of scaffolds for new drug discovery campaigns. In this study, we compared the sequence- and structure-based similarity of SARS-CoV2 3CLpro with proteases from other viruses, and identified 22 proteases with similar active-site structures. This structural similarity, characterized by secondary-structure topology diagrams, is evolutionarily divergent within taxonomically related viruses, but appears to result from evolutionary convergence of protease enzymes between virus families. Inhibitors of these proteases that are structurally similar to the SARS-CoV2 3CLpro protease were identified and assessed as potential inhibitors of SARS-CoV2 3CLpro protease by virtual docking. Several of these molecules have docking scores that are significantly better than known SARS-CoV2 3CLpro inhibitors, suggesting that these molecules are also potential inhibitors of the SARS-CoV2 3CLpro protease. Some have been previously reported to inhibit SARS-CoV2 3CLpro. The results also suggest that established inhibitors of SARS-CoV2 3CLpro may be considered as potential inhibitors of other viral 3C-like proteases.

19.
Cell ; 185(19): 3520-3532.e26, 2022 09 15.
Article in English | MEDLINE | ID: mdl-36041435

ABSTRACT

We use computational design coupled with experimental characterization to systematically investigate the design principles for macrocycle membrane permeability and oral bioavailability. We designed 184 6-12 residue macrocycles with a wide range of predicted structures containing noncanonical backbone modifications and experimentally determined structures of 35; 29 are very close to the computational models. With such control, we show that membrane permeability can be systematically achieved by ensuring all amide (NH) groups are engaged in internal hydrogen bonding interactions. 84 designs over the 6-12 residue size range cross membranes with an apparent permeability greater than 1 × 10-6 cm/s. Designs with exposed NH groups can be made membrane permeable through the design of an alternative isoenergetic fully hydrogen-bonded state favored in the lipid membrane. The ability to robustly design membrane-permeable and orally bioavailable peptides with high structural accuracy should contribute to the next generation of designed macrocycle therapeutics.


Subject(s)
Amides , Peptides , Amides/chemistry , Hydrogen , Hydrogen Bonding , Lipids , Peptides/chemistry
20.
J Magn Reson ; 342: 107268, 2022 09.
Article in English | MEDLINE | ID: mdl-35930941

ABSTRACT

NMR is a valuable experimental tool in the structural biologist's toolkit to elucidate the structures, functions, and motions of biomolecules. The progress of machine learning, particularly in structural biology, reveals the critical importance of large, diverse, and reliable datasets in developing new methods and understanding in structural biology and science more broadly. Biomolecular NMR research groups produce large amounts of data, and there is renewed interest in organizing these data to train new, sophisticated machine learning architectures and to improve biomolecular NMR analysis pipelines. The foundational data type in NMR is the free-induction decay (FID). There are opportunities to build sophisticated machine learning methods to tackle long-standing problems in NMR data processing, resonance assignment, dynamics analysis, and structure determination using NMR FIDs. Our goal in this study is to provide a lightweight, broadly available tool for archiving FID data as it is generated at the spectrometer, and grow a new resource of FID data and associated metadata. This study presents a relational schema for storing and organizing the metadata items that describe an NMR sample and FID data, which we call Spectral Database (SpecDB). SpecDB is implemented in SQLite and includes a Python software library providing a command-line application to create, organize, query, backup, share, and maintain the database. This set of software tools and database schema allow users to store, organize, share, and learn from NMR time domain data. SpecDB is freely available under an open source license at https://github.rpi.edu/RPIBioinformatics/SpecDB.


Subject(s)
Software , Magnetic Resonance Spectroscopy/methods , Nuclear Magnetic Resonance, Biomolecular/methods
SELECTION OF CITATIONS
SEARCH DETAIL
...