Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 84
Filtrar
1.
Cell ; 187(19): 5267-5281.e13, 2024 Sep 19.
Artigo em Inglês | MEDLINE | ID: mdl-39127037

RESUMO

The nuclear pore complex (NPC) is the sole mediator of nucleocytoplasmic transport. Despite great advances in understanding its conserved core architecture, the peripheral regions can exhibit considerable variation within and between species. One such structure is the cage-like nuclear basket. Despite its crucial roles in mRNA surveillance and chromatin organization, an architectural understanding has remained elusive. Using in-cell cryo-electron tomography and subtomogram analysis, we explored the NPC's structural variations and the nuclear basket across fungi (yeast; S. cerevisiae), mammals (mouse; M. musculus), and protozoa (T. gondii). Using integrative structural modeling, we computed a model of the basket in yeast and mammals that revealed how a hub of nucleoporins (Nups) in the nuclear ring binds to basket-forming Mlp/Tpr proteins: the coiled-coil domains of Mlp/Tpr form the struts of the basket, while their unstructured termini constitute the basket distal densities, which potentially serve as a docking site for mRNA preprocessing before nucleocytoplasmic transport.


Assuntos
Transporte Ativo do Núcleo Celular , Complexo de Proteínas Formadoras de Poros Nucleares , Poro Nuclear , Saccharomyces cerevisiae , Animais , Poro Nuclear/metabolismo , Poro Nuclear/ultraestrutura , Poro Nuclear/química , Saccharomyces cerevisiae/metabolismo , Complexo de Proteínas Formadoras de Poros Nucleares/metabolismo , Complexo de Proteínas Formadoras de Poros Nucleares/química , Camundongos , Núcleo Celular/metabolismo , Toxoplasma/metabolismo , Toxoplasma/ultraestrutura , Microscopia Crioeletrônica , RNA Mensageiro/metabolismo , Modelos Moleculares , Proteínas de Saccharomyces cerevisiae/metabolismo , Proteínas de Saccharomyces cerevisiae/química , Proteínas de Saccharomyces cerevisiae/ultraestrutura
2.
Cell ; 187(20): 5587-5603.e19, 2024 Oct 03.
Artigo em Inglês | MEDLINE | ID: mdl-39293445

RESUMO

Filoviruses, including the Ebola and Marburg viruses, cause hemorrhagic fevers with up to 90% lethality. The viral nucleocapsid is assembled by polymerization of the nucleoprotein (NP) along the viral genome, together with the viral proteins VP24 and VP35. We employed cryo-electron tomography of cells transfected with viral proteins and infected with model Ebola virus to illuminate assembly intermediates, as well as a 9 Å map of the complete intracellular assembly. This structure reveals a previously unresolved third and outer layer of NP complexed with VP35. The intrinsically disordered region, together with the C-terminal domain of this outer layer of NP, provides the constant width between intracellular nucleocapsid bundles and likely functions as a flexible tether to the viral matrix protein in the virion. A comparison of intracellular nucleocapsids with prior in-virion nucleocapsid structures reveals that the nucleocapsid further condenses vertically in the virion. The interfaces responsible for nucleocapsid assembly are highly conserved and offer targets for broadly effective antivirals.


Assuntos
Ebolavirus , Tomografia com Microscopia Eletrônica , Nucleocapsídeo , Montagem de Vírus , Ebolavirus/ultraestrutura , Ebolavirus/química , Ebolavirus/metabolismo , Ebolavirus/fisiologia , Nucleocapsídeo/metabolismo , Nucleocapsídeo/ultraestrutura , Nucleocapsídeo/química , Humanos , Microscopia Crioeletrônica/métodos , Proteínas do Nucleocapsídeo/química , Proteínas do Nucleocapsídeo/metabolismo , Proteínas do Nucleocapsídeo/ultraestrutura , Nucleoproteínas/química , Nucleoproteínas/metabolismo , Nucleoproteínas/ultraestrutura , Animais , Proteínas Virais/metabolismo , Proteínas Virais/química , Proteínas Virais/ultraestrutura , Modelos Moleculares , Vírion/ultraestrutura , Vírion/metabolismo , Doença pelo Vírus Ebola/virologia , Chlorocebus aethiops
3.
Cell ; 182(6): 1508-1518.e16, 2020 09 17.
Artigo em Inglês | MEDLINE | ID: mdl-32783917

RESUMO

Mutations in leucine-rich repeat kinase 2 (LRRK2) are the most frequent cause of familial Parkinson's disease. LRRK2 is a multi-domain protein containing a kinase and GTPase. Using correlative light and electron microscopy, in situ cryo-electron tomography, and subtomogram analysis, we reveal a 14-Å structure of LRRK2 bearing a pathogenic mutation that oligomerizes as a right-handed double helix around microtubules, which are left-handed. Using integrative modeling, we determine the architecture of LRRK2, showing that the GTPase and kinase are in close proximity, with the GTPase closer to the microtubule surface, whereas the kinase is exposed to the cytoplasm. We identify two oligomerization interfaces mediated by non-catalytic domains. Mutation of one of these abolishes LRRK2 microtubule-association. Our work demonstrates the power of cryo-electron tomography to generate models of previously unsolved structures in their cellular environment.


Assuntos
Microscopia Crioeletrônica/métodos , Tomografia com Microscopia Eletrônica/métodos , Serina-Treonina Proteína Quinase-2 com Repetições Ricas em Leucina/química , Microtúbulos/metabolismo , Doença de Parkinson/metabolismo , Citoplasma/metabolismo , GTP Fosfo-Hidrolases/química , GTP Fosfo-Hidrolases/metabolismo , Células HEK293 , Humanos , Microscopia Eletrônica de Transmissão , Microtúbulos/química , Modelos Químicos , Mutação , Doença de Parkinson/genética , Doença de Parkinson/patologia , Fosfotransferases/química , Fosfotransferases/metabolismo , Domínios Proteicos , Repetições WD40
4.
Annu Rev Biochem ; 88: 113-135, 2019 06 20.
Artigo em Inglês | MEDLINE | ID: mdl-30830798

RESUMO

Integrative structure modeling computationally combines data from multiple sources of information with the aim of obtaining structural insights that are not revealed by any single approach alone. In the first part of this review, we survey the commonly used sources of structural information and the computational aspects of model building. Throughout the past decade, integrative modeling was applied to various biological systems, with a focus on large protein complexes. Recent progress in the field of cryo-electron microscopy (cryo-EM) has resolved many of these complexes to near-atomic resolution. In the second part of this review, we compare a range of published integrative models with their higher-resolution counterparts with the aim of critically assessing their accuracy. This comparison gives a favorable view of integrative modeling and demonstrates its ability to yield accurate and informative results. We discuss possible roles of integrative modeling in the new era of cryo-EM and highlight future challenges and directions.


Assuntos
Microscopia Crioeletrônica/métodos , Cristalografia por Raios X/métodos , Espectroscopia de Ressonância Magnética/métodos , Espectrometria de Massas/métodos , Modelos Moleculares , Proteínas/ultraestrutura , Reagentes de Ligações Cruzadas/química , Microscopia Crioeletrônica/história , Microscopia Crioeletrônica/instrumentação , Cristalografia por Raios X/história , Cristalografia por Raios X/instrumentação , História do Século XX , História do Século XXI , Espectroscopia de Ressonância Magnética/história , Espectroscopia de Ressonância Magnética/instrumentação , Espectrometria de Massas/história , Espectrometria de Massas/instrumentação , Conformação Proteica , Proteínas/química , Software
5.
Cell ; 173(1): 11-19, 2018 03 22.
Artigo em Inglês | MEDLINE | ID: mdl-29570991

RESUMO

The construction of a predictive model of an entire eukaryotic cell that describes its dynamic structure from atomic to cellular scales is a grand challenge at the intersection of biology, chemistry, physics, and computer science. Having such a model will open new dimensions in biological research and accelerate healthcare advancements. Developing the necessary experimental and modeling methods presents abundant opportunities for a community effort to realize this goal. Here, we present a vision for creation of a spatiotemporal multi-scale model of the pancreatic ß-cell, a relevant target for understanding and modulating the pathogenesis of diabetes.


Assuntos
Células Secretoras de Insulina/metabolismo , Modelos Biológicos , Biologia Computacional , Descoberta de Drogas , Humanos , Células Secretoras de Insulina/citologia , Proteínas/química , Proteínas/metabolismo
6.
Mol Cell Proteomics ; 23(3): 100724, 2024 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-38266916

RESUMO

We propose a pipeline that combines AlphaFold2 (AF2) and crosslinking mass spectrometry (XL-MS) to model the structure of proteins with multiple conformations. The pipeline consists of two main steps: ensemble generation using AF2 and conformer selection using XL-MS data. For conformer selection, we developed two scores-the monolink probability score (MP) and the crosslink probability score (XLP)-both of which are based on residue depth from the protein surface. We benchmarked MP and XLP on a large dataset of decoy protein structures and showed that our scores outperform previously developed scores. We then tested our methodology on three proteins having an open and closed conformation in the Protein Data Bank: Complement component 3 (C3), luciferase, and glutamine-binding periplasmic protein, first generating ensembles using AF2, which were then screened for the open and closed conformations using experimental XL-MS data. In five out of six cases, the most accurate model within the AF2 ensembles-or a conformation within 1 Å of this model-was identified using crosslinks, as assessed through the XLP score. In the remaining case, only the monolinks (assessed through the MP score) successfully identified the open conformation of glutamine-binding periplasmic protein, and these results were further improved by including the "occupancy" of the monolinks. This serves as a compelling proof-of-concept for the effectiveness of monolinks. In contrast, the AF2 assessment score was only able to identify the most accurate conformation in two out of six cases. Our results highlight the complementarity of AF2 with experimental methods like XL-MS, with the MP and XLP scores providing reliable metrics to assess the quality of the predicted models. The MP and XLP scoring functions mentioned above are available at https://gitlab.com/topf-lab/xlms-tools.


Assuntos
Glutamina , Proteínas Periplásmicas , Furilfuramida , Espectrometria de Massas , Conformação Proteica , Proteínas de Membrana
7.
Proc Natl Acad Sci U S A ; 119(5)2022 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-35082148

RESUMO

Triggering receptor expressed on myeloid cells 2 (TREM2) is a single-pass transmembrane receptor of the immunoglobulin superfamily that is secreted in a soluble (sTREM2) form. Mutations in TREM2 have been linked to increased risk of Alzheimer's disease (AD). A prominent neuropathological component of AD is deposition of the amyloid-ß (Aß) into plaques, particularly Aß40 and Aß42. While the membrane-bound form of TREM2 is known to facilitate uptake of Aß fibrils and the polarization of microglial processes toward amyloid plaques, the role of its soluble ectodomain, particularly in interactions with monomeric or fibrillar Aß, has been less clear. Our results demonstrate that sTREM2 does not bind to monomeric Aß40 and Aß42, even at a high micromolar concentration, while it does bind to fibrillar Aß42 and Aß40 with equal affinities (2.6 ± 0.3 µM and 2.3 ± 0.4 µM). Kinetic analysis shows that sTREM2 inhibits the secondary nucleation step in the fibrillization of Aß, while having little effect on the primary nucleation pathway. Furthermore, binding of sTREM2 to fibrils markedly enhanced uptake of fibrils into human microglial and neuroglioma derived cell lines. The disease-associated sTREM2 mutant, R47H, displayed little to no effect on fibril nucleation and binding, but it decreased uptake and functional responses markedly. We also probed the structure of the WT sTREM2-Aß fibril complex using integrative molecular modeling based primarily on the cross-linking mass spectrometry data. The model shows that sTREM2 binds fibrils along one face of the structure, leaving a second, mutation-sensitive site free to mediate cellular binding and uptake.


Assuntos
Peptídeos beta-Amiloides/metabolismo , Amiloide/metabolismo , Glicoproteínas de Membrana/metabolismo , Receptores Imunológicos/metabolismo , Doença de Alzheimer/genética , Doença de Alzheimer/metabolismo , Amiloide/genética , Peptídeos beta-Amiloides/genética , Animais , Humanos , Cinética , Glicoproteínas de Membrana/genética , Camundongos , Microglia/metabolismo , Mutação/genética , Fragmentos de Peptídeos/genética , Fragmentos de Peptídeos/metabolismo , Placa Amiloide/genética , Placa Amiloide/metabolismo , Receptores Imunológicos/genética , Proteínas tau/genética , Proteínas tau/metabolismo
8.
J Proteome Res ; 23(3): 1049-1061, 2024 03 01.
Artigo em Inglês | MEDLINE | ID: mdl-38372774

RESUMO

Elucidating antibody-antigen complexes at the atomic level is of utmost interest for understanding immune responses and designing better therapies. Cross-linking mass spectrometry (XL-MS) has emerged as a powerful tool for mapping protein-protein interactions, suggesting valuable structural insights. However, the use of XL-MS studies to enable epitope/paratope mapping of antibody-antigen complexes is still limited up to now. XL-MS data can be used to drive integrative modeling of antibody-antigen complexes, where cross-links information serves as distance restraints for the precise determination of binding interfaces. In this approach, XL-MS data are employed to identify connections between binding interfaces of the antibody and the antigen, thus informing molecular modeling. Current literature provides minimal input about the impact of XL-MS data on the integrative modeling of antibody-antigen complexes. Here, we applied XL-MS to retrieve information about binding interfaces of three antibody-antigen complexes. We leveraged XL-MS data to perform integrative modeling using HADDOCK (active-passive residues and distance restraints strategies) and AlphaLink2. We then compared these three approaches with initial predictions of investigated antibody-antigen complexes by AlphaFold Multimer. This work emphasizes the importance of cross-linking data in resolving conformational dynamics of antibody-antigen complexes, ultimately enhancing the design of better protein therapeutics and vaccines.


Assuntos
Complexo Antígeno-Anticorpo , Espectrometria de Massas , Mapeamento de Epitopos
9.
Biostatistics ; 2023 Jul 26.
Artigo em Inglês | MEDLINE | ID: mdl-37494883

RESUMO

Radionuclide imaging plays a critical role in the diagnosis and management of kidney obstruction. However, most practicing radiologists in US hospitals have insufficient time and resources to acquire training and experience needed to interpret radionuclide images, leading to increased diagnostic errors. To tackle this problem, Emory University embarked on a study that aims to develop a computer-assisted diagnostic (CAD) tool for kidney obstruction by mining and analyzing patient data comprised of renogram curves, ordinal expert ratings on the obstruction status, pharmacokinetic variables, and demographic information. The major challenges here are the heterogeneity in data modes and the lack of gold standard for determining kidney obstruction. In this article, we develop a statistically principled CAD tool based on an integrative latent class model that leverages heterogeneous data modalities available for each patient to provide accurate prediction of kidney obstruction. Our integrative model consists of three sub-models (multilevel functional latent factor regression model, probit scalar-on-function regression model, and Gaussian mixture model), each of which is tailored to the specific data mode and depends on the unknown obstruction status (latent class). An efficient MCMC algorithm is developed to train the model and predict kidney obstruction with associated uncertainty. Extensive simulations are conducted to evaluate the performance of the proposed method. An application to an Emory renal study demonstrates the usefulness of our model as a CAD tool for kidney obstruction.

10.
Proc Natl Acad Sci U S A ; 118(34)2021 08 24.
Artigo em Inglês | MEDLINE | ID: mdl-34373319

RESUMO

Atomic structures of several proteins from the coronavirus family are still partial or unavailable. A possible reason for this gap is the instability of these proteins outside of the cellular context, thereby prompting the use of in-cell approaches. In situ cross-linking and mass spectrometry (in situ CLMS) can provide information on the structures of such proteins as they occur in the intact cell. Here, we applied targeted in situ CLMS to structurally probe Nsp1, Nsp2, and nucleocapsid (N) proteins from severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and obtained cross-link sets with an average density of one cross-link per 20 residues. We then employed integrative modeling that computationally combined the cross-linking data with domain structures to determine full-length atomic models. For the Nsp2, the cross-links report on a complex topology with long-range interactions. Integrative modeling with structural prediction of individual domains by the AlphaFold2 system allowed us to generate a single consistent all-atom model of the full-length Nsp2. The model reveals three putative metal binding sites and suggests a role for Nsp2 in zinc regulation within the replication-transcription complex. For the N protein, we identified multiple intra- and interdomain cross-links. Our integrative model of the N dimer demonstrates that it can accommodate three single RNA strands simultaneously, both stereochemically and electrostatically. For the Nsp1, cross-links with the 40S ribosome were highly consistent with recent cryogenic electron microscopy structures. These results highlight the importance of cellular context for the structural probing of recalcitrant proteins and demonstrate the effectiveness of targeted in situ CLMS and integrative modeling.


Assuntos
Modelos Moleculares , SARS-CoV-2/química , Proteínas Virais/química , Reagentes de Ligações Cruzadas/química , Células HEK293 , Humanos , Espectrometria de Massas , Domínios Proteicos
11.
Proc Natl Acad Sci U S A ; 118(35)2021 08 31.
Artigo em Inglês | MEDLINE | ID: mdl-34453000

RESUMO

Comprehensive modeling of a whole cell requires an integration of vast amounts of information on various aspects of the cell and its parts. To divide and conquer this task, we introduce Bayesian metamodeling, a general approach to modeling complex systems by integrating a collection of heterogeneous input models. Each input model can in principle be based on any type of data and can describe a different aspect of the modeled system using any mathematical representation, scale, and level of granularity. These input models are 1) converted to a standardized statistical representation relying on probabilistic graphical models, 2) coupled by modeling their mutual relations with the physical world, and 3) finally harmonized with respect to each other. To illustrate Bayesian metamodeling, we provide a proof-of-principle metamodel of glucose-stimulated insulin secretion by human pancreatic ß-cells. The input models include a coarse-grained spatiotemporal simulation of insulin vesicle trafficking, docking, and exocytosis; a molecular network model of glucose-stimulated insulin secretion signaling; a network model of insulin metabolism; a structural model of glucagon-like peptide-1 receptor activation; a linear model of a pancreatic cell population; and ordinary differential equations for systemic postprandial insulin response. Metamodeling benefits from decentralized computing, while often producing a more accurate, precise, and complete model that contextualizes input models as well as resolves conflicting information. We anticipate Bayesian metamodeling will facilitate collaborative science by providing a framework for sharing expertise, resources, data, and models, as exemplified by the Pancreatic ß-Cell Consortium.


Assuntos
Modelos Biológicos , Teorema de Bayes , Simulação por Computador , Humanos , Modelos Lineares
12.
Biol Chem ; 404(8-9): 741-754, 2023 07 26.
Artigo em Inglês | MEDLINE | ID: mdl-37505205

RESUMO

There is a growing interest in characterizing the structure and dynamics of large biomolecular assemblies and their interactions within the cellular environment. A diverse array of experimental techniques allows us to study biomolecular systems on a variety of length and time scales. These techniques range from imaging with light, X-rays or electrons, to spectroscopic methods, cross-linking mass spectrometry and functional genomics approaches, and are complemented by AI-assisted protein structure prediction methods. A challenge is to integrate all of these data into a model of the system and its functional dynamics. This review focuses on Bayesian approaches to integrative structure modeling. We sketch the principles of Bayesian inference, highlight recent applications to integrative modeling and conclude with a discussion of current challenges and future perspectives.


Assuntos
Genômica , Proteínas , Modelos Moleculares , Teorema de Bayes , Proteínas/química , Espectrometria de Massas/métodos
13.
Mol Cell Proteomics ; 20: 100139, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34418567

RESUMO

Proteomics methodology has expanded to include protein structural analysis, primarily through cross-linking mass spectrometry (XL-MS) and hydrogen-deuterium exchange mass spectrometry (HX-MS). However, while the structural proteomics community has effective tools for primary data analysis, there is a need for structure modeling pipelines that are accessible to the proteomics specialist. Integrative structural biology requires the aggregation of multiple distinct types of data to generate models that satisfy all inputs. Here, we describe IMProv, an app in the Mass Spec Studio that combines XL-MS data with other structural data, such as cryo-EM densities and crystallographic structures, for integrative structure modeling on high-performance computing platforms. The resource provides an easily deployed bundle that includes the open-source Integrative Modeling Platform program (IMP) and its dependencies. IMProv also provides functionality to adjust cross-link distance restraints according to the underlying dynamics of cross-linked sites, as characterized by HX-MS. A dynamics-driven conditioning of restraint values can improve structure modeling precision, as illustrated by an integrative structure of the five-membered Polycomb Repressive Complex 2. IMProv is extensible to additional types of data.


Assuntos
Modelos Moleculares , Proteômica/métodos , Software , Espectrometria de Massas , Complexo Repressor Polycomb 2/química , Conformação Proteica
14.
Proc Natl Acad Sci U S A ; 117(1): 93-102, 2020 01 07.
Artigo em Inglês | MEDLINE | ID: mdl-31848235

RESUMO

Detailed mechanistic understanding of protein complex function is greatly enhanced by insights from its 3-dimensional structure. Traditional methods of protein structure elucidation remain expensive and labor-intensive and require highly purified starting material. Chemical cross-linking coupled with mass spectrometry offers an alternative that has seen increased use, especially in combination with other experimental approaches like cryo-electron microscopy. Here we report advances in method development, combining several orthogonal cross-linking chemistries as well as improvements in search algorithms, statistical analysis, and computational cost to achieve coverage of 1 unique cross-linked position pair for every 7 amino acids at a 1% false discovery rate. This is accomplished without any peptide-level fractionation or enrichment. We apply our methods to model the complex between a carbonic anhydrase (CA) and its protein inhibitor, showing that the cross-links are self-consistent and define the interaction interface at high resolution. The resulting model suggests a scaffold for development of a class of protein-based inhibitors of the CA family of enzymes. We next cross-link the yeast proteasome, identifying 3,893 unique cross-linked peptides in 3 mass spectrometry runs. The dataset includes 1,704 unique cross-linked position pairs for the proteasome subunits, more than half of them intersubunit. Using multiple recently solved cryo-EM structures, we show that observed cross-links reflect the conformational dynamics and disorder of some proteasome subunits. We further demonstrate that this level of cross-linking density is sufficient to model the architecture of the 19-subunit regulatory particle de novo.


Assuntos
Reagentes de Ligações Cruzadas/química , Microscopia Crioeletrônica/métodos , Modelos Moleculares , Domínios e Motivos de Interação entre Proteínas , Proteínas/química , Sequenciamento de Cromatina por Imunoprecipitação , Imageamento Tridimensional/métodos , Espectrometria de Massas , Peptídeos/metabolismo , Complexo de Endopeptidases do Proteassoma/química , Conformação Proteica , Saccharomyces cerevisiae/metabolismo
15.
Int J Mol Sci ; 24(19)2023 Oct 04.
Artigo em Inglês | MEDLINE | ID: mdl-37834348

RESUMO

Homologous recombination (HR) is a fundamental process common to all species. HR aims to faithfully repair DNA double strand breaks. HR involves the formation of nucleoprotein filaments on DNA single strands (ssDNA) resected from the break. The nucleoprotein filaments search for homologous regions in the genome and promote strand exchange with the ssDNA homologous region in an unbroken copy of the genome. HR has been the object of intensive studies for decades. Because multi-scale dynamics is a fundamental aspect of this process, studying HR is highly challenging, both experimentally and using computational approaches. Nevertheless, knowledge has built up over the years and has recently progressed at an accelerated pace, borne by increasingly focused investigations using new techniques such as single molecule approaches. Linking this knowledge to the atomic structure of the nucleoprotein filament systems and the succession of unstable, transient intermediate steps that takes place during the HR process remains a challenge; modeling retains a very strong role in bridging the gap between structures that are stable enough to be observed and in exploring transition paths between these structures. However, working on ever-changing long filament systems submitted to kinetic processes is full of pitfalls. This review presents the modeling tools that are used in such studies, their possibilities and limitations, and reviews the advances in the knowledge of the HR process that have been obtained through modeling. Notably, we will emphasize how cooperative behavior in the HR nucleoprotein filament enables modeling to produce reliable information.


Assuntos
Recombinação Homóloga , Recombinases Rec A , Recombinases Rec A/metabolismo , DNA de Cadeia Simples/genética , Nucleoproteínas/genética , Quebras de DNA de Cadeia Dupla
16.
J Struct Biol ; 214(1): 107841, 2022 03.
Artigo em Inglês | MEDLINE | ID: mdl-35149213

RESUMO

Integrative modeling computes a model based on varied types of input information, be it from experiments or prior models. Often, a type of input information will be best handled by a specific modeling software package. In such a case, we desire to integrate our integrative modeling software package, Integrative Modeling Platform (IMP), with software specialized to the computational demands of the modeling problem at hand. After several attempts, however, we have concluded that even in collaboration with the software's developers, integration is either impractical or impossible. The reasons for the intractability of integration include software incompatibilities, differing modeling logic, the costs of collaboration, and academic incentives. In the integrative modeling software ecosystem, several large modeling packages exist with often redundant tools. We reason, therefore, that the other development groups have similarly concluded that the benefit of integration does not justify the cost. As a result, modelers are often restricted to the set of tools within a single software package. The inability to integrate tools from distinct software negatively impacts the quality of the models and the efficiency of the modeling. As the complexity of modeling problems grows, we seek to galvanize developers and modelers to consider the long-term benefit that software interoperability yields. In this article, we formulate a demonstrative set of software standards for implementing a model search using tools from independent software packages and discuss our efforts to integrate IMP and the crystallography suite Phenix within the Bayesian modeling framework.


Assuntos
Ecossistema , Teorema de Bayes , Software
17.
J Biol Chem ; 296: 100743, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-33957123

RESUMO

Integrative modeling is an increasingly important tool in structural biology, providing structures by combining data from varied experimental methods and prior information. As a result, molecular architectures of large, heterogeneous, and dynamic systems, such as the ∼52-MDa Nuclear Pore Complex, can be mapped with useful accuracy, precision, and completeness. Key challenges in improving integrative modeling include expanding model representations, increasing the variety of input data and prior information, quantifying a match between input information and a model in a Bayesian fashion, inventing more efficient structural sampling, as well as developing better model validation, analysis, and visualization. In addition, two community-level challenges in integrative modeling are being addressed under the auspices of the Worldwide Protein Data Bank (wwPDB). First, the impact of integrative structures is maximized by PDB-Development, a prototype wwPDB repository for archiving, validating, visualizing, and disseminating integrative structures. Second, the scope of structural biology is expanded by linking the wwPDB resource for integrative structures with archives of data that have not been generally used for structure determination but are increasingly important for computing integrative structures, such as data from various types of mass spectrometry, spectroscopy, optical microscopy, proteomics, and genetics. To address the largest of modeling problems, a type of integrative modeling called metamodeling is being developed; metamodeling combines different types of input models as opposed to different types of data to compute an output model. Collectively, these developments will facilitate the structural biology mindset in cell biology and underpin spatiotemporal mapping of the entire cell.


Assuntos
Biologia Celular/história , Bases de Dados de Proteínas/história , Modelos Moleculares , Biologia Molecular/história , Animais , História do Século XX , História do Século XXI , Humanos
18.
J Biol Chem ; 296: 100748, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-33957128

RESUMO

In part 1 of this remarkable collection, we told you the story of The Protein Data Bank (PDB) (1), which was founded 50 years ago, and we illustrated the breadth of the science contained within it with ten informative review articles. The second half of this collection is a continuation of our celebrations to mark this momentous anniversary. Part 2 provides eight more superb articles describing how the PDB has influenced biology over the course of the last half-century and how biology has fueled the deposition of impactful structures in the PDB. Here are some brief synopses of the articles you will enjoy in part 2!


Assuntos
Bases de Dados de Proteínas , Proteínas/química , Cristalografia por Raios X , Conformação Proteica
19.
Acta Biochim Biophys Sin (Shanghai) ; 54(9): 1213-1221, 2022 Aug 25.
Artigo em Inglês | MEDLINE | ID: mdl-36017893

RESUMO

A whole-cell model represents certain aspects of the cell structure and/or function. Due to the high complexity of the cell, an integrative modeling approach is often taken to utilize all available information including experimental data, prior knowledge and prior models. In this review, we summarize an emerging workflow of whole-cell modeling into five steps: (i) gather information; (ii) represent the modeled system into modules; (iii) translate input information into scoring function; (iv) sample the whole-cell model; (v) validate and interpret the model. In particular, we propose the integrative modeling of the cell by combining available (whole-cell) models to maximize the accuracy, precision, and completeness. In addition, we list quantitative predictions of various aspects of cell biology from existing whole-cell models. Moreover, we discuss the remaining challenges and future directions, and highlight the opportunity to establish an integrative spatiotemporal multi-scale whole-cell model based on a community approach.


Assuntos
Modelos Biológicos
20.
Proteins ; 88(8): 1029-1036, 2020 08.
Artigo em Inglês | MEDLINE | ID: mdl-31886559

RESUMO

Our information-driven docking approach HADDOCK has demonstrated a sustained performance since the start of its participation to CAPRI. This is due, in part, to its ability to integrate data into the modeling process, and to the robustness of its scoring function. We participated in CAPRI both as server and manual predictors. In CAPRI rounds 38-45, we have used various strategies depending on the available information. These ranged from imposing restraints to a few residues identified from literature as being important for the interaction, to binding pockets identified from homologous complexes or template-based refinement/CA-CA restraint-guided docking from identified templates. When relevant, symmetry restraints were used to limit the conformational sampling. We also tested for a large decamer target a new implementation of the MARTINI coarse-grained force field in HADDOCK. Overall, we obtained acceptable or better predictions for 13 and 11 server and manual submissions, respectively, out of the 22 interfaces. Our server performance (acceptable or higher-quality models when considering the top 10) was better (59%) than the manual (50%) one, in which we typically experiment with various combinations of protocols and data sources. Again, our simple scoring function based on a linear combination of intermolecular van der Waals and electrostatic energies and an empirical desolvation term demonstrated a good performance in the scoring experiment with a 63% success rate across all 22 interfaces. An analysis of model quality indicates that, while we are consistently performing well in generating acceptable models, there is room for improvement for generating/identifying higher quality models.


Assuntos
Simulação de Acoplamento Molecular , Peptídeos/química , Proteínas/química , Software , Sequência de Aminoácidos , Sítios de Ligação , Humanos , Ligantes , Peptídeos/metabolismo , Ligação Proteica , Conformação Proteica em alfa-Hélice , Conformação Proteica em Folha beta , Domínios e Motivos de Interação entre Proteínas , Mapeamento de Interação de Proteínas , Multimerização Proteica , Proteínas/metabolismo , Projetos de Pesquisa , Homologia Estrutural de Proteína , Termodinâmica
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa