RESUMO
The transplantation of loops between structurally related proteins is a compelling method to improve the activity, specificity and stability of enzymes. However, despite the interest of loop regions in protein engineering, the available methods of loop-based rational protein design are scarce. One particular difficulty related to loop engineering is the unique dynamism that enables them to exert allosteric control over the catalytic function of enzymes. Thus, when engaging in a transplantation effort, such dynamics in the context of protein structure need consideration. A second practical challenge is identifying successful excision points for the transplantation or grafting. Here, we present LoopGrafter (https://loschmidt.chemi.muni.cz/loopgrafter/), a web server that specifically guides in the loop grafting process between structurally related proteins. The server provides a step-by-step interactive procedure in which the user can successively identify loops in the two input proteins, calculate their geometries, assess their similarities and dynamics, and select a number of loops to be transplanted. All possible different chimeric proteins derived from any existing recombination point are calculated, and 3D models for each of them are constructed and energetically evaluated. The obtained results can be interactively visualized in a user-friendly graphical interface and downloaded for detailed structural analyses.
Assuntos
Proteínas , Software , Modelos Moleculares , Proteínas/genética , Proteínas/química , Engenharia de Proteínas , InternetRESUMO
Motivation: Studying the transport paths of ligands, solvents, or ions in transmembrane proteins and proteins with buried binding sites is fundamental to the understanding of their biological function. A detailed analysis of the structural features influencing the transport paths is also important for engineering proteins for biomedical and biotechnological applications. Results: CAVER Analyst 2.0 is a software tool for quantitative analysis and real-time visualization of tunnels and channels in static and dynamic structures. This version provides the users with many new functions, including advanced techniques for intuitive visual inspection of the spatiotemporal behavior of tunnels and channels. Novel integrated algorithms allow an efficient analysis and data reduction in large protein structures and molecular dynamic simulations. Availability and implementation: CAVER Analyst 2.0 is a multi-platform standalone Java-based application. Binaries and documentation are freely available at www.caver.cz. Supplementary information: Supplementary data are available at Bioinformatics online.
Assuntos
Simulação de Dinâmica Molecular , Proteínas/química , Algoritmos , Conformação Proteica , Engenharia de Proteínas , SoftwareRESUMO
BACKGROUND: Studying the patterns of protein-protein interactions (PPIs) is fundamental for understanding the structure and function of protein complexes. The exploration of the vast space of possible mutual configurations of interacting proteins and their contact zones is very time consuming and requires the proteomic expert knowledge. RESULTS: In this paper, we propose a novel tool containing a set of visual abstraction techniques for the guided exploration of PPI configuration space. It helps proteomic experts to select the most relevant configurations and explore their contact zones at different levels of detail. The system integrates a set of methods that follow and support the workflow of proteomics experts. The first visual abstraction method, the Matrix view, is based on customized interactive heat maps and provides the users with an overview of all possible residue-residue contacts in all PPI configurations and their interactive filtering. In this step, the user can traverse all input PPI configurations and obtain an overview of their interacting amino acids. Then, the models containing a particular pair of interacting amino acids can be selectively picked and traversed. Detailed information on the individual amino acids in the contact zones and their properties is presented in the Contact-Zone list-view. The list-view provides a comparative tool to rank the best models based on the similarity of their contacts to the template-structure contacts. All these techniques are interactively linked with other proposed methods, the Exploded view and the Open-Book view, which represent individual configurations in three-dimensional space. These representations solve the high overlap problem associated with many configurations. Using these views, the structural alignment of the best models can also be visually confirmed. CONCLUSIONS: We developed a system for the exploration of large sets of protein-protein complexes in a fast and intuitive way. The usefulness of our system has been tested and verified on several docking structures covering the three major types of PPIs, including coiled-coil, pocket-string, and surface-surface interactions. Our case studies prove that our tool helps to analyse and filter protein-protein complexes in a fraction of the time compared to using previously available techniques.
Assuntos
Mapeamento de Interação de Proteínas/métodos , Proteínas/metabolismo , Domínios e Motivos de Interação entre Proteínas , Estrutura Terciária de Proteína , Proteínas/químicaRESUMO
BACKGROUND: Protein function is determined by many factors, namely by its constitution, spatial arrangement, and dynamic behavior. Studying these factors helps the biochemists and biologists to better understand the protein behavior and to design proteins with modified properties. One of the most common approaches to these studies is to compare the protein structure with other molecules and to reveal similarities and differences in their polypeptide chains. RESULTS: We support the comparison process by proposing a new visualization technique that bridges the gap between traditionally used 1D and 3D representations. By introducing the information about mutual positions of protein chains into the 1D sequential representation the users are able to observe the spatial differences between the proteins without any occlusion commonly present in 3D view. Our representation is designed to serve namely for comparison of multiple proteins or a set of time steps of molecular dynamics simulation. CONCLUSIONS: The novel representation is demonstrated on two usage scenarios. The first scenario aims to compare a set of proteins from the family of cytochromes P450 where the position of the secondary structures has a significant impact on the substrate channeling. The second scenario focuses on the protein flexibility when by comparing a set of time steps our representation helps to reveal the most dynamically changing parts of the protein chain.
Assuntos
Simulação de Dinâmica Molecular , Estrutura Secundária de Proteína , Algoritmos , Sequência de Aminoácidos , Modelos Moleculares , Proteínas/química , Alinhamento de SequênciaRESUMO
BACKGROUND: Protein structures and their interaction with ligands have been in the focus of biochemistry and structural biology research for decades. The transportation of ligand into the protein active site is often complex process, driven by geometric and physico-chemical properties, which renders the ligand path full of jitter and impasses. This prevents understanding of the ligand transportation and reasoning behind its behavior along the path. RESULTS: To address the needs of the domain experts we design an explorative visualization solution based on a multi-scale simplification model. It helps to navigate the user to the most interesting parts of the ligand trajectory by exploring different attributes of the ligand and its movement, such as its distance to the active site, changes of amino acids lining the ligand, or ligand "stuckness". The process is supported by three linked views - 3D representation of the simplified trajectory, scatterplot matrix, and bar charts with line representation of ligand-lining amino acids. CONCLUSIONS: The usage of our tool is demonstrated on molecular dynamics simulations provided by the domain experts. The tool was tested by the domain experts from protein engineering and the results confirm that it helps to navigate the user to the most interesting parts of the ligand trajectory and to understand the ligand behavior.
Assuntos
Simulação de Dinâmica Molecular , Proteínas/química , Aminoácidos/química , Domínio Catalítico , Processamento de Imagem Assistida por Computador , Ligantes , Modelos Moleculares , Conformação ProteicaRESUMO
Molecular docking is a key technique in various fields like structural biology, medicinal chemistry, and biotechnology. It is widely used for virtual screening during drug discovery, computer-assisted drug design, and protein engineering. A general molecular docking process consists of the target and ligand selection, their preparation, and the docking process itself, followed by the evaluation of the results. However, the most commonly used docking software provides no or very basic evaluation possibilities. Scripting and external molecular viewers are often used, which are not designed for an efficient analysis of docking results. Therefore, we developed InVADo, a comprehensive interactive visual analysis tool for large docking data. It consists of multiple linked 2D and 3D views. It filters and spatially clusters the data, and enriches it with post-docking analysis results of protein-ligand interactions and functional groups, to enable well-founded decision-making. In an exemplary case study, domain experts confirmed that InVADo facilitates and accelerates the analysis workflow. They rated it as a convenient, comprehensive, and feature-rich tool, especially useful for virtual screening.
Assuntos
Gráficos por Computador , Software , Simulação de Acoplamento Molecular , Ligantes , Descoberta de Drogas/métodosRESUMO
In understanding and redesigning the function of proteins in modern biochemistry, protein engineers are increasingly focusing on exploring regions in proteins called loops. Analyzing various characteristics of these regions helps the experts design the transfer of the desired function from one protein to another. This process is denoted as loop grafting. We designed a set of interactive visualizations that provide experts with visual support through all the loop grafting pipeline steps. The workflow is divided into several phases, reflecting the steps of the pipeline. Each phase is supported by a specific set of abstracted 2D visual representations of proteins and their loops that are interactively linked with the 3D View of proteins. By sequentially passing through the individual phases, the user shapes the list of loops that are potential candidates for loop grafting. Finally, the actual in-silico insertion of the loop candidates from one protein to the other is performed, and the results are visually presented to the user. In this way, the fully computational rational design of proteins and their loops results in newly designed protein structures that can be further assembled and tested through in-vitro experiments. We showcase the contribution of our visual support design on a real case scenario changing the enantiomer selectivity of the engineered enzyme. Moreover, we provide the readers with the experts' feedback.
RESUMO
We present sMolBoxes, a dataflow representation for the exploration and analysis of long molecular dynamics (MD) simulations. When MD simulations reach millions of snapshots, a frame-by-frame observation is not feasible anymore. Thus, biochemists rely to a large extent only on quantitative analysis of geometric and physico-chemical properties. However, the usage of abstract methods to study inherently spatial data hinders the exploration and poses a considerable workload. sMolBoxes link quantitative analysis of a user-defined set of properties with interactive 3D visualizations. They enable visual explanations of molecular behaviors, which lead to an efficient discovery of biochemically significant parts of the MD simulation. sMolBoxes follow a node-based model for flexible definition, combination, and immediate evaluation of properties to be investigated. Progressive analytics enable fluid switching between multiple properties, which facilitates hypothesis generation. Each sMolBox provides quick insight to an observed property or function, available in more detail in the bigBox View. The case studies illustrate that even with relatively few sMolBoxes, it is possible to express complex analytical tasks, and their use in exploratory analysis is perceived as more efficient than traditional scripting-based methods.
RESUMO
DNA nanostructures offer promising applications, particularly in the biomedical domain, as they can be used for targeted drug delivery, construction of nanorobots, or as a basis for molecular motors. One of the most prominent techniques for assembling these structures is DNA origami. Nowadays, desktop applications are used for the in silico design of such structures. However, as such structures are often spatially complex, their assembly and analysis are complicated. Since virtual reality (VR) was proven to be advantageous for such spatial-related tasks and there are no existing VR solutions focused on this domain, we propose Vivern, a VR application that allows domain experts to design and visually examine DNA origami nanostructures. Our approach presents different abstracted visual representations of the nanostructures, various color schemes, and an ability to place several DNA nanostructures and proteins in one environment, thus allowing for the detailed analysis of complex assemblies. We also present two novel examination tools, the Magic Scale Lens and the DNA Untwister, that allow the experts to visually embed different representations into local regions to preserve the context and support detailed investigation. To showcase the capabilities of our solution, prototypes of novel nanodevices conceptualized by our collaborating experts, such as DNA-protein hybrid structures and DNA origami superstructures, are presented. Finally, the results of two rounds of evaluations are summarized. They demonstrate the advantages of our solution, especially for scenarios where current desktop tools are very limited, while also presenting possible future research directions.
Assuntos
Gráficos por Computador , Nanoestruturas , DNA , Proteínas/químicaRESUMO
In the process of understanding and redesigning the function of proteins in modern biochemistry, protein engineers are increasingly focusing on the exploration of regions in proteins called loops. Analyzing various characteristics of these regions helps the experts to design the transfer of the desired function from one protein to another. This process is denoted as loop grafting. As this process requires extensive manual treatment and currently there is no proper visual support for it, we designed LoopGrafter: a web-based tool that provides experts with visual support through all the loop grafting pipeline steps. The tool is logically divided into several phases, starting with the definition of two input proteins and ending with a set of grafted proteins. Each phase is supported by a specific set of abstracted 2D visual representations of loaded proteins and their loops that are interactively linked with the 3D view onto proteins. By sequentially passing through the individual phases, the user is shaping the list of loops that are potential candidates for loop grafting. In the end, the actual in-silico insertion of the loop candidates from one protein to the other is performed and the results are visually presented to the user. In this way, the fully computational rational design of proteins and their loops results in newly designed protein structures that can be further assembled and tested through in-vitro experiments. LoopGrafter was designed in tight collaboration with protein engineers, and its final appearance reflects many testing iterations. We showcase the contribution of LoopGrafter on a real case scenario and provide the readers with the experts' feedback, confirming the usefulness of our tool.
RESUMO
In the modern drug discovery process, medicinal chemists deal with the complexity of analysis of large ensembles of candidate molecules. Computational tools, such as dimensionality reduction (DR) and classification, are commonly used to efficiently process the multidimensional space of features. These underlying calculations often hinder interpretability of results and prevent experts from assessing the impact of individual molecular features on the resulting representations. To provide a solution for scrutinizing such complex data, we introduce ChemVA, an interactive application for the visual exploration of large molecular ensembles and their features. Our tool consists of multiple coordinated views: Hexagonal view, Detail view, 3D view, Table view, and a newly proposed Difference view designed for the comparison of DR projections. These views display DR projections combined with biological activity, selected molecular features, and confidence scores for each of these projections. This conjunction of views allows the user to drill down through the dataset and to efficiently select candidate compounds. Our approach was evaluated on two case studies of finding structurally similar ligands with similar binding affinity to a target protein, as well as on an external qualitative evaluation. The results suggest that our system allows effective visual inspection and comparison of different high-dimensional molecular representations. Furthermore, ChemVA assists in the identification of candidate compounds while providing information on the certainty behind different molecular representations.
Assuntos
Gráficos por Computador , ProteínasRESUMO
Understanding large amounts of spatiotemporal data from particle-based simulations, such as molecular dynamics, often relies on the computation and analysis of aggregate measures. These, however, by virtue of aggregation, hide structural information about the space/time localization of the studied phenomena. This leads to degenerate cases where the measures fail to capture distinct behaviour. In order to drill into these aggregate values, we propose a multi-scale visual exploration technique. Our novel representation, based on partial domain aggregation, enables the construction of a continuous scale-space for discrete datasets and the simultaneous exploration of scales in both space and time. We link these two scale-spaces in a scale-space space-time cube and model linked views as orthogonal slices through this cube, thus enabling the rapid identification of spatio-temporal patterns at multiple scales. To demonstrate the effectiveness of our approach, we showcase an advanced exploration of a protein-ligand simulation.
RESUMO
When studying multi-body protein complexes, biochemists use computational tools that can suggest hundreds or thousands of their possible spatial configurations. However, it is not feasible to experimentally verify more than only a very small subset of them. In this paper, we propose a novel multiscale visual drilldown approach that was designed in tight collaboration with proteomic experts, enabling a systematic exploration of the configuration space. Our approach takes advantage of the hierarchical structure of the data - from the whole ensemble of protein complex configurations to the individual configurations, their contact interfaces, and the interacting amino acids. Our new solution is based on interactively linked 2D and 3D views for individual hierarchy levels. At each level, we offer a set of selection and filtering operations that enable the user to narrow down the number of configurations that need to be manually scrutinized. Furthermore, we offer a dedicated filter interface, which provides the users with an overview of the applied filtering operations and enables them to examine their impact on the explored ensemble. This way, we maintain the history of the exploration process and thus enable the user to return to an earlier point of the exploration. We demonstrate the effectiveness of our approach on two case studies conducted by collaborating proteomic experts.
Assuntos
Processamento de Imagem Assistida por Computador/métodos , Modelos Moleculares , Proteínas/química , Proteômica/métodos , Algoritmos , Sequência de Aminoácidos , Gráficos por Computador , Humanos , Nucleossomos/química , Interface Usuário-ComputadorRESUMO
N-terminal acetyltransferases (NATs) belong to the superfamily of acetyltransferases. They are enzymes catalysing the transfer of an acetyl group from acetyl coenzyme A to the N-terminus of polypeptide chains. N-terminal acetylation is one of the most common protein modifications. To date, not much is known on the molecular basis for the exclusive substrate specificity of NATs. All NATs share a common fold called GNAT. A characteristic of NATs is the ß6ß7 hairpin loop covering the active site and forming with the α1α2 loop a narrow tunnel surrounding the catalytic site in which cofactor and polypeptide meet and exchange an acetyl group. We investigated the dynamics-function relationships of all available structures of NATs covering the three domains of Life. Using an elastic network model and normal mode analysis, we found a common dynamics pattern conserved through the GNAT fold; a rigid V-shaped groove formed by the ß4 and ß5 strands and splitting the fold in two dynamical subdomains. Loops α1α2, ß3ß4 and ß6ß7 all show clear displacements in the low frequency normal modes. We characterized the mobility of the loops and show that even limited conformational changes of the loops along the low-frequency modes are able to significantly change the size and shape of the ligand binding sites. Based on the fact that these movements are present in most low-frequency modes, and common to all NATs, we suggest that the α1α2 and ß6ß7 loops may regulate ligand uptake and the release of the acetylated polypeptide.
RESUMO
Networks of protein-protein interactions (PPI) constitute either stable or transient complexes in every cell. Most of the cellular complexes keep their function, and therefore stay similar, during evolution. The evolutionary constraints preserve most cellular functions via preservation of protein structures and interactions. The evolutionary conservation information is utilized in template-based approaches, like protein structure modeling or docking. Here we use the combination of the template-free docking method with conservation-based selection of the best docking model using our newly developed COZOID tool.We describe a step-by-step protocol for visual selection of docking models, based on their similarity to the original protein complex structure. Using the COZOID tool, we first analyze contact zones of the original complex structure and select contact amino acids for docking restraints. Then we model and dock the homologous proteins. Finally, we utilize different analytical modes of our COZOID tool to select the docking models most similar to the original complex structure.
Assuntos
Proteínas/química , Bases de Dados de Proteínas , Humanos , Ligação Proteica , Conformação Proteica , Mapeamento de Interação de Proteínas , Homologia Estrutural de ProteínaRESUMO
Aquaporin-4, encoded by AQP4, is the major water channel in the central nervous system and plays an important role in the brain's water balance, including edema formation and clearance. Using genomic copy-number analysis and trio-exome sequencing, we investigated a male patient with intellectual disability, hearing loss, and progressive gait dysfunction and found a de novo missense change Ser111Thr in AQP4 as the only suspicious finding. Perinatally, signs of brain ischemia were detected in relation to acute collapse 2 h after birth that resolved a few days later. At the age of 3 mo, cardiac hypertrophy was detected that persisted through childhood but was completely resolved by age 16. In theory, this neurodevelopmental disorder with transient cardiomyopathy could be caused by a disturbance of cellular water balance. Ser111 is an extremely conserved residue in the short cytoplasmic loop between AQP4 transmembrane helix 2 and 3, present across all AQP isoforms from plants to mammals, and it does not appear to be a phosphorylation site. We found that the Ser111Thr change does not affect water permeability or protein stability, suggesting another and possibly regulatory role. Although causality remains unproven, this case study draws attention to AQP4 as a candidate gene for a unique developmental disorder and to a specific serine as a residue of possibly great functional importance in many AQPs.
Assuntos
Aquaporina 4/genética , Isquemia Encefálica/genética , Isquemia Encefálica/fisiopatologia , Cardiomegalia/genética , Marcha/fisiologia , Deficiência Intelectual/genética , Deficiência Intelectual/fisiopatologia , Polimorfismo de Nucleotídeo Único/genética , Adolescente , Animais , Isquemia Encefálica/complicações , Cardiomegalia/complicações , Pré-Escolar , Humanos , Lactente , Recém-Nascido , Deficiência Intelectual/complicações , Masculino , Permeabilidade , Água/metabolismo , Xenopus laevis/metabolismo , Adulto JovemRESUMO
In this paper we propose a novel method for the interactive exploration of protein tunnels. The basic principle of our approach is that we entirely abstract from the 3D/4D space the simulated phenomenon is embedded in. A complex 3D structure and its curvature information is represented only by a straightened tunnel centerline and its width profile. This representation focuses on a key aspect of the studied geometry and frees up graphical estate to key chemical and physical properties represented by surrounding amino acids. The method shows the detailed tunnel profile and its temporal aggregation. The profile is interactively linked with a visual overview of all amino acids which are lining the tunnel over time. In this overview, each amino acid is represented by a set of colored lines depicting the spatial and temporal impact of the amino acid on the corresponding tunnel. This representation clearly shows the importance of amino acids with respect to selected criteria. It helps the biochemists to select the candidate amino acids for mutation which changes the protein function in a desired way. The AnimoAminoMiner was designed in close cooperation with domain experts. Its usefulness is documented by their feedback and a case study, which are included.