RESUMO
The possible effects of mutations on stability and function of a protein can only be understood in the context of protein 3D structure. The MutationExplorer webserver maps sequence changes onto protein structures and allows users to study variation by inputting sequence changes. As the user enters variants, the 3D model evolves, and estimated changes in energy are highlighted. In addition to a basic per-residue input format, MutationExplorer can also upload an entire replacement sequence. Previously the purview of desktop applications, such an upload can back-mutate PDB structures to wildtype sequence in a single step. Another supported variation source is human single nucelotide polymorphisms (SNPs), genomic coordinates input in VCF format. Structures are flexibly colorable, not only by energetic differences, but also by hydrophobicity, sequence conservation, or other biochemical profiling. Coloring by interface score reveals mutation impacts on binding surfaces. MutationExplorer strives for efficiency in user experience. For example, we have prepared 45 000 PDB depositions for instant retrieval and initial display. All modeling steps are performed by Rosetta. Visualizations leverage MDsrv/Mol*. MutationExplorer is available at: http://proteinformatics.org/mutation_explorer/.
Assuntos
Internet , Modelos Moleculares , Mutação , Conformação Proteica , Proteínas , Software , Proteínas/genética , Proteínas/química , Humanos , Gráficos por Computador , TermodinâmicaRESUMO
MOTIVATION: The SmoothT software and webservice offers the construction of pathways from an ensemble of conformations. The user provides an archive of molecule conformations in Protein Databank (PDB) format, from which a starting and a final conformation need to be selected. The individual PDB files need to contain an energy value or score, estimating the quality of the respective confirmation. Additionally, the user has to provide a root-mean-square deviation (RMSD) cut-off, below which conformations are considered neighboring. From this, SmoothT constructs a graph that connects similar conformations. RESULTS: SmoothT returns the energetically most favorable pathway within in this graph. This pathway is directly displayed as interactive animation using the NGL viewer. Simultaneously, the energy along the pathway is plotted, highlighting the conformation that is currently displayed in the 3D window. AVAILABILITY AND IMPLEMENTATION: SmoothT is available as webservice at: http://proteinformatics.org/smoothT. Examples, a tutorial, and FAQs can be found there. Ensembles up to 2 GB (compressed) can be uploaded. Results will be stored for 5 days. The server is completely free and requires no registration. The C++ source code is available at: https://github.com/starbeachlab/smoothT.
Assuntos
Computadores , Proteínas , Proteínas/química , Conformação Molecular , Software , Bases de Dados de Proteínas , Conformação Proteica , InternetRESUMO
The AlignMe web server is dedicated to accurately aligning sequences of membrane proteins, a particularly challenging task due to the strong evolutionary divergence and the low compositional complexity of hydrophobic membrane-spanning proteins. AlignMe can create pairwise alignments of either two primary amino acid sequences or two hydropathy profiles. The web server for AlignMe has been continuously available for >10 years, supporting 1000s of users per year. Recent improvements include anchoring, multiple submissions, and structure visualization. Anchoring is the ability to constrain a position in an alignment, which allows expert information about related residues in proteins to be incorporated into an alignment without manual modification. The original web interface to the server limited the user to one alignment per submission, hindering larger scale studies. Now, batches of alignments can be initiated with a single submission. Finally, to provide structural context for the relationship between proteins, sequence similarity can now be mapped onto one or more structures (or structural models) of the proteins being aligned, by links to MutationExplorer, a web-based visualization tool. Together with a refreshed user interface, these features further enhance an important resource in the membrane protein community. The AlignMe web server is freely available at https://www.bioinfo.mpg.de/AlignMe/.
Assuntos
Proteínas de Membrana , Software , Proteínas de Membrana/genética , Sequência de Aminoácidos , Algoritmos , Alinhamento de Sequência , InternetRESUMO
Molecular dynamics simulation is a proven technique for computing and visualizing the time-resolved motion of macromolecules at atomic resolution. The MDsrv is a tool that streams MD trajectories and displays them interactively in web browsers without requiring advanced skills, facilitating interactive exploration and collaborative visual analysis. We have now enhanced the MDsrv to further simplify the upload and sharing of MD trajectories and improve their online viewing and analysis. With the new instance, the MDsrv simplifies the creation of sessions, which allows the exchange of MD trajectories with preset representations and perspectives. An important innovation is that the MDsrv can now access and visualize trajectories from remote datasets, which greatly expands its applicability and use, as the data no longer needs to be accessible on a local server. In addition, initial analyses such as sequence or structure alignments, distance measurements, or RMSD calculations have been implemented, which optionally support visual analysis. Finally, based on Mol*, MDsrv now provides faster and more efficient visualization of even large trajectories compared to its predecessor tool NGL.
Assuntos
Visualização de Dados , Internet , Simulação de Dinâmica Molecular , Software , Computadores , NavegadorRESUMO
We present an updated version of the Voronoia service that enables fully automated analysis of the atomic packing density of macromolecules. Voronoia combines previous efforts to analyse 3D protein and RNA structures into a single service, combined with state-of-the-art online visualization. Voronoia uses the Voronoi cell method to calculate the free space between neighbouring atoms to estimate van der Waals interactions. Compared to other methods that derive van der Waals interactions by calculating solvent-free surfaces, it explicitly considers volume or packing defects. Large internal voids refer either to water molecules or ions unresolved by X-ray crystallography or cryo-EM, cryptic ligand binding pockets, or parts of a structural model that require further refinement. Voronoia is, therefore mainly used for functional analyses of 3D structures and quality assessments of structural models. Voronoia 4-ever updates the database of precomputed packing densities of PDB entries, allows uploading multiple structures, adds new filter options and facilitates direct access to the results through intuitive display with the NGL viewer. Voronoia is available at: htttp://proteinformatics.org/voronoia.
Assuntos
Conformação Proteica , Software , Modelos Moleculares , RNA/químicaRESUMO
In this proof-of-principle study, we systematically studied the potential of Raman spectroscopy for detecting pre-analytical delays in blood serum samples. Spectra from 330 samples from a liver cirrhosis cohort were acquired over the course of eight days, stored one day at room temperature, and stored subsequently at 4 °C. The spectra were then used to train Convolutional Neural Networks (CNN) to predict the delay to sample examination. We achieved 90% accuracy for binary classification of the serum samples in the groups "without delay" versus "delayed". Spectra recorded on the first day could be distinguished clearly from all subsequent measurements. Distinguishing between spectra taken in the range from the second to the last day seems to be possible as well, but currently, with an accuracy of approximately 70% only. Importantly, filtering out the fluorescent background significantly reduces the precision of detection.
Assuntos
Redes Neurais de Computação , Análise Espectral Raman , Humanos , Análise Espectral Raman/métodos , Cirrose HepáticaRESUMO
BACKGROUND: Virus-like-particles (VLPs) are attractive nanoparticulate scaffolds for broad applications in material/biological sciences and medicine. Prior their functionalization, specific adaptations have to be carried out. These adjustments frequently lead to disordered particles, but the particle integrity is an essential factor for the VLP suitability. Therefore, major requirements for particle stabilization exist. The objective of this study was to evaluate novel stabilizing elements for functionalized chimeric hepatitis B virus core antigen virus-like particles (HBcAg-VLP), with beneficial characteristics for vaccine development, imaging or delivery. RESULTS: The effects of a carboxy-terminal polyhistidine-peptide and an intradimer disulfide-bridge on the stability of preclinically approved chimeric HBcAg-VLPs were assessed. We purified recombinant chimeric HBcAg-VLPs bearing different modified C-termini and compared their physical and chemical particle stability by quantitative protein-biochemical and biophysical techniques. We observed lower chemical resistance of T = 3- compared to T = 4-VLP (triangulation number) capsids and profound impairment of accessibility of hexahistidine-peptides in assembled VLPs. Histidines attached to the C-terminus were associated with superior mechanical and/or chemical particle stability depending on the number of histidine moieties. A molecular modeling approach based on cryo-electron microscopy and biolayer interferometry revealed the underlying structural mechanism for the strengthening of the integrity of VLPs. Interactions triggering capsid stabilization occur on a highly conserved residue on the basis of HBcAg-monomers as well as on hexahistidine-peptides of adjacent monomers. This new stabilization mechanism appears to mimic an evolutionary conserved stabilization concept for hepadnavirus core proteins. CONCLUSIONS: These findings establish the genetically simply transferable C-terminal polyhistidine-peptide as a general stabilizing element for chimeric HBcAg-VLPs to increase their suitability.
Assuntos
Antígenos do Núcleo do Vírus da Hepatite B/metabolismo , Histidina/metabolismo , Oligopeptídeos/metabolismo , Proteínas Recombinantes/metabolismo , Vírion/metabolismo , Antígenos do Núcleo do Vírus da Hepatite B/isolamento & purificação , Modelos Moleculares , Controle de Qualidade , Proteínas Recombinantes/isolamento & purificação , Estresse Fisiológico , Vírion/ultraestruturaRESUMO
We present a web server for pair-wise alignment of membrane protein sequences, using the program AlignMe. The server makes available two operational modes of AlignMe: (i) sequence to sequence alignment, taking two sequences in fasta format as input, combining information about each sequence from multiple sources and producing a pair-wise alignment (PW mode); and (ii) alignment of two multiple sequence alignments to create family-averaged hydropathy profile alignments (HP mode). For the PW sequence alignment mode, four different optimized parameter sets are provided, each suited to pairs of sequences with a specific similarity level. These settings utilize different types of inputs: (position-specific) substitution matrices, secondary structure predictions and transmembrane propensities from transmembrane predictions or hydrophobicity scales. In the second (HP) mode, each input multiple sequence alignment is converted into a hydrophobicity profile averaged over the provided set of sequence homologs; the two profiles are then aligned. The HP mode enables qualitative comparison of transmembrane topologies (and therefore potentially of 3D folds) of two membrane proteins, which can be useful if the proteins have low sequence similarity. In summary, the AlignMe web server provides user-friendly access to a set of tools for analysis and comparison of membrane protein sequences. Access is available at http://www.bioinfo.mpg.de/AlignMe.
Assuntos
Proteínas de Membrana/química , Alinhamento de Sequência/métodos , Análise de Sequência de Proteína , Software , Interações Hidrofóbicas e Hidrofílicas , InternetRESUMO
The possible effects of mutations on stability and function of a protein can only be understood in the context of protein 3D structure. The MutationExplorer webserver maps sequence changes onto protein structures and allows users to study variation by inputting sequence changes. As the user enters variants, the 3D model evolves, and estimated changes in energy are highlighted. In addition to a basic per-residue input format, MutationExplorer can also upload an entire replacement sequence. Previously the purview of desktop applications, such an upload can back-mutate PDB structures to wildtype sequence in a single step. Another supported variation source is human single nucelotide polymorphisms (SNPs), genomic coordinates input in VCF format.
RESUMO
Motivation: Protein-protein interactions (PPIs) play an essential role in a great variety of cellular processes and are therefore of significant interest for the design of new therapeutic compounds as well as the identification of side effects due to unexpected binding. Here, we present ProteinPrompt, a webserver that uses machine learning algorithms to calculate specific, currently unknown PPIs. Our tool is designed to quickly and reliably predict contact propensities based on an input sequence in order to scan large sequence libraries for potential binding partners, with the goal to accelerate and assure the quality of the laborious process of drug target identification. Results: We collected and thoroughly filtered a comprehensive database of known binders from several sources, which is available as download. ProteinPrompt provides two complementary search methods of similar accuracy for comparison and consensus building. The default method is a random forest (RF) algorithm that uses the auto-correlations of seven amino acid scales. Alternatively, a graph neural network (GNN) implementation can be selected. Additionally, a consensus prediction is available. For each query sequence, potential binding partners are identified from a protein sequence database. The proteom of several organisms are available and can be searched for binders. To evaluate the predictive power of the algorithms, we prepared a test dataset that was rigorously filtered for redundancy. No sequence pairs similar to the ones used for training were included in this dataset. With this challenging dataset, the RF method achieved an accuracy rate of 0.88 and an area under the curve of 0.95. The GNN achieved an accuracy rate of 0.86 using the same dataset. Since the underlying learning approaches are unrelated, comparing the results of RF and GNNs reduces the likelihood of errors. The consensus reached an accuracy of 0.89. Availability and implementation: ProteinPrompt is available online at: http://proteinformatics.org/ProteinPrompt, where training and test data used to optimize the methods are also available. The server makes it possible to scan the human proteome for potential binding partners of an input sequence within minutes. For local offline usage, we furthermore created a ProteinPrompt Docker image which allows for batch submission: https://gitlab.hzdr.de/proteinprompt/ProteinPrompt. In conclusion, we offer a fast, accurate, easy-to-use online service for predicting binding partners from an input sequence.
RESUMO
The alignment of primary sequences is a fundamental step in the analysis of protein structure, function, and evolution, and in the generation of homology-based models. Integral membrane proteins pose a significant challenge for such sequence alignment approaches, because their evolutionary relationships can be very remote, and because a high content of hydrophobic amino acids reduces their complexity. Frequently, biochemical or biophysical data is available that informs the optimum alignment, for example, indicating specific positions that share common functional or structural roles. Currently, if those positions are not correctly matched by a standard pairwise sequence alignment procedure, the incorporation of such information into the alignment is typically addressed in an ad hoc manner, with manual adjustments. However, such modifications are problematic because they reduce the robustness and reproducibility of the aligned regions either side of the newly matched positions. Previous studies have introduced restraints as a means to impose the matching of positions during sequence alignments, originally in the context of genome assembly. Here we introduce position restraints, or "anchors" as a feature in our alignment tool AlignMe, providing an aid to pairwise global sequence alignment of alpha-helical membrane proteins. Applying this approach to realistic scenarios involving distantly-related and low complexity sequences, we illustrate how the addition of anchors can be used to modify alignments, while still maintaining the reproducibility and rigor of the rest of the alignment. Anchored alignments can be generated using the online version of AlignMe available at www.bioinfo.mpg.de/AlignMe/.
Assuntos
Proteínas de Membrana/genética , Análise de Sequência de Proteína/métodos , Algoritmos , Sequência de Aminoácidos , Animais , Drosophila/genética , Humanos , Interações Hidrofóbicas e Hidrofílicas , Camundongos , Reprodutibilidade dos Testes , Alinhamento de Sequência/métodos , Homologia de Sequência de Aminoácidos , SoftwareRESUMO
Raman spectroscopy has shown to be a promising method for the examination of biomedical samples. However, until now, its efficacy has not been established in clinical diagnostics. In this study, Raman spectroscopy's potential application in medical laboratories is evaluated for a large variety (38) of biomarkers. Given 234 serum samples from a cohort of patients with different stages of liver disease, we performed Raman spectroscopy at 780nm excitation wavelength. The Raman spectra were analyzed in combination with the results of routine diagnostics using specifically developed complex mathematical algorithms, including fluorescence filtering, frequency subset selection and several overfitting circumventing strategies, such as independent validation. With the results of this cohort, which were validated in 328 independent samples, a significant proof-of-concept study was completed. This study highlights the need to prevent overfitting and to use independent data for validation. The results reveal that Raman spectroscopy has high potential for use in medical laboratory diagnostics to simultaneously quantify multiple biomarkers.
Assuntos
Biomarcadores/sangue , Doença Hepática Terminal/sangue , Adulto , Idoso , Algoritmos , Doença Hepática Terminal/patologia , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Análise Espectral Raman/métodosRESUMO
X-ray crystal structures have revealed that numerous secondary transporter proteins originally categorized into different sequence families share similar structures, namely, the LeuT fold. The core of this fold consists of two units of five transmembrane helices, whose conformations have been proposed to exchange to form the two alternate states required for transport. That these two units are related implies that LeuT-like transporters evolved from gene-duplication and fusion events. Thus, the origins of this structural repeat may be relevant to the evolution of transport function. However, the lack of significant sequence similarity requires sensitive sequence search methods for analyzing their evolution. To this end, we developed a software application called AlignMe, which can use various types of input information, such as residue hydrophobicity, to perform pairwise alignments of sequences and/or of hydropathy profiles of (membrane) proteins. We used AlignMe to analyze the evolutionary relationships between repeats of the LeuT fold. In addition, we identified proteins from the so-called DedA family that potentially share a common ancestor with these repeats. DedA domains have been implicated in, e.g., selenite uptake; they are found widely distributed across all kingdoms of life; two or more DedA domains are typically found per genome, and some may adopt dual topologies. These results suggest that DedA proteins existed in ancient organisms and may function as dimers, as required for a would-be ancestor of the LeuT fold. In conclusion, we provide novel insights into the evolution of this important structural motif and thus potentially into the alternating-access mechanism of transport itself.
Assuntos
Proteínas de Bactérias/química , Proteínas de Membrana Transportadoras/química , Software , Cristalografia por Raios X , Interações Hidrofóbicas e Hidrofílicas , Estrutura Secundária de ProteínaRESUMO
The concept of hydrophobicity is critical to our understanding of the principles of membrane protein (MP) folding, structure, and function. In the last decades, several groups have derived hydrophobicity scales using both experimental and statistical methods that are optimized to mimic certain natural phenomena as closely as possible. The present work adds to this toolset the first knowledge-based scale that unifies the characteristics of both alpha-helical and beta-barrel multispan MPs. This unified hydrophobicity scale (UHS) distinguishes between amino acid preference for solution, transition, and trans-membrane states. The scale represents average hydrophobicity values of amino acids in folded proteins, irrespective of their secondary structure type. We furthermore present the first knowledge-based hydrophobicity scale for mammalian alpha-helical MPs (mammalian hydrophobicity scale--MHS). Both scales are particularly useful for computational protein structure elucidation, for example as input for machine learning techniques, such as secondary structure or trans-membrane span prediction, or as reference energies for protein structure prediction or protein design. The knowledge-based UHS shows a striking similarity to a recent experimental hydrophobicity scale introduced by Hessa and coworkers (Hessa T et al., Nature 2007;450:U1026-U1032). Convergence of two very different approaches onto similar hydrophobicity values consolidates the major differences between experimental and knowledge-based scales observed in earlier studies. Moreover, the UHS scale represents an accurate absolute free energy measure for folded, multispan MPs--a feature that is absent from many existing scales. The utility of the UHS was demonstrated by analyzing a series of diverse MPs. It is further shown that the UHS outperforms nine established hydrophobicity scales in predicting trans-membrane spans along the protein sequence. The accuracy of the present hydrophobicity scale profits from the doubling of the number of integral MPs in the PDB over the past four years. The UHS paves the way for an increased accuracy in the prediction of trans-membrane spans.
Assuntos
Aminoácidos/química , Interações Hidrofóbicas e Hidrofílicas , Proteínas de Membrana/química , Animais , Inteligência Artificial , Bases de Dados de Proteínas , Humanos , Mamíferos , Modelos Moleculares , Dobramento de Proteína , Estrutura Secundária de Proteína , Estrutura Terciária de Proteína , TermodinâmicaRESUMO
We investigated the additivity of the solvation free energy of amino acids in homogeneous helices of different length in water and in chloroform. Solvation free energies were computed by multiconfiguration thermodynamic integration involving extended molecular dynamics simulations and by applying the generalized-born surface area solvation model to static helix geometries. The investigation focused on homogeneous peptides composed of uncharged amino acids, where the backbone atoms are kept fixed in an ideal helical conformation. We found nonlinearity especially for short peptides, which does not allow a simple treatment of the interaction of amino acids with their surroundings. For homogeneous peptides longer than five residues, the results from both methods are in quite good agreement and solvation energies are to a good extent additive.
Assuntos
Peptídeos/química , Termodinâmica , Aminoácidos/química , Clorofórmio/química , Simulação por Computador , Modelos Químicos , Estrutura Secundária de Proteína , Solubilidade , Água/químicaRESUMO
Specific non-covalent interactions between transmembrane (TM) alpha-helices are important in a variety of biological processes. Experimental and computational studies have shown that van der Waals interactions play an important role in the tight packing between TM alpha-helices, although polar interactions can also be important in some instances. Based on the assumption that van der Waals interaction alone is sufficient for a meso-scale (residue-scale) description of the interaction between TM alpha-helices, we have designed a novel residue-scale scoring function for modeling structures of oligomers of TM alpha-helices. We first calculated atomistic van der Waals interaction energies between two amino acids, X and Y, of a pair of parallel alpha-helices, glycine-X-glycine and glycine-Y-glycine and compiled them according to three variables, the distance between the two C(alpha) atoms and the rotational angles of X and Y about their helical axes. Upon averaging over the rotational angles, we obtained one-dimensional interaction energy profiles that are functions of the distance between C(alpha) atoms only. Each of the interaction energy profiles was fitted with a generic fitting function of the distance between C(alpha) atoms, yielding analytical scoring functions for all possible amino acid pairs. For glycophorin A, neu/erbB-2, and phospholamban, lowest-energy conformations obtained through exhaustive scanning of the entire conformational space using the scoring functions were compatible with available experimental data.
Assuntos
Membrana Celular/metabolismo , Proteínas de Membrana/química , Proteínas de Membrana/metabolismo , Modelos Moleculares , Sequência de Aminoácidos , Proteínas de Ligação ao Cálcio/química , Proteínas de Ligação ao Cálcio/metabolismo , Glicoforinas/química , Glicoforinas/metabolismo , Dados de Sequência Molecular , Ressonância Magnética Nuclear Biomolecular , Estrutura Secundária de Proteína , Receptor ErbB-2/química , Receptor ErbB-2/metabolismoRESUMO
Few sequence alignment methods have been designed specifically for integral membrane proteins, even though these important proteins have distinct evolutionary and structural properties that might affect their alignments. Existing approaches typically consider membrane-related information either by using membrane-specific substitution matrices or by assigning distinct penalties for gap creation in transmembrane and non-transmembrane regions. Here, we ask whether favoring matching of predicted transmembrane segments within a standard dynamic programming algorithm can improve the accuracy of pairwise membrane protein sequence alignments. We tested various strategies using a specifically designed program called AlignMe. An updated set of homologous membrane protein structures, called HOMEP2, was used as a reference for optimizing the gap penalties. The best of the membrane-protein optimized approaches were then tested on an independent reference set of membrane protein sequence alignments from the BAliBASE collection. When secondary structure (S) matching was combined with evolutionary information (using a position-specific substitution matrix (P)), in an approach we called AlignMePS, the resultant pairwise alignments were typically among the most accurate over a broad range of sequence similarities when compared to available methods. Matching transmembrane predictions (T), in addition to evolutionary information, and secondary-structure predictions, in an approach called AlignMePST, generally reduces the accuracy of the alignments of closely-related proteins in the BAliBASE set relative to AlignMePS, but may be useful in cases of extremely distantly related proteins for which sequence information is less informative. The open source AlignMe code is available at https://sourceforge.net/projects/alignme/, and at http://www.forrestlab.org, along with an online server and the HOMEP2 data set.
Assuntos
Algoritmos , Proteínas/química , Alinhamento de Sequência/métodos , Software , Sequência de Aminoácidos , Interações Hidrofóbicas e Hidrofílicas , Dados de Sequência Molecular , Estrutura Secundária de Proteína , Estrutura Terciária de Proteína , Alinhamento de Sequência/estatística & dados numéricos , Homologia de Sequência de AminoácidosRESUMO
Computational de novo protein structure prediction is limited to small proteins of simple topology. The present work explores an approach to extend beyond the current limitations through assembling protein topologies from idealized α-helices and ß-strands. The algorithm performs a Monte Carlo Metropolis simulated annealing folding simulation. It optimizes a knowledge-based potential that analyzes radius of gyration, ß-strand pairing, secondary structure element (SSE) packing, amino acid pair distance, amino acid environment, contact order, secondary structure prediction agreement and loop closure. Discontinuation of the protein chain favors sampling of non-local contacts and thereby creation of complex protein topologies. The folding simulation is accelerated through exclusion of flexible loop regions further reducing the size of the conformational search space. The algorithm is benchmarked on 66 proteins with lengths between 83 and 293 amino acids. For 61 out of these proteins, the best SSE-only models obtained have an RMSD100 below 8.0 Å and recover more than 20% of the native contacts. The algorithm assembles protein topologies with up to 215 residues and a relative contact order of 0.46. The method is tailored to be used in conjunction with low-resolution or sparse experimental data sets which often provide restraints for regions of defined secondary structure.
Assuntos
Algoritmos , Biologia Computacional/métodos , Proteínas/química , Benchmarking , Humanos , Modelos Moleculares , Método de Monte Carlo , Estrutura Secundária de Proteína , Controle de QualidadeRESUMO
The topology of most experimentally determined protein domains is defined by the relative arrangement of secondary structure elements, i.e. α-helices and ß-strands, which make up 50-70% of the sequence. Pairing of ß-strands defines the topology of ß-sheets. The packing of side chains between α-helices and ß-sheets defines the majority of the protein core. Often, limited experimental datasets restrain the position of secondary structure elements while lacking detail with respect to loop or side chain conformation. At the same time the regular structure and reduced flexibility of secondary structure elements make these interactions more predictable when compared to flexible loops and side chains. To determine the topology of the protein in such settings, we introduce a tailored knowledge-based energy function that evaluates arrangement of secondary structure elements only. Based on the amino acid C(ß) atom coordinates within secondary structure elements, potentials for amino acid pair distance, amino acid environment, secondary structure element packing, ß-strand pairing, loop length, radius of gyration, contact order and secondary structure prediction agreement are defined. Separate penalty functions exclude conformations with clashes between amino acids or secondary structure elements and loops that cannot be closed. Each individual term discriminates for native-like protein structures. The composite potential significantly enriches for native-like models in three different databases of 10,000-12,000 protein models in 80-94% of the cases. The corresponding application, "BCL::ScoreProtein," is available at www.meilerlab.org.
Assuntos
Biologia Computacional/métodos , Modelos Moleculares , Proteínas/química , Algoritmos , Teorema de Bayes , Estrutura Secundária de Proteína , Estrutura Terciária de Proteína , Rotação , TermodinâmicaRESUMO
As new atomic structures of membrane proteins are resolved, they reveal increasingly complex transmembrane topologies, and highly irregular surfaces with crevices and pores. In many cases, specific interactions formed with the lipid membrane are functionally crucial, as is the overall lipid composition. Compounded with increasing protein size, these characteristics pose a challenge for the construction of simulation models of membrane proteins in lipid environments; clearly, that these models are sufficiently realistic bears upon the reliability of simulation-based studies of these systems. Here, we introduce GRIFFIN, which uses a versatile framework to automate and improve a widely-used membrane-embedding protocol. Initially, GRIFFIN carves out lipid and water molecules from a volume equivalent to that of the protein, so as to conserve the system density. In the subsequent optimization phase GRIFFIN adds an implicit grid-based protein force-field to a molecular dynamics simulation of the pre-carved membrane. In this force-field, atoms inside the implicit protein volume experience an outward force that will expel them from that volume, whereas those outside are subject to electrostatic and van-der-Waals interactions with the implicit protein. At each step of the simulation, these forces are updated by GRIFFIN and combined with the intermolecular forces of the explicit lipid-water system. This procedure enables the construction of realistic and reproducible starting configurations of the protein-membrane interface within a reasonable timeframe and with minimal intervention. GRIFFIN is a standalone tool designed to work alongside any existing molecular dynamics package, such as NAMD or GROMACS.