RESUMEN
DisProt is the primary repository of Intrinsically Disordered Proteins (IDPs). This database is manually curated and the annotations there have strong experimental support. Currently, DisProt contains a relatively small number of proteins highlighting the importance of transferring annotations regarding verified disorder state and corresponding functions to homologous proteins in other species. In such a way, providing them with highly valuable information to better understand their biological roles. While the principles and practicalities of homology transfer are well-established for globular proteins, these are largely lacking for disordered proteins. We used DisProt to evaluate the transferability of the annotation terms to orthologous proteins. For each protein, we looked for their orthologs, with the assumption that they will have a similar function. Then, for each protein and their orthologs, we made multiple sequence alignments (MSAs). Disordered sequences are fast evolving and can be hard to align, therefore, we implemented alignment quality control steps ensuring robust alignments before mapping the annotations. We have designed a pipeline to obtain good-quality MSAs and to transfer annotations from any protein to their orthologs. Applying the pipeline to DisProt proteins, from the 1731 entries with 5623 annotations, we can reach 97,555 orthologs and transfer a total of 301,190 terms by homology. We also provide a web server for consulting the results of DisProt proteins and execute the pipeline for any other protein. The server Homology Transfer IDP (HoTIDP) is accessible at http://hotidp.leloir.org.ar.
Asunto(s)
Proteínas Intrínsecamente Desordenadas , Proteínas Intrínsecamente Desordenadas/genética , Proteínas Intrínsecamente Desordenadas/metabolismo , Alineación de Secuencia , Bases de Datos FactualesRESUMEN
Glutaredoxins are small proteins that share a common well-conserved thioredoxin-fold and participate in a wide variety of biological processes. Among them, class II Grx are redox-inactive proteins involved in iron-sulfur (Fe-S) metabolism. In the present work, we report different structural and dynamics aspects of 1CGrx1 from the pathogenic parasite Trypanosoma brucei that differentiate it from other orthologues by the presence of a parasite-specific unstructured N-terminal extension whose role has not been fully elucidated yet. Previous nuclear magnetic resonance (NMR) studies revealed significant differences with respect to the mutant lacking the disordered tail. Herein, we have performed atomistic molecular dynamics simulations that, complementary to NMR studies, confirm the intrinsically disordered nature of the N-terminal extension. Moreover, we confirm the main role of these residues in modulating the conformational dynamics of the glutathione-binding pocket. We observe that the N-terminal extension modifies the ligand cavity stiffening it by specific interactions that ultimately modulate its intrinsic flexibility, which may modify its role in the storage and/or transfer of preformed iron-sulfur clusters. These unique structural and dynamics aspects of Trypanosoma brucei 1CGrx1 differentiate it from other orthologues and could have functional relevance. In this way, our results encourage the study of other similar protein folding families with intrinsically disordered regions whose functional roles are still unrevealed and the screening of potential 1CGrx1 inhibitors as antitrypanosomal drug candidates.
Asunto(s)
Proteínas Intrínsecamente Desordenadas , Proteínas Hierro-Azufre , Trypanosoma brucei brucei , Glutarredoxinas/genética , Glutarredoxinas/metabolismo , Humanos , Ligandos , Unión Proteica , Pliegue de Proteína , Trypanosoma brucei brucei/metabolismoRESUMEN
Every biologist knows that the word protein describes a group of macromolecules essential to sustain life on Earth. As biologists, we are invariably trained under a protein paradigm established since the early twentieth century. However, in recent years, the term protein unveiled itself as an euphemism to describe the overwhelming heterogeneity of these compounds. Most of our current studies are targeted on carefully selected subsets of proteins, but we tend to think and write about these as representative of the whole population. Here we discuss how seeking for universal definitions and general rules in any arbitrarily segmented study would be misleading about the conclusions. Of course, it is not our purpose to discourage the use of the word protein. Instead, we suggest to embrace the extended universe of proteins to reach a deeper understanding of their full potential, realizing that the term encompasses a group of molecules very heterogeneous in terms of size, shape, chemistry and functions, i.e. the term protein no longer means what it used to.
RESUMEN
Intrinsically disordered proteins (IDPs) lack stable tertiary structure under physiological conditions. The unique composition and complex dynamical behaviour of IDPs make them a challenge for structural biology and molecular evolution studies. Using NMR ensembles, we found that IDPs evolve under a strong site-specific evolutionary rate heterogeneity, mainly originated by different constraints derived from their inter-residue contacts. Evolutionary rate profiles correlate with the experimentally observed conformational diversity of the protein, allowing the description of different conformational patterns possibly related to their structure-function relationships. The correlation between evolutionary rates and contact information improves when structural information is taken not from any individual conformer or the whole ensemble, but from combining a limited number of conformers. Our results suggest that residue contacts in disordered regions constrain evolutionary rates to conserve the dynamic behaviour of the ensemble and that evolutionary rates can be used as a proxy for the conformational diversity of IDPs.
Asunto(s)
Proteínas Intrínsecamente Desordenadas/química , Modelos Moleculares , Conformación Proteica , Aminoácidos , Sitios de Unión , Evolución Molecular , Humanos , Proteínas Intrínsecamente Desordenadas/genética , Proteínas Intrínsecamente Desordenadas/metabolismo , Resonancia Magnética Nuclear Biomolecular , Unión Proteica , Relación Estructura-ActividadRESUMEN
Intrinsically disordered proteins/regions (IDPs/IDRs) are crucial components of the cell, they are highly abundant and participate ubiquitously in a wide range of biological functions, such as regulatory processes and cell signaling. Many of their important functions rely on protein interactions, by which they trigger or modulate different pathways. Sequence covariation, a powerful tool for protein contact prediction, has been applied successfully to predict protein structure and to identify protein-protein interactions mostly of globular proteins. IDPs/IDRs also mediate a plethora of protein-protein interactions, highlighting the importance of addressing sequence covariation-based inter-protein contact prediction of this class of proteins. Despite their importance, a systematic approach to analyze the covariation phenomena of intrinsically disordered proteins and their complexes is still missing. Here we carry out a comprehensive critical assessment of coevolution-based contact prediction in IDP/IDR complexes and detail the challenges and possible limitations that emerge from their analysis. We found that the coevolutionary signal is faint in most of the complexes of disordered proteins but positively correlates with the interface size and binding affinity between partners. In addition, we discuss the state-of-art methodology by biological interpretation of the results, formulate evaluation guidelines and suggest future directions of development to the field.
Asunto(s)
Proteínas Intrínsecamente Desordenadas/química , Proteínas Intrínsecamente Desordenadas/fisiología , Secuencia de Aminoácidos , Fenómenos Bioquímicos , Unión Proteica , Conformación Proteica , Pliegue de Proteína , Dominios y Motivos de Interacción de Proteínas , Transducción de Señal/genética , Transducción de Señal/fisiologíaRESUMEN
Proteins in their native states can be represented as ensembles of conformers in dynamical equilibrium. Thermal fluctuations are responsible for transitions between these conformers. Normal-modes analysis (NMA) using elastic network models (ENMs) provides an efficient procedure to explore global dynamics of proteins commonly associated with conformational transitions. In the present work, we present an iterative approach to explore protein conformational spaces by introducing structural distortions according to their equilibrium dynamics at room temperature. The approach can be used either to perform unbiased explorations of conformational space or to explore guided pathways connecting two different conformations, e.g., apo and holo forms. In order to test its performance, four proteins with different magnitudes of structural distortions upon ligand binding have been tested. In all cases, the conformational selection model has been confirmed and the conformational space between apo and holo forms has been encompassed. Different strategies have been tested that impact on the efficiency either to achieve a desired conformational change or to achieve a balanced exploration of the protein conformational multiplicity.
Asunto(s)
Proteínas , Conformación ProteicaRESUMEN
According to the generalized conformational selection model, ligand binding involves the co-existence of at least two conformers with different ligand-affinities in a dynamical equilibrium. Conformational transitions between them should be guaranteed by intramolecular vibrational dynamics associated to each conformation. These motions are, therefore, related to the biological function of a protein. Positions whose mutations are found to alter these vibrations the most can be defined as key positions, that is, dynamically important residues that mediate the ligand-binding conformational change. In a previous study, we have shown that these positions are evolutionarily conserved. They correspond to buried aliphatic residues mostly localized in regular structured regions of the protein like ß-sheets and α-helices. In the present paper, we perform a network analysis of these key positions for a large dataset of paired protein structures in the ligand-free and ligand-bound form. We observe that networks of interactions between these key positions present larger and more integrated networks with faster transmission of the information. Besides, networks of residues result that are robust to conformational changes. Our results reveal that the conformational diversity of proteins seems to be guaranteed by a network of strongly interconnected key positions rather than individual residues.
Asunto(s)
Proteínas/química , Proteínas/metabolismo , Ligandos , Modelos Moleculares , Unión Proteica , Conformación Proteica en Hélice alfa , Conformación Proteica en Lámina beta , VibraciónRESUMEN
The conformations accessible to proteins are determined by the inter-residue interactions between amino acid residues. During evolution, structural constraints that are required for protein function providing biologically relevant information can exist. Here, we studied the proportion of sites evolving under structural constraints in two very different types of ensembles, those coming from ordered and disordered proteins. Using a structurally constrained model of protein evolution, we found that both types of ensembles show comparable, near 40%, number of positions evolving under structural constraints. Among these sites, ~68% are in disordered regions and ~57% of them show long-range inter-residue contacts. Also, we found that disordered ensembles are redundant in reference to their structurally constrained evolutionary information and could be described on average with ~11 conformers. Despite the different complexity of the studied ensembles and proteins, the similar constraints reveal a comparable level of selective pressure to maintain their biological functions. These results highlight the importance of the evolutionary information to recover meaningful biological information to further characterize conformational ensembles.
Asunto(s)
Proteínas Intrínsecamente Desordenadas/química , Proteínas/química , Animales , Evolución Molecular , Humanos , Simulación de Dinámica Molecular , Conformación ProteicaRESUMEN
Protein motions are a key feature to understand biological function. Recently, a large-scale analysis of protein conformational diversity showed a positively skewed distribution with a peak at 0.5 Å C-alpha root-mean-square-deviation (RMSD). To understand this distribution in terms of structure-function relationships, we studied a well curated and large dataset of ~5,000 proteins with experimentally determined conformational diversity. We searched for global behaviour patterns studying how structure-based features change among the available conformer population for each protein. This procedure allowed us to describe the RMSD distribution in terms of three main protein classes sharing given properties. The largest of these protein subsets (~60%), which we call "rigid" (average RMSD = 0.83 Å), has no disordered regions, shows low conformational diversity, the largest tunnels and smaller and buried cavities. The two additional subsets contain disordered regions, but with differential sequence composition and behaviour. Partially disordered proteins have on average 67% of their conformers with disordered regions, average RMSD = 1.1 Å, the highest number of hinges and the longest disordered regions. In contrast, malleable proteins have on average only 25% of disordered conformers and average RMSD = 1.3 Å, flexible cavities affected in size by the presence of disordered regions and show the highest diversity of cognate ligands. Proteins in each set are mostly non-homologous to each other, share no given fold class, nor functional similarity but do share features derived from their conformer population. These shared features could represent conformational mechanisms related with biological functions.
Asunto(s)
Modelos Químicos , Modelos Estadísticos , Simulación de Dinámica Molecular , Conformación Proteica , Proteínas/química , Proteínas/ultraestructura , Relación Estructura-ActividadRESUMEN
Structural differences between conformers sustain protein biological function. Here, we studied in a large dataset of 745 intrinsically disordered proteins, how ordered-disordered transitions modulate structural differences between conformers as derived from crystallographic data. We found that almost 50% of the proteins studied show no transitions and have low conformational diversity while the rest show transitions and a higher conformational diversity. In this last subset, 60% of the proteins become more ordered after ligand binding, while 40% more disordered. As protein conformational diversity is inherently connected with protein function our analysis suggests differences in structure-function relationships related to order-disorder transitions.
Asunto(s)
Bases de Datos de Proteínas , Proteínas Intrínsecamente Desordenadas/química , Proteínas Intrínsecamente Desordenadas/genética , Conformación ProteicaRESUMEN
The N-terminal stretch of human frataxin (hFXN) intermediate (residues 42-80) is not conserved throughout evolution and, under defined experimental conditions, behaves as a random-coil. Overexpression of hFXN56-210 in Escherichia coli yields a multimer, whereas the mature form of hFXN (hFXN81-210) is monomeric. Thus, cumulative experimental evidence points to the N-terminal moiety as an essential element for the assembly of a high molecular weight oligomer. The secondary structure propensity of peptide 56-81, the moiety putatively responsible for promoting protein-protein interactions, was also studied. Depending on the environment (TFE or SDS), this peptide adopts α-helical or ß-strand structure. In this context, we explored the conformation and stability of hFXN56-210. The biophysical characterization by fluorescence, CD and SEC-FPLC shows that subunits are well folded, sharing similar stability to hFXN90-210. However, controlled proteolysis indicates that the N-terminal stretch is labile in the context of the multimer, whereas the FXN domain (residues 81-210) remains strongly resistant. In addition, guanidine hydrochloride at low concentration disrupts intermolecular interactions, shifting the ensemble toward the monomeric form. The conformational plasticity of the N-terminal tail might impart on hFXN the ability to act as a recognition signal as well as an oligomerization trigger. Understanding the fine-tuning of these activities and their resulting balance will bear direct relevance for ultimately comprehending hFXN function.