Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 72
Filtrar
Mais filtros

Bases de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Annu Rev Biochem ; 83: 553-84, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-24606139

RESUMO

Intrinsically disordered proteins (IDPs) and IDP regions fail to form a stable structure, yet they exhibit biological activities. Their mobile flexibility and structural instability are encoded by their amino acid sequences. They recognize proteins, nucleic acids, and other types of partners; they accelerate interactions and chemical reactions between bound partners; and they help accommodate posttranslational modifications, alternative splicing, protein fusions, and insertions or deletions. Overall, IDP-associated biological activities complement those of structured proteins. Recently, there has been an explosion of studies on IDP regions and their functions, yet the discovery and investigation of these proteins have a long, mostly ignored history. Along with recent discoveries, we present several early examples and the mechanisms by which IDPs contribute to function, which we hope will encourage comprehensive discussion of IDPs and IDP regions in biochemistry textbooks. Finally, we propose future directions for IDP research.


Assuntos
Proteínas Intrinsicamente Desordenadas/química , Animais , Calcineurina/química , Caseínas/química , Biologia Computacional , Espectroscopia de Ressonância de Spin Eletrônica , Fibrina/química , Fibrinogênio/química , Humanos , Espectroscopia de Ressonância Magnética , Modelos Moleculares , Fosvitina/química , Ligação Proteica , Mapeamento de Interação de Proteínas , Estrutura Secundária de Proteína , Estrutura Terciária de Proteína , Espalhamento de Radiação , Solubilidade , Tripsina/química , Tripsinogênio/química , Difração de Raios X
2.
Nucleic Acids Res ; 49(D1): D298-D308, 2021 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-33119734

RESUMO

We present DescribePROT, the database of predicted amino acid-level descriptors of structure and function of proteins. DescribePROT delivers a comprehensive collection of 13 complementary descriptors predicted using 10 popular and accurate algorithms for 83 complete proteomes that cover key model organisms. The current version includes 7.8 billion predictions for close to 600 million amino acids in 1.4 million proteins. The descriptors encompass sequence conservation, position specific scoring matrix, secondary structure, solvent accessibility, intrinsic disorder, disordered linkers, signal peptides, MoRFs and interactions with proteins, DNA and RNAs. Users can search DescribePROT by the amino acid sequence and the UniProt accession number and entry name. The pre-computed results are made available instantaneously. The predictions can be accesses via an interactive graphical interface that allows simultaneous analysis of multiple descriptors and can be also downloaded in structured formats at the protein, proteome and whole database scale. The putative annotations included by DescriPROT are useful for a broad range of studies, including: investigations of protein function, applied projects focusing on therapeutics and diseases, and in the development of predictors for other protein sequence descriptors. Future releases will expand the coverage of DescribePROT. DescribePROT can be accessed at http://biomine.cs.vcu.edu/servers/DESCRIBEPROT/.


Assuntos
Aminoácidos/química , Bases de Dados de Proteínas , Genoma , Proteínas/genética , Proteoma/genética , Software , Sequência de Aminoácidos , Aminoácidos/metabolismo , Animais , Archaea/genética , Archaea/metabolismo , Bactérias/genética , Bactérias/metabolismo , Sítios de Ligação , Sequência Conservada , Fungos/genética , Fungos/metabolismo , Humanos , Internet , Plantas/genética , Plantas/metabolismo , Células Procarióticas/metabolismo , Ligação Proteica , Estrutura Secundária de Proteína , Proteínas/química , Proteínas/classificação , Proteínas/metabolismo , Proteoma/química , Proteoma/metabolismo , Análise de Sequência de Proteína , Vírus/genética , Vírus/metabolismo
3.
Brief Bioinform ; 21(5): 1509-1522, 2020 09 25.
Artigo em Inglês | MEDLINE | ID: mdl-31616935

RESUMO

Experimental annotations of intrinsic disorder are available for 0.1% of 147 000 000 of currently sequenced proteins. Over 60 sequence-based disorder predictors were developed to help bridge this gap. Current benchmarks of these methods assess predictive performance on datasets of proteins; however, predictions are often interpreted for individual proteins. We demonstrate that the protein-level predictive performance varies substantially from the dataset-level benchmarks. Thus, we perform first-of-its-kind protein-level assessment for 13 popular disorder predictors using 6200 disorder-annotated proteins. We show that the protein-level distributions are substantially skewed toward high predictive quality while having long tails of poor predictions. Consequently, between 57% and 75% proteins secure higher predictive performance than the currently used dataset-level assessment suggests, but as many as 30% of proteins that are located in the long tails suffer low predictive performance. These proteins typically have relatively high amounts of disorder, in contrast to the mostly structured proteins that are predicted accurately by all 13 methods. Interestingly, each predictor provides the most accurate results for some number of proteins, while the best-performing at the dataset-level method is in fact the best for only about 30% of proteins. Moreover, the majority of proteins are predicted more accurately than the dataset-level performance of the most accurate tool by at least four disorder predictors. While these results suggests that disorder predictors outperform their current benchmark performance for the majority of proteins and that they complement each other, novel tools that accurately identify the hard-to-predict proteins and that make accurate predictions for these proteins are needed.


Assuntos
Proteínas Intrinsicamente Desordenadas/química , Algoritmos , Biologia Computacional/métodos , Cristalografia por Raios X , Bases de Dados de Proteínas , Conjuntos de Dados como Assunto , Ressonância Magnética Nuclear Biomolecular , Conformação Proteica , Análise de Sequência de Proteína/métodos
4.
Can J Physiol Pharmacol ; 100(4): 361-370, 2022 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-34695364

RESUMO

Sarco(endo)plasmic reticulum calcium (Ca2+) ATPase (SERCA) transports Ca2+ in muscle. Impaired SERCA activity may contribute to diabetic myopathy. Sirtuin (SIRT) 3 regulates muscle metabolism and function; however, it is unknown if SIRT3 regulates muscle SERCA activity or acetylation. We determined if SIRT3 overexpression enhances SERCA activity in mouse gastrocnemius muscle and if SIRT3 overexpression preserves gastrocnemius SERCA activity in a model of type 2 diabetes, induced by high fat - high sucrose (HFHS) feeding. We also determined if the acetylation status of SERCA proteins in mouse gastrocnemius is altered by SIRT3 overexpression or HFHS feeding. Wild-type (WT) and SIRT3 transgenic (SIRT3TG) mice, overexpressing SIRT3 in skeletal muscle, were fed a standard or HFHS diet for 4 months. SIRT3TG and WT mice developed obesity and glucose intolerance after 4 months of HFHS feeding. SERCA Vmax was higher in gastrocnemius of SIRT3TG mice compared with WT mice. HFHS-fed mice had lower SERCA1a protein levels and lower SERCA Vmax in their gastrocnemius than control-fed mice. The decrease in SERCA Vmax in gastrocnemius muscle due to HFHS feeding was attenuated by SIRT3 overexpression in HFHS-fed SIRT3TG mice. SERCA1a and SERCA2a acetylation in mouse gastrocnemius was not altered by genotype or diet. These findings suggest SIRT3 overexpression improves SERCA function in mouse skeletal muscle.


Assuntos
Diabetes Mellitus Tipo 2 , Músculo Esquelético/enzimologia , ATPases Transportadoras de Cálcio do Retículo Sarcoplasmático , Sirtuína 3 , Animais , Cálcio/metabolismo , Diabetes Mellitus Tipo 2/metabolismo , Estresse do Retículo Endoplasmático , Camundongos , Retículo Sarcoplasmático/enzimologia , ATPases Transportadoras de Cálcio do Retículo Sarcoplasmático/genética , ATPases Transportadoras de Cálcio do Retículo Sarcoplasmático/metabolismo , Sirtuína 3/genética , Sirtuína 3/metabolismo , Sacarose/metabolismo
5.
Cell Mol Life Sci ; 78(4): 1655-1688, 2021 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-32712910

RESUMO

The recently emerged coronavirus designated as SARS-CoV-2 (also known as 2019 novel coronavirus (2019-nCoV) or Wuhan coronavirus) is a causative agent of coronavirus disease 2019 (COVID-19), which is rapidly spreading throughout the world now. More than 1.21 million cases of SARS-CoV-2 infection and more than 67,000 COVID-19-associated mortalities have been reported worldwide till the writing of this article, and these numbers are increasing every passing hour. The World Health Organization (WHO) has declared the SARS-CoV-2 spread as a global public health emergency and admitted COVID-19 as a pandemic now. Multiple sequence alignment data correlated with the already published reports on SARS-CoV-2 evolution indicated that this virus is closely related to the bat severe acute respiratory syndrome-like coronavirus (bat SARS-like CoV) and the well-studied human SARS coronavirus (SARS-CoV). The disordered regions in viral proteins are associated with the viral infectivity and pathogenicity. Therefore, in this study, we have exploited a set of complementary computational approaches to examine the dark proteomes of SARS-CoV-2, bat SARS-like, and human SARS CoVs by analysing the prevalence of intrinsic disorder in their proteins. According to our findings, SARS-CoV-2 proteome contains very significant levels of structural order. In fact, except for nucleocapsid, Nsp8, and ORF6, the vast majority of SARS-CoV-2 proteins are mostly ordered proteins containing less intrinsically disordered protein regions (IDPRs). However, IDPRs found in SARS-CoV-2 proteins are functionally important. For example, cleavage sites in its replicase 1ab polyprotein are found to be highly disordered, and almost all SARS-CoV-2 proteins contains molecular recognition features (MoRFs), which are intrinsic disorder-based protein-protein interaction sites that are commonly utilized by proteins for interaction with specific partners. The results of our extensive investigation of the dark side of SARS-CoV-2 proteome will have important implications in understanding the structural and non-structural biology of SARS or SARS-like coronaviruses.


Assuntos
Betacoronavirus/química , Quirópteros/virologia , Infecções por Coronavirus/virologia , Proteínas Intrinsicamente Desordenadas/química , Proteoma/análise , Proteínas Virais/química , Animais , Proteínas de Ligação a DNA/química , Humanos , Modelos Moleculares , Ligação Proteica , Domínios e Motivos de Interação entre Proteínas , Motivos de Ligação ao RNA , SARS-CoV-2/química , Relação Estrutura-Atividade
6.
Cell Mol Life Sci ; 77(1): 149-160, 2020 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-31175370

RESUMO

Protein-coding nucleic acids exhibit composition and codon biases between sequences coding for intrinsically disordered regions (IDRs) and those coding for structured regions. IDRs are regions of proteins that are folding self-insufficient and which function without the prerequisite of folded structure. Several authors have investigated composition bias or codon selection in regions encoding for IDRs, primarily in Eukaryota, and concluded that elevated GC content is the result of the biased amino acid composition of IDRs. We substantively extend previous work by examining GC content in regions encoding IDRs, from 44 species in Eukaryota, Archaea, and Bacteria, spanning a wide range of GC content. We confirm that regions coding for IDRs show a significantly elevated GC content, even across all domains of life. Although this is largely attributable to the amino acid composition bias of IDRs, we show that this bias is independent of the overall GC content and, most importantly, we are the first to observe that GC content bias in IDRs is significantly different than expected from IDR amino acid composition alone. We empirically find compensatory codon selection that reduces the observed GC content bias in IDRs. This selection is dependent on the overall GC content of the organism. The codon selection bias manifests as use of infrequent, AT-rich codons in encoding IDRs. Further, we find these relationships to be independent of the intrinsic disorder prediction method used, and independent of estimated translation efficiency. These observations are consistent with the previous work, and we speculate on whether the observed biases are causal or symptomatic of other driving forces.


Assuntos
Códon/química , Proteínas Intrinsicamente Desordenadas/química , Animais , Composição de Bases , Códon/genética , Uso do Códon , Humanos , Proteínas Intrinsicamente Desordenadas/genética , Biossíntese de Proteínas , Conformação Proteica
7.
Bioinformatics ; 35(10): 1692-1700, 2019 05 15.
Artigo em Inglês | MEDLINE | ID: mdl-30329008

RESUMO

MOTIVATION: While putative intrinsic disorder is widely used, none of the predictors provides quality assessment (QA) scores. QA scores estimate the likelihood that predictions are correct at a residue level and have been applied in other bioinformatics areas. We recently reported that QA scores derived from putative disorder propensities perform relatively poorly for native disordered residues. Here we design and validate a general approach to construct QA predictors for disorder predictions. RESULTS: The QUARTER (QUality Assessment for pRotein inTrinsic disordEr pRedictions) toolbox of methods accommodates a diverse set of ten disorder predictors. It builds upon several innovative design elements including use and scaling of selected physicochemical properties of the input sequence, post-processing of disorder propensity scores, and a feature selection that optimizes the predictive models to a specific disorder predictor. We empirically establish that each one of these elements contributes to the overall predictive performance of our tool and that QUARTER's outputs significantly outperform QA scores derived from the outputs generated the disorder predictors. The best performing QA scores for a single disorder predictor identify 13% of residues that are predicted with 98% precision. QA scores computed by combining results of the ten disorder predictors cover 40% of residues with 95% precision. Case studies are used to show how to interpret the QA scores. QA scores based on the high precision combined predictions are applied to analyze disorder in the human proteome. AVAILABILITY AND IMPLEMENTATION: http://biomine.cs.vcu.edu/servers/QUARTER/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Biologia Computacional , Proteoma , Humanos
8.
Can J Physiol Pharmacol ; 98(2): 74-84, 2020 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-31815523

RESUMO

The heart is capable of responding to stressful situations by increasing muscle mass, which is broadly defined as cardiac hypertrophy. This phenomenon minimizes ventricular wall stress for the heart undergoing a greater than normal workload. At initial stages, cardiac hypertrophy is associated with normal or enhanced cardiac function and is considered to be adaptive or physiological; however, at later stages, if the stimulus is not removed, it is associated with contractile dysfunction and is termed as pathological cardiac hypertrophy. It is during physiological cardiac hypertrophy where the function of subcellular organelles, including the sarcolemma, sarcoplasmic reticulum, mitochondria, and myofibrils, may be upregulated, while pathological cardiac hypertrophy is associated with downregulation of these subcellular activities. The transition of physiological cardiac hypertrophy to pathological cardiac hypertrophy may be due to the reduction in blood supply to hypertrophied myocardium as a consequence of reduced capillary density. Oxidative stress, inflammatory processes, Ca2+-handling abnormalities, and apoptosis in cardiomyocytes are suggested to play a critical role in the depression of contractile function during the development of pathological hypertrophy.


Assuntos
Cardiomegalia/patologia , Cardiomegalia/fisiopatologia , Animais , Apoptose , Cálcio/metabolismo , Cardiomegalia/metabolismo , Citocinas/metabolismo , Humanos , Espaço Intracelular/metabolismo
9.
Nucleic Acids Res ; 45(D1): D219-D227, 2017 01 04.
Artigo em Inglês | MEDLINE | ID: mdl-27899601

RESUMO

The Database of Protein Disorder (DisProt, URL: www.disprot.org) has been significantly updated and upgraded since its last major renewal in 2007. The current release holds information on more than 800 entries of IDPs/IDRs, i.e. intrinsically disordered proteins or regions that exist and function without a well-defined three-dimensional structure. We have re-curated previous entries to purge DisProt from conflicting cases, and also upgraded the functional classification scheme to reflect continuous advance in the field in the past 10 years or so. We define IDPs as proteins that are disordered along their entire sequence, i.e. entirely lack structural elements, and IDRs as regions that are at least five consecutive residues without well-defined structure. We base our assessment of disorder strictly on experimental evidence, such as X-ray crystallography and nuclear magnetic resonance (primary techniques) and a broad range of other experimental approaches (secondary techniques). Confident and ambiguous annotations are highlighted separately. DisProt 7.0 presents classified knowledge regarding the experimental characterization and functional annotations of IDPs/IDRs, and is intended to provide an invaluable resource for the research community for a better understanding structural disorder and for developing better computational tools for studying disordered proteins.


Assuntos
Bases de Dados de Proteínas , Proteínas Intrinsicamente Desordenadas , Animais , Cristalografia por Raios X , Transferência Ressonante de Energia de Fluorescência , Previsões , Controle de Formulários e Registros , Humanos , Proteínas Intrinsicamente Desordenadas/classificação , Ressonância Magnética Nuclear Biomolecular , Conformação Proteica
10.
Proteomics ; 18(5-6): e1700193, 2018 03.
Artigo em Inglês | MEDLINE | ID: mdl-29068531

RESUMO

It is recognized now that various proteinaceous membrane-less organelles (PMLOs) are commonly found in cytoplasm, nucleus, and mitochondria of various eukaryotic cells (as well as in the chloroplasts of plant cells). Being different from the "traditional" membrane-encapsulated organelles, such as chloroplasts, endoplasmic reticulum, Golgi apparatus, lysosomes, mitochondria, nucleus, and vacuoles, PMLOs solve the cellular need to facilitate and regulate molecular interactions via reversible and controllable isolation of target molecules in specialized compartments. PMLOs possess liquid-like behavior and are believed to be formed as a result of biological liquid-liquid phase transitions (LLPTs, also known as liquid-liquid phase separation), where an intricate interplay between RNA and intrinsically disordered proteins (IDPs) or hybrid proteins containing ordered domains and intrinsically disordered protein regions (IDPRs) may play an important role. This review analyzes the prevalence of intrinsic disorder in proteins associated with various PMLOs found in human cells and considers some of the functional roles of IDPs/IDPRs in biogenesis of these organelles.


Assuntos
Proteínas Intrinsicamente Desordenadas/metabolismo , Organelas/metabolismo , Humanos , Proteínas Intrinsicamente Desordenadas/química , Modelos Moleculares , Transição de Fase , Conformação Proteica
11.
Semin Cell Dev Biol ; 37: 44-55, 2015 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-25307499

RESUMO

Intrinsically disordered proteins (IDPs) and IDP regions lack stable tertiary structure yet carry out numerous biological functions, especially those associated with signaling, transcription regulation, DNA condensation, cell division, and cellular differentiation. Both post-translational modifications (PTMs) and alternative splicing (AS) expand the functional repertoire of IDPs. Here we propose that an "IDP-based developmental toolkit," which is comprised of IDP regions, PTMs, especially multiple PTMs, within these IDP regions, and AS events within segments of pre-mRNA that code for these same IDP regions, allows functional diversification and environmental responsiveness for molecules that direct the development of complex metazoans.


Assuntos
Evolução Biológica , Células Eucarióticas/metabolismo , Proteínas Intrinsicamente Desordenadas/química , Proteínas Intrinsicamente Desordenadas/metabolismo , Células Procarióticas/metabolismo , Células Eucarióticas/química , Células Eucarióticas/citologia , Proteínas Intrinsicamente Desordenadas/genética , Células Procarióticas/química , Células Procarióticas/citologia , Dobramento de Proteína , Processamento de Proteína Pós-Traducional
12.
Cell Mol Life Sci ; 71(8): 1477-504, 2014 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-23942625

RESUMO

Intrinsic disorder (i.e., lack of a unique 3-D structure) is a common phenomenon, and many biologically active proteins are disordered as a whole, or contain long disordered regions. These intrinsically disordered proteins/regions constitute a significant part of all proteomes, and their functional repertoire is complementary to functions of ordered proteins. In fact, intrinsic disorder represents an important driving force for many specific functions. An illustrative example of such disorder-centric functional class is RNA-binding proteins. In this study, we present the results of comprehensive bioinformatics analyses of the abundance and roles of intrinsic disorder in 3,411 ribosomal proteins from 32 species. We show that many ribosomal proteins are intrinsically disordered or hybrid proteins that contain ordered and disordered domains. Predicted globular domains of many ribosomal proteins contain noticeable regions of intrinsic disorder. We also show that disorder in ribosomal proteins has different characteristics compared to other proteins that interact with RNA and DNA including overall abundance, evolutionary conservation, and involvement in protein-protein interactions. Furthermore, intrinsic disorder is not only abundant in the ribosomal proteins, but we demonstrate that it is absolutely necessary for their various functions.


Assuntos
Proteínas Intrinsicamente Desordenadas/genética , Proteínas Intrinsicamente Desordenadas/metabolismo , Modelos Moleculares , Conformação Proteica , Proteínas de Ligação a RNA/metabolismo , Proteínas Ribossômicas/metabolismo , Ribossomos/metabolismo , Aminoácidos/análise , Archaea/genética , Bactérias/genética , Biologia Computacional , Sequência Conservada/genética , Bases de Dados de Proteínas , Eucariotos/genética , Evolução Molecular , Estrutura Terciária de Proteína/genética , Proteínas de Ligação a RNA/genética , Proteínas Ribossômicas/genética , Especificidade da Espécie
13.
Adv Exp Med Biol ; 870: 1-34, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-26387098

RESUMO

From the 1970s to the present, regions of missing electron density in protein structures determined by X-ray diffraction and the characterization of the functions of these regions have suggested that not all protein regions depend on prior 3D structure to carry out function. Motivated by these observations, in early 1996 we began to use bioinformatics approaches to study these intrinsically disordered proteins (IDPs) and IDP regions. At just about the same time, several laboratory groups began to study a collection of IDPs and IDP regions using nuclear magnetic resonance. The temporal overlap of the bioinformatics and NMR studies played a significant role in the development of our understanding of IDPs. Here the goal is to recount some of this history and to project from this experience possible directions for future work.


Assuntos
Biologia Computacional , Proteínas Intrinsicamente Desordenadas/química , Ressonância Magnética Nuclear Biomolecular/métodos , Sequência de Aminoácidos , Dados de Sequência Molecular , Conformação Proteica
14.
BMC Bioinformatics ; 15 Suppl 17: S4, 2014.
Artigo em Inglês | MEDLINE | ID: mdl-25559583

RESUMO

BACKGROUND: The earliest whole protein order/disorder predictor (Uversky et al., Proteins, 41: 415-427 (2000)), herein called the charge-hydropathy (C-H) plot, was originally developed using the Kyte-Doolittle (1982) hydropathy scale (Kyte & Doolittle., J. Mol. Biol, 157: 105-132(1982)). Here the goal is to determine whether the performance of the C-H plot in separating structured and disordered proteins can be improved by using an alternative hydropathy scale. RESULTS: Using the performance of the CH-plot as the metric, we compared 19 alternative hydropathy scales, with the finding that the Guy (1985) hydropathy scale (Guy, Biophys. J, 47:61-70(1985)) was the best of the tested hydropathy scales for separating large collections structured proteins and intrinsically disordered proteins (IDPs) on the C-H plot. Next, we developed a new scale, named IDP-Hydropathy, which further improves the discrimination between structured proteins and IDPs. Applying the C-H plot to a dataset containing 109 IDPs and 563 non-homologous fully structured proteins, the Kyte-Doolittle (1982) hydropathy scale, the Guy (1985) hydropathy scale, and the IDP-Hydropathy scale gave balanced two-state classification accuracies of 79%, 84%, and 90%, respectively, indicating a very substantial overall improvement is obtained by using different hydropathy scales. A correlation study shows that IDP-Hydropathy is strongly correlated with other hydropathy scales, thus suggesting that IDP-Hydropathy probably has only minor contributions from amino acid properties other than hydropathy. CONCLUSION: We suggest that IDP-Hydropathy would likely be the best scale to use for any type of algorithm developed to predict protein disorder.


Assuntos
Aminoácidos/química , Bases de Dados de Proteínas , Proteínas Intrinsicamente Desordenadas/química , Proteínas/química , Proteínas/classificação , Algoritmos , Animais , Humanos , Interações Hidrofóbicas e Hidrofílicas
15.
Biochim Biophys Acta ; 1834(2): 487-98, 2013 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-23232152

RESUMO

Intrinsically disordered proteins (IDPs) and proteins with long disordered regions are highly abundant in various proteomes. Despite their lack of well-defined ordered structure, these proteins and regions are frequently involved in crucial biological processes. Although in recent years these proteins have attracted the attention of many researchers, IDPs represent a significant challenge for structural characterization since these proteins can impact many of the processes in the structure determination pipeline. Here we investigate the effects of IDPs on the structure determination process and the utility of disorder prediction in selecting and improving proteins for structural characterization. Examination of the extent of intrinsic disorder in existing crystal structures found that relatively few protein crystal structures contain extensive regions of intrinsic disorder. Although intrinsic disorder is not the only cause of crystallization failures and many structured proteins cannot be crystallized, filtering out highly disordered proteins from structure-determination target lists is still likely to be cost effective. Therefore it is desirable to avoid highly disordered proteins from structure-determination target lists and we show that disorder prediction can be applied effectively to enrich structure determination pipelines with proteins more likely to yield crystal structures. For structural investigation of specific proteins, disorder prediction can be used to improve targets for structure determination. Finally, a framework for considering intrinsic disorder in the structure determination pipeline is proposed.


Assuntos
Bases de Dados de Proteínas , Proteínas/química , Proteínas/genética , Proteômica , Cristalografia por Raios X , Estrutura Terciária de Proteína
17.
Trends Biochem Sci ; 34(2): 53-9, 2009 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-19062293

RESUMO

Natural selection shapes the sequence, structure and biophysical properties of proteins to fit their environment. We hypothesize that highly thermostable proteins and viral proteins represent two opposing adaptation strategies. Thermostable proteins are highly compact and possess well-packed hydrophobic cores and intensely charged surfaces. By contrast, viral proteins, and RNA viral proteins in particular, display a high occurrence of disordered segments and loosely packed cores. These features might endow viral proteins with increased structural flexibility and effective ways to interact with the components of the host. They could also be related to high adaptability levels and mutation rates observed in viruses, thus, representing a unique strategy for buffering the deleterious effects of mutations, such that those that have little (interactions), have little to lose.


Assuntos
Proteínas Virais/química , Interações Hidrofóbicas e Hidrofílicas , Modelos Moleculares , Mutação , Conformação Proteica , Termodinâmica , Proteínas Virais/metabolismo
18.
Bioinformatics ; 28(12): i75-83, 2012 Jun 15.
Artigo em Inglês | MEDLINE | ID: mdl-22689782

RESUMO

MOTIVATION: Molecular recognition features (MoRFs) are short binding regions located within longer intrinsically disordered regions that bind to protein partners via disorder-to-order transitions. MoRFs are implicated in important processes including signaling and regulation. However, only a limited number of experimentally validated MoRFs is known, which motivates development of computational methods that predict MoRFs from protein chains. RESULTS: We introduce a new MoRF predictor, MoRFpred, which identifies all MoRF types (α, ß, coil and complex). We develop a comprehensive dataset of annotated MoRFs to build and empirically compare our method. MoRFpred utilizes a novel design in which annotations generated by sequence alignment are fused with predictions generated by a Support Vector Machine (SVM), which uses a custom designed set of sequence-derived features. The features provide information about evolutionary profiles, selected physiochemical properties of amino acids, and predicted disorder, solvent accessibility and B-factors. Empirical evaluation on several datasets shows that MoRFpred outperforms related methods: α-MoRF-Pred that predicts α-MoRFs and ANCHOR which finds disordered regions that become ordered when bound to a globular partner. We show that our predicted (new) MoRF regions have non-random sequence similarity with native MoRFs. We use this observation along with the fact that predictions with higher probability are more accurate to identify putative MoRF regions. We also identify a few sequence-derived hallmarks of MoRFs. They are characterized by dips in the disorder predictions and higher hydrophobicity and stability when compared to adjacent (in the chain) residues. AVAILABILITY: http://biomine.ece.ualberta.ca/MoRFpred/; http://biomine.ece.ualberta.ca/MoRFpred/Supplement.pdf.


Assuntos
Biologia Computacional/métodos , Proteínas/análise , Alinhamento de Sequência , Aminoácidos , Sítios de Ligação , Interações Hidrofóbicas e Hidrofílicas , Anotação de Sequência Molecular , Estrutura Secundária de Proteína , Máquina de Vetores de Suporte
19.
PLoS Comput Biol ; 8(10): e1002709, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-23055912

RESUMO

The effects of disease mutations on protein structure and function have been extensively investigated, and many predictors of the functional impact of single amino acid substitutions are publicly available. The majority of these predictors are based on protein structure and evolutionary conservation, following the assumption that disease mutations predominantly affect folded and conserved protein regions. However, the prevalence of the intrinsically disordered proteins (IDPs) and regions (IDRs) in the human proteome together with their lack of fixed structure and low sequence conservation raise a question about the impact of disease mutations in IDRs. Here, we investigate annotated missense disease mutations and show that 21.7% of them are located within such intrinsically disordered regions. We further demonstrate that 20% of disease mutations in IDRs cause local disorder-to-order transitions, which represents a 1.7-2.7 fold increase compared to annotated polymorphisms and neutral evolutionary substitutions, respectively. Secondary structure predictions show elevated rates of transition from helices and strands into loops and vice versa in the disease mutations dataset. Disease disorder-to-order mutations also influence predicted molecular recognition features (MoRFs) more often than the control mutations. The repertoire of disorder-to-order transition mutations is limited, with five most frequent mutations (R→W, R→C, E→K, R→H, R→Q) collectively accounting for 44% of all deleterious disorder-to-order transitions. As a proof of concept, we performed accelerated molecular dynamics simulations on a deleterious disorder-to-order transition mutation of tumor protein p63 and, in agreement with our predictions, observed an increased α-helical propensity of the region harboring the mutation. Our findings highlight the importance of mutations in IDRs and refine the traditional structure-centric view of disease mutations. The results of this study offer a new perspective on the role of mutations in disease, with implications for improving predictors of the functional impact of missense mutations.


Assuntos
Doença/genética , Modelos Genéticos , Mutação , Proteínas/genética , Arginina/genética , Análise por Conglomerados , Biologia Computacional , Humanos , Simulação de Dinâmica Molecular , Conformação Proteica , Proteínas/química , Proteínas/metabolismo , Análise de Sequência de DNA , Fatores de Transcrição , Proteínas Supressoras de Tumor
20.
J Struct Biol ; 180(1): 201-15, 2012 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-22651963

RESUMO

The identification of intrinsically disordered proteins (IDPs) among the targets that fail to form satisfactory crystal structures in the Protein Structure Initiative represents a key to reducing the costs and time for determining three-dimensional structures of proteins. To help in this endeavor, several Protein Structure Initiative Centers were asked to send samples of both crystallizable proteins and proteins that failed to crystallize. The abundance of intrinsic disorder in these proteins was evaluated via computational analysis using predictors of natural disordered regions (PONDR®) and the potential cleavage sites and corresponding fragments were determined. Then, the target proteins were analyzed for intrinsic disorder by their resistance to limited proteolysis. The rates of tryptic digestion of sample target proteins were compared to those of lysozyme/myoglobin, apomyoglobin, and α-casein as standards of ordered, partially disordered and completely disordered proteins, respectively. At the next stage, the protein samples were subjected to both far-UV and near-UV circular dichroism (CD) analysis. For most of the samples, a good agreement between CD data, predictions of disorder and the rates of limited tryptic digestion was established. Further experimentation is being performed on a smaller subset of these samples in order to obtain more detailed information on the ordered/disordered nature of the proteins.


Assuntos
Modelos Moleculares , Proteínas/química , Animais , Calibragem , Dicroísmo Circular/normas , Biologia Computacional , Cristalização , Cristalografia por Raios X , Bases de Dados de Proteínas , Eletroforese em Gel de Poliacrilamida , Humanos , Bases de Conhecimento , Estrutura Secundária de Proteína , Estrutura Terciária de Proteína , Proteólise , Padrões de Referência , Homologia Estrutural de Proteína , Tripsina/química
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA