Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 42
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
Nat Methods ; 20(9): 1291-1303, 2023 09.
Artigo em Inglês | MEDLINE | ID: mdl-37400558

RESUMO

An unambiguous description of an experiment, and the subsequent biological observation, is vital for accurate data interpretation. Minimum information guidelines define the fundamental complement of data that can support an unambiguous conclusion based on experimental observations. We present the Minimum Information About Disorder Experiments (MIADE) guidelines to define the parameters required for the wider scientific community to understand the findings of an experiment studying the structural properties of intrinsically disordered regions (IDRs). MIADE guidelines provide recommendations for data producers to describe the results of their experiments at source, for curators to annotate experimental data to community resources and for database developers maintaining community resources to disseminate the data. The MIADE guidelines will improve the interpretability of experimental results for data consumers, facilitate direct data submission, simplify data curation, improve data exchange among repositories and standardize the dissemination of the key metadata on an IDR experiment by IDR data sources.


Assuntos
Proteínas Intrinsicamente Desordenadas , Proteínas Intrinsicamente Desordenadas/química , Conformação Proteica
2.
Nucleic Acids Res ; 52(D1): D442-D455, 2024 Jan 05.
Artigo em Inglês | MEDLINE | ID: mdl-37962385

RESUMO

Short Linear Motifs (SLiMs) are the smallest structural and functional components of modular eukaryotic proteins. They are also the most abundant, especially when considering post-translational modifications. As well as being found throughout the cell as part of regulatory processes, SLiMs are extensively mimicked by intracellular pathogens. At the heart of the Eukaryotic Linear Motif (ELM) Resource is a representative (not comprehensive) database. The ELM entries are created by a growing community of skilled annotators and provide an introduction to linear motif functionality for biomedical researchers. The 2024 ELM update includes 346 novel motif instances in areas ranging from innate immunity to both protein and RNA degradation systems. In total, 39 classes of newly annotated motifs have been added, and another 17 existing entries have been updated in the database. The 2024 ELM release now includes 356 motif classes incorporating 4283 individual motif instances manually curated from 4274 scientific publications and including >700 links to experimentally determined 3D structures. In a recent development, the InterPro protein module resource now also includes ELM data. ELM is available at: http://elm.eu.org.


Assuntos
Motivos de Aminoácidos , Bases de Dados de Proteínas , Eucariotos , Motivos de Aminoácidos/genética , Processamento de Proteína Pós-Traducional , Proteínas/genética , Proteínas/metabolismo , Eucariotos/genética , Internet
3.
Nucleic Acids Res ; 50(D1): D1508-D1514, 2022 01 07.
Artigo em Inglês | MEDLINE | ID: mdl-34643700

RESUMO

Stimulated by the growing interest in the role of dNTP pools in physiological and malignant processes, we established dNTPpoolDB, the database that offers access to quantitative data on dNTP pools from a wide range of species, experimental and developmental conditions (https://dntppool.org/). The database includes measured absolute or relative cellular levels of the four canonical building blocks of DNA and of exotic dNTPs, as well. In addition to the measured quantity, dNTPpoolDB contains ample information on sample source, dNTP quantitation methods and experimental conditions including any treatments and genetic manipulations. Functions such as the advanced search offering multiple choices from custom-built controlled vocabularies in 15 categories in parallel, the pairwise comparison of any chosen pools, and control-treatment correlations provide users with the possibility to quickly recognize and graphically analyse changes in the dNTP pools in function of a chosen parameter. Unbalanced dNTP pools, as well as the balanced accumulation or depletion of all four dNTPs result in genomic instability. Accordingly, key roles of dNTP pool homeostasis have been demonstrated in cancer progression, development, ageing and viral infections among others. dNTPpoolDB is designated to promote research in these fields and fills a longstanding gap in genome metabolism research.


Assuntos
Bases de Dados Genéticas , Desoxirribonucleotídeos/classificação , Instabilidade Genômica/genética , Neoplasias/genética , Replicação do DNA/genética , Curadoria de Dados , Desoxirribonucleotídeos/genética , Humanos , Neoplasias/classificação , Neoplasias/patologia
4.
Nucleic Acids Res ; 50(D1): D497-D508, 2022 01 07.
Artigo em Inglês | MEDLINE | ID: mdl-34718738

RESUMO

Almost twenty years after its initial release, the Eukaryotic Linear Motif (ELM) resource remains an invaluable source of information for the study of motif-mediated protein-protein interactions. ELM provides a comprehensive, regularly updated and well-organised repository of manually curated, experimentally validated short linear motifs (SLiMs). An increasing number of SLiM-mediated interactions are discovered each year and keeping the resource up-to-date continues to be a great challenge. In the current update, 30 novel motif classes have been added and five existing classes have undergone major revisions. The update includes 411 new motif instances mostly focused on cell-cycle regulation, control of the actin cytoskeleton, membrane remodelling and vesicle trafficking pathways, liquid-liquid phase separation and integrin signalling. Many of the newly annotated motif-mediated interactions are targets of pathogenic motif mimicry by viral, bacterial or eukaryotic pathogens, providing invaluable insights into the molecular mechanisms underlying infectious diseases. The current ELM release includes 317 motif classes incorporating 3934 individual motif instances manually curated from 3867 scientific publications. ELM is available at: http://elm.eu.org.


Assuntos
Doenças Transmissíveis/genética , Bases de Dados de Proteínas , Interações Hospedeiro-Patógeno/genética , Domínios e Motivos de Interação entre Proteínas , Software , Citoesqueleto de Actina/química , Citoesqueleto de Actina/metabolismo , Animais , Sítios de Ligação , Ciclo Celular/genética , Membrana Celular/química , Membrana Celular/metabolismo , Doenças Transmissíveis/metabolismo , Doenças Transmissíveis/virologia , Ciclinas/química , Ciclinas/genética , Ciclinas/metabolismo , Células Eucarióticas/citologia , Células Eucarióticas/metabolismo , Células Eucarióticas/virologia , Regulação da Expressão Gênica , Humanos , Integrinas/química , Integrinas/genética , Integrinas/metabolismo , Camundongos , Anotação de Sequência Molecular , Ligação Proteica , Ratos , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Transdução de Sinais , Vesículas Transportadoras/química , Vesículas Transportadoras/metabolismo , Vírus/genética , Vírus/metabolismo
5.
Nucleic Acids Res ; 50(D1): D480-D487, 2022 01 07.
Artigo em Inglês | MEDLINE | ID: mdl-34850135

RESUMO

The Database of Intrinsically Disordered Proteins (DisProt, URL: https://disprot.org) is the major repository of manually curated annotations of intrinsically disordered proteins and regions from the literature. We report here recent updates of DisProt version 9, including a restyled web interface, refactored Intrinsically Disordered Proteins Ontology (IDPO), improvements in the curation process and significant content growth of around 30%. Higher quality and consistency of annotations is provided by a newly implemented reviewing process and training of curators. The increased curation capacity is fostered by the integration of DisProt with APICURON, a dedicated resource for the proper attribution and recognition of biocuration efforts. Better interoperability is provided through the adoption of the Minimum Information About Disorder (MIADE) standard, an active collaboration with the Gene Ontology (GO) and Evidence and Conclusion Ontology (ECO) consortia and the support of the ELIXIR infrastructure.


Assuntos
Bases de Dados de Proteínas , Proteínas Intrinsicamente Desordenadas/metabolismo , Anotação de Sequência Molecular , Software , Sequência de Aminoácidos , DNA/genética , DNA/metabolismo , Conjuntos de Dados como Assunto , Ontologia Genética , Humanos , Internet , Proteínas Intrinsicamente Desordenadas/química , Proteínas Intrinsicamente Desordenadas/genética , Ligação Proteica , RNA/genética , RNA/metabolismo
6.
Brief Bioinform ; 22(5)2021 09 02.
Artigo em Inglês | MEDLINE | ID: mdl-33517364

RESUMO

One of the most intriguing fields emerging in current molecular biology is the study of membraneless organelles formed via liquid-liquid phase separation (LLPS). These organelles perform crucial functions in cell regulation and signalling, and recent years have also brought about the understanding of the molecular mechanism of their formation. The LLPS field is continuously developing and optimizing dedicated in vitro and in vivo methods to identify and characterize these non-stoichiometric molecular condensates and the proteins able to drive or contribute to LLPS. Building on these observations, several computational tools and resources have emerged in parallel to serve as platforms for the collection, annotation and prediction of membraneless organelle-linked proteins. In this survey, we showcase recent advancements in LLPS bioinformatics, focusing on (i) available databases and ontologies that are necessary to describe the studied phenomena and the experimental results in an unambiguous way and (ii) prediction methods to assess the potential LLPS involvement of proteins. Through hands-on application of these resources on example proteins and representative datasets, we give a practical guide to show how they can be used in conjunction to provide in silico information on LLPS.


Assuntos
Bases de Dados Factuais , Modelos Químicos , Organelas/química , Proteínas/química , Proteínas/isolamento & purificação
7.
Proc Natl Acad Sci U S A ; 117(42): 26206-26217, 2020 10 20.
Artigo em Inglês | MEDLINE | ID: mdl-33020264

RESUMO

Bacterial single-stranded (ss)DNA-binding proteins (SSB) are essential for the replication and maintenance of the genome. SSBs share a conserved ssDNA-binding domain, a less conserved intrinsically disordered linker (IDL), and a highly conserved C-terminal peptide (CTP) motif that mediates a wide array of protein-protein interactions with DNA-metabolizing proteins. Here we show that the Escherichia coli SSB protein forms liquid-liquid phase-separated condensates in cellular-like conditions through multifaceted interactions involving all structural regions of the protein. SSB, ssDNA, and SSB-interacting molecules are highly concentrated within the condensates, whereas phase separation is overall regulated by the stoichiometry of SSB and ssDNA. Together with recent results on subcellular SSB localization patterns, our results point to a conserved mechanism by which bacterial cells store a pool of SSB and SSB-interacting proteins. Dynamic phase separation enables rapid mobilization of this protein pool to protect exposed ssDNA and repair genomic loci affected by DNA damage.


Assuntos
Enzimas Reparadoras do DNA/metabolismo , DNA de Cadeia Simples/metabolismo , Proteínas de Ligação a DNA/isolamento & purificação , Proteínas de Escherichia coli/isolamento & purificação , Escherichia coli/metabolismo , Extração Líquido-Líquido/métodos , Dano ao DNA , Reparo do DNA , Enzimas Reparadoras do DNA/genética , DNA de Cadeia Simples/genética , Proteínas de Ligação a DNA/genética , Proteínas de Ligação a DNA/metabolismo , Escherichia coli/genética , Escherichia coli/crescimento & desenvolvimento , Proteínas de Escherichia coli/genética , Proteínas de Escherichia coli/metabolismo , Ligação Proteica
8.
Int J Mol Sci ; 24(20)2023 Oct 13.
Artigo em Inglês | MEDLINE | ID: mdl-37894817

RESUMO

Colon cancer is a leading cause of death worldwide. Identification of new molecular factors governing the invasiveness of colon cancer holds promise in developing screening and targeted therapeutic methods. The Tyrosine Kinase Substrate with four SH3 domains (TKS4) and the CD2-associated protein (CD2AP) have previously been linked to dynamic actin assembly related processes and cancer cell migration, although their co-instructive role during tumor formation remained unknown. Therefore, this study was designed to investigate the TKS4-CD2AP interaction and study the interdependent effect of TKS4/CD2AP on oncogenic events. We identified CD2AP as a novel TKS4 interacting partner via co-immunoprecipitation-mass spectrometry methods. The interaction was validated via Western blot (WB), immunocytochemistry (ICC) and proximity ligation assay (PLA). The binding motif of CD2AP was explored via peptide microarray. To uncover the possible cooperative effects of TKS4 and CD2AP in cell movement and in epithelial-mesenchymal transition (EMT), we performed gene silencing and overexpressing experiments. Our results showed that TKS4 and CD2AP form a scaffolding protein complex and that they can regulate migration and EMT-related pathways in HCT116 colon cancer cells. This is the first study demonstrating the TKS4-CD2AP protein-protein interaction in vitro, their co-localization in intact cells, and their potential interdependent effects on partial-EMT in colon cancer.


Assuntos
Neoplasias do Colo , Transição Epitelial-Mesenquimal , Humanos , Proteínas Adaptadoras de Transdução de Sinal/metabolismo , Movimento Celular , Neoplasias do Colo/genética , Proteínas do Citoesqueleto/metabolismo
9.
Nucleic Acids Res ; 48(D1): D296-D306, 2020 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-31680160

RESUMO

The eukaryotic linear motif (ELM) resource is a repository of manually curated experimentally validated short linear motifs (SLiMs). Since the initial release almost 20 years ago, ELM has become an indispensable resource for the molecular biology community for investigating functional regions in many proteins. In this update, we have added 21 novel motif classes, made major revisions to 12 motif classes and added >400 new instances mostly focused on DNA damage, the cytoskeleton, SH2-binding phosphotyrosine motifs and motif mimicry by pathogenic bacterial effector proteins. The current release of the ELM database contains 289 motif classes and 3523 individual protein motif instances manually curated from 3467 scientific publications. ELM is available at: http://elm.eu.org.


Assuntos
Motivos de Aminoácidos , Eucariotos , Apicoplastos/metabolismo , Citoesqueleto , Dano ao DNA , Bases de Dados de Proteínas , Fosfotirosina , Domínios de Homologia de src
10.
Nucleic Acids Res ; 48(D1): D360-D367, 2020 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-31612960

RESUMO

Membraneless organelles (MOs) are dynamic liquid condensates that host a variety of specific cellular processes, such as ribosome biogenesis or RNA degradation. MOs form through liquid-liquid phase separation (LLPS), a process that relies on multivalent weak interactions of the constituent proteins and other macromolecules. Since the first discoveries of certain proteins being able to drive LLPS, it emerged as a general mechanism for the effective organization of cellular space that is exploited in all kingdoms of life. While numerous experimental studies report novel cases, the computational identification of LLPS drivers is lagging behind, and many open questions remain about the sequence determinants, composition, regulation and biological relevance of the resulting condensates. Our limited ability to overcome these issues is largely due to the lack of a dedicated LLPS database. Therefore, here we introduce PhaSePro (https://phasepro.elte.hu), an openly accessible, comprehensive, manually curated database of experimentally validated LLPS driver proteins/protein regions. It not only provides a wealth of information on such systems, but improves the standardization of data by introducing novel LLPS-specific controlled vocabularies. PhaSePro can be accessed through an appealing, user-friendly interface and thus has definite potential to become the central resource in this dynamically developing field.


Assuntos
Bases de Dados de Proteínas , Proteínas/química , Vocabulário Controlado , Organelas/metabolismo , Proteínas/metabolismo , Interface Usuário-Computador
11.
Nucleic Acids Res ; 48(D1): D269-D276, 2020 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-31713636

RESUMO

The Database of Protein Disorder (DisProt, URL: https://disprot.org) provides manually curated annotations of intrinsically disordered proteins from the literature. Here we report recent developments with DisProt (version 8), including the doubling of protein entries, a new disorder ontology, improvements of the annotation format and a completely new website. The website includes a redesigned graphical interface, a better search engine, a clearer API for programmatic access and a new annotation interface that integrates text mining technologies. The new entry format provides a greater flexibility, simplifies maintenance and allows the capture of more information from the literature. The new disorder ontology has been formalized and made interoperable by adopting the OWL format, as well as its structure and term definitions have been improved. The new annotation interface has made the curation process faster and more effective. We recently showed that new DisProt annotations can be effectively used to train and validate disorder predictors. We believe the growth of DisProt will accelerate, contributing to the improvement of function and disorder predictors and therefore to illuminate the 'dark' proteome.


Assuntos
Bases de Dados de Proteínas , Proteínas Intrinsicamente Desordenadas/química , Ontologias Biológicas , Curadoria de Dados , Anotação de Sequência Molecular
12.
Int J Mol Sci ; 22(6)2021 Mar 16.
Artigo em Inglês | MEDLINE | ID: mdl-33809541

RESUMO

Liquid-liquid phase separation (LLPS) is a molecular process that leads to the formation of membraneless organelles, representing functionally specialized liquid-like cellular condensates formed by proteins and nucleic acids. Integrating the data on LLPS-associated proteins from dedicated databases revealed only modest agreement between them and yielded a high-confidence dataset of 89 human LLPS drivers. Analysis of the supporting evidence for our dataset uncovered a systematic and potentially concerning difference between protein concentrations used in a good fraction of the in vitro LLPS experiments, a key parameter that governs the phase behavior, and the proteomics-derived cellular abundance levels of the corresponding proteins. Closer scrutiny of the underlying experimental data enabled us to offer a sound rationale for this systematic difference, which draws on our current understanding of the cellular organization of the proteome and the LLPS process. In support of this rationale, we find that genes coding for our human LLPS drivers tend to be dosage-sensitive, suggesting that their cellular availability is tightly regulated to preserve their functional role in direct or indirect relation to condensate formation. Our analysis offers guideposts for increasing agreement between in vitro and in vivo studies, probing the roles of proteins in LLPS.


Assuntos
Dosagem de Genes , Genes , Transição de Fase , Proteínas/química , Bases de Dados Factuais , Humanos , Anotação de Sequência Molecular , Organelas , Proteoma/metabolismo
13.
Trends Biochem Sci ; 41(11): 898-906, 2016 11.
Artigo em Inglês | MEDLINE | ID: mdl-27647212

RESUMO

Numerous DNA- and RNA-level functions are embedded in protein-coding regions, which constrains their structure, function, and evolution. Accumulating evidence suggests that such additional, overlapping functions occur preferentially in the coding sequences of intrinsically disordered proteins/regions (IDPs/IDRs), especially in those that are newly incorporated and thus have reduced selective pressure. It is the lack of strict structural constraints that makes disordered proteins more tolerant to mutations and thus more permissive to the appearance of overlapping functions within their coding sequences than structured domains. Therefore, IDPs/IDRs are often mosaics of segments fulfilling protein functionalities and intervening regions primarily carrying nucleotide-level functions. The ensuing complexification of gene-regulatory circuits may have contributed to the evolutionary spread of structural disorder in complex eukaryotic organisms.


Assuntos
Evolução Molecular , Proteínas Intrinsicamente Desordenadas/química , Fases de Leitura Aberta , RNA Mensageiro/metabolismo , Éxons , Expressão Gênica , HIV-1/genética , HIV-1/metabolismo , Humanos , Proteínas Intrinsicamente Desordenadas/genética , Proteínas Intrinsicamente Desordenadas/metabolismo , Modelos Moleculares , Plasmodium falciparum/genética , Plasmodium falciparum/metabolismo , Conformação Proteica , Dobramento de Proteína , RNA Mensageiro/genética , Seleção Genética , Relação Estrutura-Atividade
14.
Hepatology ; 70(2): 532-546, 2019 08.
Artigo em Inglês | MEDLINE | ID: mdl-30153342

RESUMO

The interaction between RNA-binding proteins (RBPs) and RNA plays an important role in regulating cellular function. However, decoding genome-wide protein-RNA regulatory networks as well as how cancer-related mutations impair RNA regulatory activities in hepatocellular carcinoma (HCC) remains mostly undetermined. We explored the genetic alteration patterns of RBPs and found that deleterious mutations are likely to occur on the surface of RBPs. We then constructed protein-RNA interactome networks by integration of target binding screens and expression profiles. Network analysis highlights regulatory principles among interacting RBPs. In addition, somatic mutations selectively target functionally important genes (cancer genes, core fitness genes, or conserved genes) and perturb the RBP-gene regulatory networks in cancer. These regulatory patterns were further validated using independent data. A computational method (Mutational Effect on RNA Interactome Topology) and a web-based, user-friendly resource were further proposed to analyze the RBP-gene regulatory networks across cancer types. Pan-cancer analysis also suggests that cancer cells selectively target "vulnerability" genes to perturb protein-RNA interactome that is involved in cancer hallmark-related functions. Specifically, we experimentally validated four pairs of RBP-gene interactions perturbed by mutations in HCC, which play critical roles in cell proliferation. Based on the expression of perturbed RBP and target genes, we identified three subtypes of HCC with different survival rates. Conclusion: Our results provide a valuable resource for characterizing somatic mutation-perturbed protein-RNA regulatory networks in HCC, yielding valuable insights into the genotype-phenotype relationships underlying human cancer, and potential biomarkers for precision medicine.


Assuntos
Carcinoma Hepatocelular/genética , Neoplasias Hepáticas/genética , Mutação , Proteínas de Ligação a RNA/genética , RNA/genética , Carcinoma Hepatocelular/mortalidade , Carcinoma Hepatocelular/patologia , Proliferação de Células , Redes Reguladoras de Genes , Humanos , Neoplasias Hepáticas/mortalidade , Neoplasias Hepáticas/patologia , Mapas de Interação de Proteínas , Taxa de Sobrevida
15.
Nucleic Acids Res ; 46(D1): D387-D392, 2018 01 04.
Artigo em Inglês | MEDLINE | ID: mdl-29040693

RESUMO

Soluble functional proteins may transform into insoluble amyloid fibrils that deposit in a variety of tissues. Amyloid formation is a hallmark of age-related degenerative disorders. Perhaps surprisingly, amyloid fibrils can also be beneficial and are frequently exploited for diverse functional roles in organisms. Here we introduce AmyPro, an open-access database providing a comprehensive, carefully curated collection of validated amyloid fibril-forming proteins from all kingdoms of life classified into broad functional categories (http://amypro.net). In particular, AmyPro provides the boundaries of experimentally validated amyloidogenic sequence regions, short descriptions of the functional relevance of the proteins and their amyloid state, a list of the experimental techniques applied to study the amyloid state, important structural/functional/variation/mutation data transferred from UniProt, a list of relevant PDB structures categorized according to protein states, database cross-references and literature references. AmyPro greatly improves on similar currently available resources by incorporating both prions and functional amyloids in addition to pathogenic amyloids, and allows users to screen their sequences against the entire collection of validated amyloidogenic sequence fragments. By enabling further elucidation of the sequential determinants of amyloid fibril formation, we hope AmyPro will enhance the development of new methods for the precise prediction of amyloidogenic regions within proteins.


Assuntos
Proteínas Amiloidogênicas/química , Bases de Dados de Proteínas , Interface Usuário-Computador
16.
Bioinformatics ; 34(3): 535-537, 2018 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-29385418

RESUMO

Motivation: Intrinsically Disordered Proteins (IDPs) mediate crucial protein-protein interactions, most notably in signaling and regulation. As their importance is increasingly recognized, the detailed analyses of specific IDP interactions opened up new opportunities for therapeutic targeting. Yet, large scale information about IDP-mediated interactions in structural and functional details are lacking, hindering the understanding of the mechanisms underlying this distinct binding mode. Results: Here, we present DIBS, the first comprehensive, curated collection of complexes between IDPs and ordered proteins. DIBS not only describes by far the highest number of cases, it also provides the dissociation constants of their interactions, as well as the description of potential post-translational modifications modulating the binding strength and linear motifs involved in the binding. Together with the wide range of structural and functional annotations, DIBS will provide the cornerstone for structural and functional studies of IDP complexes. Availability and implementation: DIBS is freely accessible at http://dibs.enzim.ttk.mta.hu/. The DIBS application is hosted by Apache web server and was implemented in PHP. To enrich querying features and to enhance backend performance a MySQL database was also created. Contact: dosztanyi@caesar.elte.hu or bmeszaros@caesar.elte.hu. Supplementary information: Supplementary data are available at Bioinformatics online.


Assuntos
Bases de Dados de Proteínas , Proteínas Intrinsicamente Desordenadas/metabolismo , Domínios e Motivos de Interação entre Proteínas , Sítios de Ligação , Proteínas Intrinsicamente Desordenadas/química
17.
Molecules ; 24(3)2019 Jan 29.
Artigo em Inglês | MEDLINE | ID: mdl-30699990

RESUMO

Whereas the concept of intrinsic disorder derives from biophysical observations of the lack of structure of proteins or protein regions under native conditions, many of our respective concepts rest on proteome-scale bioinformatics predictions. It is established that most predictors work reliably on proteins commonly encountered, but it is often neglected that we know very little about their performance on proteins of microorganisms that thrive in environments of extreme temperature, pH, or salt concentration, which may cause adaptive sequence composition bias. To address this issue, we predicted structural disorder for the complete proteomes of different extremophile groups by popular prediction methods and compared them to those of the reference mesophilic group. While significant deviations from mesophiles could be explained by a lack or gain of disordered regions in hyperthermophiles and radiotolerants, respectively, we found systematic overprediction in the case of halophiles. Additionally, examples were collected from the Protein Data Bank (PDB) to demonstrate misprediction and to help understand the underlying biophysical principles, i.e., halophilic proteins maintain a highly acidic and hydrophilic surface to avoid aggregation in high salt conditions. Although sparseness of data on disordered proteins from extremophiles precludes the development of dedicated general predictors, we do formulate recommendations for how to address their disorder with current bioinformatics tools.


Assuntos
Proteínas/química , Biologia Computacional/métodos , Bases de Dados de Proteínas/estatística & dados numéricos , Modelos Moleculares , Conformação Proteica , Proteoma/análise
18.
Biochemistry ; 57(17): 2452-2461, 2018 05 01.
Artigo em Inglês | MEDLINE | ID: mdl-29392932

RESUMO

Achieving functional specificity while minimizing cost to fitness is a key constraint during evolution. Formation of biological condensates by liquid-liquid phase separation (LLPS) appears to serve as an important regulatory mechanism to generate moderate specificity in molecular recognition while maintaining a reasonable cost for fitness in terms of design complexity. Formation of biological condensates serves as a unique mechanism of molecular recognition achieving some level of specificity without a huge cost to fitness. Rapid formation of biological condensates in vivo induced by specific cellular or environmental triggers has been shown to be an important mechanism for increasing cellular fitness. Here we discuss the functions and regulation of biological condensates, especially those formed by LLPS, involving interactions between proteins and nucleic acids. These condensates are spatially isolated within the cytosol or nucleus and can facilitate specific biochemical functions under conditions such as stress. The misregulation of biological condensates resulting in nondynamic aggregates has been implicated in a number of diseases. Understanding the functional importance of biological condensates and their regulation opens doors for development of therapies targeting dysfunctional biological condensates, as well as spatiotemporal engineering of functions in cells.


Assuntos
Evolução Biológica , Aptidão Genética , Ácidos Nucleicos/genética , Proteínas/genética , Bioensaio , Humanos , Modelos Biológicos , Ácidos Nucleicos/química , Transição de Fase , Proteínas/química
19.
Nucleic Acids Res ; 44(D1): D429-34, 2016 Jan 04.
Artigo em Inglês | MEDLINE | ID: mdl-26582925

RESUMO

Proteins fulfil a wide range of tasks in cells; understanding how they fold into complex three-dimensional (3D) structures and how these structures remain stable while retaining sufficient dynamics for functionality is essential for the interpretation of overall protein behaviour. Since the 1950's, solvent exchange-based methods have been the most powerful experimental means to obtain information on the folding and stability of proteins. Considerable expertise and care were required to obtain the resulting datasets, which, despite their importance and intrinsic value, have never been collected, curated and classified. Start2Fold is an openly accessible database (http://start2fold.eu) of carefully curated hydrogen/deuterium exchange (HDX) data extracted from the literature that is open for new submissions from the community. The database entries contain (i) information on the proteins investigated and the underlying experimental procedures and (ii) the classification of the residues based on their exchange protection levels, also allowing for the instant visualization of the relevant residue groups on the 3D structures of the corresponding proteins. By providing a clear hierarchical framework for the easy sharing, comparison and (re-)interpretation of HDX data, Start2Fold intends to promote a better understanding of how the protein sequence encodes folding and structure as well as the development of new computational methods predicting protein folding and stability.


Assuntos
Bases de Dados de Proteínas , Medição da Troca de Deutério , Dobramento de Proteína , Estabilidade Proteica , Conformação Proteica , Análise de Sequência de Proteína
20.
Int J Mol Sci ; 19(11)2018 Oct 25.
Artigo em Inglês | MEDLINE | ID: mdl-30366362

RESUMO

Although improved strategies for the detection and analysis of evolutionary couplings (ECs) between protein residues already enable the prediction of protein structures and interactions, they are mostly restricted to conserved and well-folded proteins. Whereas intrinsically disordered proteins (IDPs) are central to cellular interaction networks, due to the lack of strict structural constraints, they undergo faster evolutionary changes than folded domains. This makes the reliable identification and alignment of IDP homologs difficult, which led to IDPs being omitted in most large-scale residue co-variation analyses. By preforming a dedicated analysis of phylogenetically widespread bacterial IDP⁻partner interactions, here we demonstrate that partner binding imposes constraints on IDP sequences that manifest in detectable interprotein ECs. These ECs were not detected for interactions mediated by short motifs, rather for those with larger IDP⁻partner interfaces. Most identified coupled residue pairs reside close (<10 Å) to each other on the interface, with a third of them forming multiple direct atomic contacts. EC-carrying interfaces of IDPs are enriched in negatively charged residues, and the EC residues of both IDPs and partners preferentially reside in helices. Our analysis brings hope that IDP⁻partner interactions difficult to study could soon be successfully dissected through residue co-variation analysis.


Assuntos
Evolução Molecular , Proteínas Intrinsicamente Desordenadas/metabolismo , Animais , Humanos , Proteínas Intrinsicamente Desordenadas/química , Proteínas Intrinsicamente Desordenadas/genética , Ligação Proteica , Dobramento de Proteína
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA