Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 32
Filtrar
Mais filtros

País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
1.
Blood ; 142(24): 2055-2068, 2023 12 14.
Artigo em Inglês | MEDLINE | ID: mdl-37647632

RESUMO

Rare genetic diseases affect millions, and identifying causal DNA variants is essential for patient care. Therefore, it is imperative to estimate the effect of each independent variant and improve their pathogenicity classification. Our study of 140 214 unrelated UK Biobank (UKB) participants found that each of them carries a median of 7 variants previously reported as pathogenic or likely pathogenic. We focused on 967 diagnostic-grade gene (DGG) variants for rare bleeding, thrombotic, and platelet disorders (BTPDs) observed in 12 367 UKB participants. By association analysis, for a subset of these variants, we estimated effect sizes for platelet count and volume, and odds ratios for bleeding and thrombosis. Variants causal of some autosomal recessive platelet disorders revealed phenotypic consequences in carriers. Loss-of-function variants in MPL, which cause chronic amegakaryocytic thrombocytopenia if biallelic, were unexpectedly associated with increased platelet counts in carriers. We also demonstrated that common variants identified by genome-wide association studies (GWAS) for platelet count or thrombosis risk may influence the penetrance of rare variants in BTPD DGGs on their associated hemostasis disorders. Network-propagation analysis applied to an interactome of 18 410 nodes and 571 917 edges showed that GWAS variants with large effect sizes are enriched in DGGs and their first-order interactors. Finally, we illustrate the modifying effect of polygenic scores for platelet count and thrombosis risk on disease severity in participants carrying rare variants in TUBB1 or PROC and PROS1, respectively. Our findings demonstrate the power of association analyses using large population datasets in improving pathogenicity classifications of rare variants.


Assuntos
Estudo de Associação Genômica Ampla , Trombose , Humanos , Bancos de Espécimes Biológicos , Hemostasia , Hemorragia/genética , Doenças Raras
2.
Nucleic Acids Res ; 50(D1): D578-D586, 2022 01 07.
Artigo em Inglês | MEDLINE | ID: mdl-34718729

RESUMO

The Complex Portal (www.ebi.ac.uk/complexportal) is a manually curated, encyclopaedic database of macromolecular complexes with known function from a range of model organisms. It summarizes complex composition, topology and function along with links to a large range of domain-specific resources (i.e. wwPDB, EMDB and Reactome). Since the last update in 2019, we have produced a first draft complexome for Escherichia coli, maintained and updated that of Saccharomyces cerevisiae, added over 40 coronavirus complexes and increased the human complexome to over 1100 complexes that include approximately 200 complexes that act as targets for viral proteins or are part of the immune system. The display of protein features in ComplexViewer has been improved and the participant table is now colour-coordinated with the nodes in ComplexViewer. Community collaboration has expanded, for example by contributing to an analysis of putative transcription cofactors and providing data accessible to semantic web tools through Wikidata which is now populated with manually curated Complex Portal content through a new bot. Our data license is now CC0 to encourage data reuse. Users are encouraged to get in touch, provide us with feedback and send curation requests through the 'Support' link.


Assuntos
Curadoria de Dados/métodos , Bases de Dados de Proteínas , Complexos Multiproteicos/química , Coronavirus/química , Visualização de Dados , Bases de Dados de Compostos Químicos , Enzimas/química , Enzimas/metabolismo , Escherichia coli/química , Humanos , Cooperação Internacional , Anotação de Sequência Molecular , Complexos Multiproteicos/metabolismo , Interface Usuário-Computador
3.
Nucleic Acids Res ; 50(D1): D648-D653, 2022 01 07.
Artigo em Inglês | MEDLINE | ID: mdl-34761267

RESUMO

The IntAct molecular interaction database (https://www.ebi.ac.uk/intact) is a curated resource of molecular interactions, derived from the scientific literature and from direct data depositions. As of August 2021, IntAct provides more than one million binary interactions, curated by twelve global partners of the International Molecular Exchange consortium, for which the IntAct database provides a shared curation and dissemination platform. The IMEx curation policy has always emphasised a fine-grained data and curation model, aiming to capture the relevant experimental detail essential for the interpretation of the provided molecular interaction data. Here, we present recent curation focus and progress, as well as a completely redeveloped website which presents IntAct data in a much more user-friendly and detailed way.


Assuntos
Bases de Dados de Proteínas , Mapas de Interação de Proteínas/genética , Software , Humanos , Mapeamento de Interação de Proteínas/métodos
4.
Emerg Infect Dis ; 29(12): 2471-2481, 2023 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-37987585

RESUMO

Mycoplasma spp. are wall-less bacteria able to infect mammals and are classified as hemotropic (hemoplasma) and nonhemotropic. In aquatic mammals, hemoplasma have been reported in California sea lions (Zalophus californianus) and river dolphins (Inia spp.). We investigated Mycoplasma spp. in blood samples of West Indian manatees (Trichechus manatus), pinnipeds (5 species), and marine cetaceans (18 species) that stranded or were undergoing rehabilitation in Brazil during 2002-2022. We detected Mycoplasma in blood of 18/130 (14.8%) cetaceans and 3/18 (16.6%) pinnipeds. All tested manatees were PCR-negative for Mycoplasma. Our findings indicate that >2 different hemoplasma species are circulating in cetaceans. The sequences from pinnipeds were similar to previously described sequences. We also detected a nonhemotropic Mycoplasma in 2 Franciscana dolphins (Pontoporia blainvillei) that might be associated with microscopic lesions. Because certain hemoplasmas can cause disease and death in immunosuppressed mammals, the bacteria could have conservation implications for already endangered aquatic mammals.


Assuntos
Caniformia , Golfinhos , Infecções por Mycoplasma , Mycoplasma , Animais , Mycoplasma/genética , Brasil/epidemiologia , Infecções por Mycoplasma/epidemiologia , Infecções por Mycoplasma/veterinária , Infecções por Mycoplasma/microbiologia , Mamíferos , RNA Ribossômico 16S
5.
Nucleic Acids Res ; 49(6): 3156-3167, 2021 04 06.
Artigo em Inglês | MEDLINE | ID: mdl-33677561

RESUMO

The EMBL-EBI Complex Portal is a knowledgebase of macromolecular complexes providing persistent stable identifiers. Entries are linked to literature evidence and provide details of complex membership, function, structure and complex-specific Gene Ontology annotations. Data are freely available and downloadable in HUPO-PSI community standards and missing entries can be requested for curation. In collaboration with Saccharomyces Genome Database and UniProt, the yeast complexome, a compendium of all known heteromeric assemblies from the model organism Saccharomyces cerevisiae, was curated. This expansion of knowledge and scope has led to a 50% increase in curated complexes compared to the previously published dataset, CYC2008. The yeast complexome is used as a reference resource for the analysis of complexes from large-scale experiments. Our analysis showed that genes coding for proteins in complexes tend to have more genetic interactions, are co-expressed with more genes, are more multifunctional, localize more often in the nucleus, and are more often involved in nucleic acid-related metabolic processes and processes where large machineries are the predominant functional drivers. A comparison to genetic interactions showed that about 40% of expanded co-complex pairs also have genetic interactions, suggesting strong functional links between complex members.


Assuntos
Proteínas de Saccharomyces cerevisiae/metabolismo , Saccharomyces cerevisiae/metabolismo , Conjuntos de Dados como Assunto , Ontologia Genética , Bases de Conhecimento , Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/genética
6.
Bioinformatics ; 37(20): 3684-3685, 2021 Oct 25.
Artigo em Inglês | MEDLINE | ID: mdl-33961020

RESUMO

SUMMARY: IntAct App is a Cytoscape 3 application that grants in-depth access to IntAct's molecular interaction data. It build networks where nodes are interacting molecules (mainly proteins, but also genes, RNA, chemicals…) and edges represent evidence of interaction. Users can query a network by providing its molecules, identified by different fields and optionally include all their interacting partners in the resulting network. The app offers three visualizations: one only displaying interactions, another representing every evidence and the last one emphasizing evidence where mutated versions of proteins were used. Users can also filter networks and click on nodes and edges to access all their related details. Finally, the application supports automation of its main features via Cytoscape commands. AVAILABILITY AND IMPLEMENTATION: Implementation available at https://apps.cytoscape.org/apps/intactapp, while the source code is available at https://github.com/EBI-IntAct/IntactApp.

7.
Bioinformatics ; 36(24): 5712-5718, 2021 04 05.
Artigo em Inglês | MEDLINE | ID: mdl-32637990

RESUMO

MOTIVATION: A large variety of molecular interactions occurs between biomolecular components in cells. When a molecular interaction results in a regulatory effect, exerted by one component onto a downstream component, a so-called 'causal interaction' takes place. Causal interactions constitute the building blocks in our understanding of larger regulatory networks in cells. These causal interactions and the biological processes they enable (e.g. gene regulation) need to be described with a careful appreciation of the underlying molecular reactions. A proper description of this information enables archiving, sharing and reuse by humans and for automated computational processing. Various representations of causal relationships between biological components are currently used in a variety of resources. RESULTS: Here, we propose a checklist that accommodates current representations, called the Minimum Information about a Molecular Interaction CAusal STatement (MI2CAST). This checklist defines both the required core information, as well as a comprehensive set of other contextual details valuable to the end user and relevant for reusing and reproducing causal molecular interaction information. The MI2CAST checklist can be used as reporting guidelines when annotating and curating causal statements, while fostering uniformity and interoperability of the data across resources. AVAILABILITY AND IMPLEMENTATION: The checklist together with examples is accessible at https://github.com/MI2CAST/MI2CAST. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Software , Causalidade , Humanos
8.
J Cell Sci ; 132(17)2019 09 06.
Artigo em Inglês | MEDLINE | ID: mdl-31391242

RESUMO

The muscle-specific RING-finger protein MuRF1 (also known as TRIM63) constitutes a bona fide ubiquitin ligase that routes proteins like several different myosin heavy chain proteins (MyHC) to proteasomal degradation during muscle atrophy. In two unbiased screens, we identified DCAF8 as a new MuRF1-binding partner. MuRF1 physically interacts with DCAF8 and both proteins localize to overlapping structures in muscle cells. Importantly, similar to what is seen for MuRF1, DCAF8 levels increase during atrophy, and the downregulation of either protein substantially impedes muscle wasting and MyHC degradation in C2C12 myotubes, a model system for muscle differentiation and atrophy. DCAF proteins typically serve as substrate receptors for cullin 4-type (Cul4) ubiquitin ligases (CRL), and we demonstrate that DCAF8 and MuRF1 associate with the subunits of such a protein complex. Because genetic downregulation of DCAF8 and inhibition of cullin activity also impair myotube atrophy in C2C12 cells, our data imply that the DCAF8 promotes muscle wasting by targeting proteins like MyHC as an integral substrate receptor of a Cul4A-containing ring ubiquitin ligase complex (CRL4A).This article has an associated First Person interview with the first author of the paper.


Assuntos
Proteínas Musculares/metabolismo , Atrofia Muscular/metabolismo , Proteínas com Motivo Tripartido/metabolismo , Ubiquitina-Proteína Ligases/metabolismo , Animais , Células COS , Proteínas de Transporte , Chlorocebus aethiops , Humanos , Camundongos , Atrofia Muscular/enzimologia , Ratos , Transfecção
9.
Nucleic Acids Res ; 47(D1): D550-D558, 2019 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-30357405

RESUMO

The Complex Portal (www.ebi.ac.uk/complexportal) is a manually curated, encyclopaedic database that collates and summarizes information on stable, macromolecular complexes of known function. It captures complex composition, topology and function and links out to a large range of domain-specific resources that hold more detailed data, such as PDB or Reactome. We have made several significant improvements since our last update, including improving compliance to the FAIR data principles by providing complex-specific, stable identifiers that include versioning. Protein complexes are now available from 20 species for download in standards-compliant formats such as PSI-XML, MI-JSON and ComplexTAB or can be accessed via an improved REST API. A component-based JS front-end framework has been implemented to drive a new website and this has allowed the use of APIs from linked services to import and visualize information such as the 3D structure of protein complexes, its role in reactions and pathways and the co-expression of complex components in the tissues of multi-cellular organisms. A first draft of the complete complexome of Saccharomyces cerevisiae is now available to browse and download.


Assuntos
Bases de Dados de Proteínas , Complexos Multiproteicos/química , Animais , Gráficos por Computador , Humanos , Substâncias Macromoleculares/química , Camundongos , Complexos Multiproteicos/metabolismo , Ácidos Nucleicos/química , Conformação Proteica
10.
Nucleic Acids Res ; 42(Database issue): D358-63, 2014 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-24234451

RESUMO

IntAct (freely available at http://www.ebi.ac.uk/intact) is an open-source, open data molecular interaction database populated by data either curated from the literature or from direct data depositions. IntAct has developed a sophisticated web-based curation tool, capable of supporting both IMEx- and MIMIx-level curation. This tool is now utilized by multiple additional curation teams, all of whom annotate data directly into the IntAct database. Members of the IntAct team supply appropriate levels of training, perform quality control on entries and take responsibility for long-term data maintenance. Recently, the MINT and IntAct databases decided to merge their separate efforts to make optimal use of limited developer resources and maximize the curation output. All data manually curated by the MINT curators have been moved into the IntAct database at EMBL-EBI and are merged with the existing IntAct dataset. Both IntAct and MINT are active contributors to the IMEx consortium (http://www.imexconsortium.org).


Assuntos
Bases de Dados de Proteínas , Mapeamento de Interação de Proteínas , Internet , Software
11.
PLoS Genet ; 9(3): e1003398, 2013 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-23555304

RESUMO

Essentially all biological processes depend on protein-protein interactions (PPIs). Timing of such interactions is crucial for regulatory function. Although circadian (~24-hour) clocks constitute fundamental cellular timing mechanisms regulating important physiological processes, PPI dynamics on this timescale are largely unknown. Here, we identified 109 novel PPIs among circadian clock proteins via a yeast-two-hybrid approach. Among them, the interaction of protein phosphatase 1 and CLOCK/BMAL1 was found to result in BMAL1 destabilization. We constructed a dynamic circadian PPI network predicting the PPI timing using circadian expression data. Systematic circadian phenotyping (RNAi and overexpression) suggests a crucial role for components involved in dynamic interactions. Systems analysis of a global dynamic network in liver revealed that interacting proteins are expressed at similar times likely to restrict regulatory interactions to specific phases. Moreover, we predict that circadian PPIs dynamically connect many important cellular processes (signal transduction, cell cycle, etc.) contributing to temporal organization of cellular physiology in an unprecedented manner.


Assuntos
Proteínas CLOCK , Relógios Circadianos/genética , Ritmo Circadiano/genética , Mapas de Interação de Proteínas/genética , Fatores de Transcrição ARNTL/genética , Fatores de Transcrição ARNTL/metabolismo , Proteínas CLOCK/genética , Proteínas CLOCK/metabolismo , Ciclo Celular/genética , Ciclo Celular/fisiologia , Receptores ErbB/metabolismo , Células HEK293 , Humanos , Proteína Fosfatase 1/genética , Proteína Fosfatase 1/metabolismo , Transdução de Sinais
12.
Proteomics ; 15(8): 1390-404, 2015 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-25648416

RESUMO

Molecular interaction databases are essential resources that enable access to a wealth of information on associations between proteins and other biomolecules. Network graphs generated from these data provide an understanding of the relationships between different proteins in the cell, and network analysis has become a widespread tool supporting -omics analysis. Meaningfully representing this information remains far from trivial and different databases strive to provide users with detailed records capturing the experimental details behind each piece of interaction evidence. A targeted curation approach is necessary to transfer published data generated by primarily low-throughput techniques into interaction databases. In this review we present an example highlighting the value of both targeted curation and the subsequent effective visualization of detailed features of manually curated interaction information. We have curated interactions involving LRRK2, a protein of largely unknown function linked to familial forms of Parkinson's disease, and hosted the data in the IntAct database. This LRRK2-specific dataset was then used to produce different visualization examples highlighting different aspects of the data: the level of confidence in the interaction based on orthogonal evidence, those interactions found under close-to-native conditions, and the enzyme-substrate relationships in different in vitro enzymatic assays. Finally, pathway annotation taken from the Reactome database was overlaid on top of interaction networks to bring biological functional context to interaction maps.


Assuntos
Mapas de Interação de Proteínas , Proteínas Serina-Treonina Quinases/fisiologia , Animais , Gráficos por Computador , Bases de Dados de Proteínas , Humanos , Serina-Treonina Proteína Quinase-2 com Repetições Ricas em Leucina , Anotação de Sequência Molecular , Doença de Parkinson/metabolismo , Proteômica/métodos , Software
13.
Nucleic Acids Res ; 41(3): 1496-507, 2013 Feb 01.
Artigo em Inglês | MEDLINE | ID: mdl-23275563

RESUMO

The yeast two-hybrid (Y2H) system is the most widely applied methodology for systematic protein-protein interaction (PPI) screening and the generation of comprehensive interaction networks. We developed a novel Y2H interaction screening procedure using DNA microarrays for high-throughput quantitative PPI detection. Applying a global pooling and selection scheme to a large collection of human open reading frames, proof-of-principle Y2H interaction screens were performed for the human neurodegenerative disease proteins huntingtin and ataxin-1. Using systematic controls for unspecific Y2H results and quantitative benchmarking, we identified and scored a large number of known and novel partner proteins for both huntingtin and ataxin-1. Moreover, we show that this parallelized screening procedure and the global inspection of Y2H interaction data are uniquely suited to define specific PPI patterns and their alteration by disease-causing mutations in huntingtin and ataxin-1. This approach takes advantage of the specificity and flexibility of DNA microarrays and of the existence of solid-related statistical methods for the analysis of DNA microarray data, and allows a quantitative approach toward interaction screens in human and in model organisms.


Assuntos
Análise de Sequência com Séries de Oligonucleotídeos/métodos , Técnicas do Sistema de Duplo-Híbrido , Ataxina-1 , Ataxinas , Humanos , Proteína Huntingtina , Mutação , Proteínas do Tecido Nervoso/genética , Proteínas do Tecido Nervoso/metabolismo , Proteínas Nucleares/genética , Proteínas Nucleares/metabolismo , Fases de Leitura Aberta , Mapas de Interação de Proteínas , Leveduras/genética
14.
Nucleic Acids Res ; 40(Database issue): D841-6, 2012 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-22121220

RESUMO

IntAct is an open-source, open data molecular interaction database populated by data either curated from the literature or from direct data depositions. Two levels of curation are now available within the database, with both IMEx-level annotation and less detailed MIMIx-compatible entries currently supported. As from September 2011, IntAct contains approximately 275,000 curated binary interaction evidences from over 5000 publications. The IntAct website has been improved to enhance the search process and in particular the graphical display of the results. New data download formats are also available, which will facilitate the inclusion of IntAct's data in the Semantic Web. IntAct is an active contributor to the IMEx consortium (http://www.imexconsortium.org). IntAct source code and data are freely available at http://www.ebi.ac.uk/intact.


Assuntos
Bases de Dados de Proteínas , Mapeamento de Interação de Proteínas , Gráficos por Computador , Genes , Internet , Anotação de Sequência Molecular , Análise de Sequência de Proteína , Software
15.
Database (Oxford) ; 20232023 10 11.
Artigo em Inglês | MEDLINE | ID: mdl-37819683

RESUMO

In recent years, a huge amount of data on ncRNA interactions has been described in scientific papers and databases. Although considerable effort has been made to annotate the available knowledge in public repositories, there are still significant discrepancies in how different resources capture and interpret data on ncRNA functional and physical associations. In the present paper, we present a collection of microRNA-mRNA interactions annotated from the scientific literature following recognized standard criteria and focused on microRNAs, which regulate genes associated with rare diseases as a case study. The list of protein-coding genes with a known role in specific rare diseases was retrieved from the Genome England PanelApp, and associated microRNA-mRNA interactions were annotated in the IntAct database and compared with other datasets. RNAcentral identifiers were used for unambiguous, stable identification of ncRNAs. The information about the interaction was enhanced by a detailed description of the cell types and experimental conditions, providing a computer-interpretable summary of the published data, integrated with the huge amount of protein interactions already gathered in the database. Furthermore, for each interaction, the binding sites of the microRNA are precisely mapped on a well-defined mRNA transcript of the target gene. This information is crucial to conceive and design optimal microRNA mimics or inhibitors to interfere in vivo with a deregulated process. As these approaches become more feasible, high-quality, reliable networks of microRNA interactions are needed to help, for instance, in the selection of the best target to be inhibited and to predict potential secondary off-target effects. Database URL https://www.ebi.ac.uk/intact.


Assuntos
MicroRNAs , Humanos , MicroRNAs/genética , MicroRNAs/metabolismo , Doenças Raras/genética , RNA não Traduzido , Bases de Dados Factuais , RNA Mensageiro/genética
16.
Nat Genet ; 55(3): 389-398, 2023 03.
Artigo em Inglês | MEDLINE | ID: mdl-36823319

RESUMO

Interacting proteins tend to have similar functions, influencing the same organismal traits. Interaction networks can be used to expand the list of candidate trait-associated genes from genome-wide association studies. Here, we performed network-based expansion of trait-associated genes for 1,002 human traits showing that this recovers known disease genes or drug targets. The similarity of network expansion scores identifies groups of traits likely to share an underlying genetic and biological process. We identified 73 pleiotropic gene modules linked to multiple traits, enriched in genes involved in processes such as protein ubiquitination and RNA processing. In contrast to gene deletion studies, pleiotropy as defined here captures specifically multicellular-related processes. We show examples of modules linked to human diseases enriched in genes with known pathogenic variants that can be used to map targets of approved drugs for repurposing. Finally, we illustrate the use of network expansion scores to study genes at inflammatory bowel disease genome-wide association study loci, and implicate inflammatory bowel disease-relevant genes with strong functional and genetic support.


Assuntos
Biologia Celular , Células , Doença , Estudos de Associação Genética , Pleiotropia Genética , Estudos de Associação Genética/métodos , Humanos , Ubiquitinação/genética , Processamento Pós-Transcricional do RNA/genética , Células/metabolismo , Células/patologia , Reposicionamento de Medicamentos/métodos , Reposicionamento de Medicamentos/tendências , Doença/genética , Doenças Inflamatórias Intestinais/genética , Doenças Inflamatórias Intestinais/patologia , Estudo de Associação Genômica Ampla , Fenótipo , Doenças Autoimunes/genética , Doenças Autoimunes/patologia
17.
J Proteome Res ; 11(4): 2014-31, 2012 Apr 06.
Artigo em Inglês | MEDLINE | ID: mdl-22385417

RESUMO

The advent of the "omics" era in biology research has brought new challenges and requires the development of novel strategies to answer previously intractable questions. Molecular interaction networks provide a framework to visualize cellular processes, but their complexity often makes their interpretation an overwhelming task. The inherently artificial nature of interaction detection methods and the incompleteness of currently available interaction maps call for a careful and well-informed utilization of this valuable data. In this tutorial, we aim to give an overview of the key aspects that any researcher needs to consider when working with molecular interaction data sets and we outline an example for interactome analysis. Using the molecular interaction database IntAct, the software platform Cytoscape, and its plugins BiNGO and clusterMaker, and taking as a starting point a list of proteins identified in a mass spectrometry-based proteomics experiment, we show how to build, visualize, and analyze a protein-protein interaction network.


Assuntos
Biologia Computacional/métodos , Sistemas de Gerenciamento de Base de Dados , Mapeamento de Interação de Proteínas/métodos , Proteômica/métodos , Animais , Análise por Conglomerados , Bases de Dados de Proteínas , Ensaios de Triagem em Larga Escala/métodos , Humanos , Mapas de Interação de Proteínas , Proteômica/educação , Interface Usuário-Computador
18.
Methods Mol Biol ; 2449: 27-42, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35507258

RESUMO

Molecular interaction databases aim to systematically capture and organize the experimental interaction information described in the scientific literature. These data can then be used to perform network analysis, to assign putative roles to uncharacterized proteins and to investigate their involvement in cellular pathways.This chapter gives a brief overview of publicly available molecular interaction databases and focuses on the members of the IMEx Consortium, on their curation policies and standard data formats. All of the goals achieved by IMEx databases over the last 15 years, the data types provided and the many different ways in which such data can be utilized by the research community, are described in detail. The IMEx databases curate molecular interaction data to the highest caliber, following a detailed curation model and supplying rich metadata by employing common curation rules and harmonized standards. The IMEx Consortium provides comprehensively annotated molecular interaction data integrated into a single, non-redundant, open access dataset.


Assuntos
Mapeamento de Interação de Proteínas , Proteínas , Gerenciamento de Dados , Bases de Dados de Compostos Químicos , Bases de Dados de Proteínas , Proteínas/metabolismo
19.
Biochim Biophys Acta ; 1804(4): 839-45, 2010 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-20036764

RESUMO

We have previously shown that glutaredoxin 2 (Grx2) from Saccharomyces cerevisiae localizes at 3 different subcellular compartments, cytosol, mitochondrial matrix and outer membrane, as the result of different postranslational processing of one single gene. Having set the mechanism responsible for this remarkable phenomenon, we have now aimed at defining whether this diversity of subcellular localizations correlates with differences in structure and function of the Grx2 isoforms. We have determined the N-terminal sequence of the soluble mitochondrial matrix Grx2 by mass spectrometry and have determined the exact cleavage site by Mitochondrial Processing Peptidase (MPP). As a consequence of this cleavage, the mitochondrial matrix Grx2 isoform possesses a basic tetrapeptide extension at the N-terminus compared to the cytosolic form. A functional relationship to this structural difference is that mitochondrial Grx2 displays a markedly higher activity in the catalysis of GSSG reduction by the mitochondrial dithiol dihydrolipoamide. We have prepared Grx2 mutants affected on key residues inside the presequence to direct the protein to one single cellular compartment; either the cytosol, the mitochondrial membrane or the matrix and have analyzed their functional phenotypes. Strains expressing Grx2 only in the cytosol are equally sensitive to H(2)O(2) as strains lacking the gene, whereas those expressing Grx2 exclusively in the mitochondrial matrix are more resistant. Mutations on key basic residues drastically affect the cellular fate of the protein, showing that evolutionary diversification of Grx2 structural and functional properties are strictly dependent on the sequence of the targeting signal peptide.


Assuntos
Glutarredoxinas/química , Glutarredoxinas/metabolismo , Proteínas de Saccharomyces cerevisiae/química , Proteínas de Saccharomyces cerevisiae/metabolismo , Sequência de Aminoácidos , Sequência de Bases , Sítios de Ligação/genética , Primers do DNA/genética , DNA Fúngico/genética , Glutarredoxinas/genética , Mitocôndrias/enzimologia , Dados de Sequência Molecular , Mutagênese Sítio-Dirigida , Proteínas Mutantes/química , Proteínas Mutantes/genética , Proteínas Mutantes/metabolismo , Processamento de Proteína Pós-Traducional , RNA Fúngico/genética , RNA Fúngico/metabolismo , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Saccharomyces cerevisiae/enzimologia , Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/genética , Frações Subcelulares/enzimologia , Espectrometria de Massas em Tandem
20.
Biochim Biophys Acta Gene Regul Mech ; 1864(10): 194749, 2021 10.
Artigo em Inglês | MEDLINE | ID: mdl-34425241

RESUMO

The domain of transcription regulation has been notoriously difficult to annotate in the Gene Ontology, partly because of the intricacies of gene regulation which involve molecular interactions with DNA as well as amongst protein complexes. The molecular function 'transcription coregulator activity' is a part of the biological process 'regulation of transcription, DNA-templated' that occurs in the cellular component 'chromatin'. It can mechanistically link sequence-specific DNA-binding transcription factor (dbTF) regulatory DNA target sites to coactivator and corepressor target sites through the molecular function 'cis-regulatory region sequence-specific DNA binding'. Many questions arise about transcription coregulators (coTF). Here, we asked how many unannotated, putative coregulators can be identified in protein complexes? Therefore, we mined the CORUM and hu.MAP protein complex databases with known and strongly presumed human transcription coregulators. In addition, we trawled the BioGRID and IntAct molecular interaction databases for interactors of the known 1457 human dbTFs annotated by the GREEKC and GO consortia. This yielded 1093 putative transcription factor coregulator complex subunits, of which 954 interact directly with a dbTF. This substantially expands the set of coTFs that could be annotated to 'transcription coregulator activity' and sets the stage for renewed annotation and wet-lab research efforts. To this end, we devised a prioritisation score based on existing GO annotations of already curated transcription coregulators as well as interactome representation. Since all the proteins that we mined are parts of protein complexes, we propose to concomitantly engage in annotation of the putative transcription coregulator-containing complexes in the Complex Portal database.


Assuntos
Proteínas de Ligação a DNA/metabolismo , Fatores de Transcrição/metabolismo , Sequência de Bases , DNA/química , Mineração de Dados , Bases de Dados Genéticas , Regulação da Expressão Gênica , Humanos , Mapeamento de Interação de Proteínas , Subunidades Proteicas/metabolismo , Transcrição Gênica
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA