Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 32
Filter
Add more filters

Country/Region as subject
Publication year range
1.
Blood ; 142(24): 2055-2068, 2023 12 14.
Article in English | MEDLINE | ID: mdl-37647632

ABSTRACT

Rare genetic diseases affect millions, and identifying causal DNA variants is essential for patient care. Therefore, it is imperative to estimate the effect of each independent variant and improve their pathogenicity classification. Our study of 140 214 unrelated UK Biobank (UKB) participants found that each of them carries a median of 7 variants previously reported as pathogenic or likely pathogenic. We focused on 967 diagnostic-grade gene (DGG) variants for rare bleeding, thrombotic, and platelet disorders (BTPDs) observed in 12 367 UKB participants. By association analysis, for a subset of these variants, we estimated effect sizes for platelet count and volume, and odds ratios for bleeding and thrombosis. Variants causal of some autosomal recessive platelet disorders revealed phenotypic consequences in carriers. Loss-of-function variants in MPL, which cause chronic amegakaryocytic thrombocytopenia if biallelic, were unexpectedly associated with increased platelet counts in carriers. We also demonstrated that common variants identified by genome-wide association studies (GWAS) for platelet count or thrombosis risk may influence the penetrance of rare variants in BTPD DGGs on their associated hemostasis disorders. Network-propagation analysis applied to an interactome of 18 410 nodes and 571 917 edges showed that GWAS variants with large effect sizes are enriched in DGGs and their first-order interactors. Finally, we illustrate the modifying effect of polygenic scores for platelet count and thrombosis risk on disease severity in participants carrying rare variants in TUBB1 or PROC and PROS1, respectively. Our findings demonstrate the power of association analyses using large population datasets in improving pathogenicity classifications of rare variants.


Subject(s)
Genome-Wide Association Study , Thrombosis , Humans , Biological Specimen Banks , Hemostasis , Hemorrhage/genetics , Rare Diseases
2.
Nucleic Acids Res ; 50(D1): D578-D586, 2022 01 07.
Article in English | MEDLINE | ID: mdl-34718729

ABSTRACT

The Complex Portal (www.ebi.ac.uk/complexportal) is a manually curated, encyclopaedic database of macromolecular complexes with known function from a range of model organisms. It summarizes complex composition, topology and function along with links to a large range of domain-specific resources (i.e. wwPDB, EMDB and Reactome). Since the last update in 2019, we have produced a first draft complexome for Escherichia coli, maintained and updated that of Saccharomyces cerevisiae, added over 40 coronavirus complexes and increased the human complexome to over 1100 complexes that include approximately 200 complexes that act as targets for viral proteins or are part of the immune system. The display of protein features in ComplexViewer has been improved and the participant table is now colour-coordinated with the nodes in ComplexViewer. Community collaboration has expanded, for example by contributing to an analysis of putative transcription cofactors and providing data accessible to semantic web tools through Wikidata which is now populated with manually curated Complex Portal content through a new bot. Our data license is now CC0 to encourage data reuse. Users are encouraged to get in touch, provide us with feedback and send curation requests through the 'Support' link.


Subject(s)
Data Curation/methods , Databases, Protein , Multiprotein Complexes/chemistry , Coronavirus/chemistry , Data Visualization , Databases, Chemical , Enzymes/chemistry , Enzymes/metabolism , Escherichia coli/chemistry , Humans , International Cooperation , Molecular Sequence Annotation , Multiprotein Complexes/metabolism , User-Computer Interface
3.
Nucleic Acids Res ; 50(D1): D648-D653, 2022 01 07.
Article in English | MEDLINE | ID: mdl-34761267

ABSTRACT

The IntAct molecular interaction database (https://www.ebi.ac.uk/intact) is a curated resource of molecular interactions, derived from the scientific literature and from direct data depositions. As of August 2021, IntAct provides more than one million binary interactions, curated by twelve global partners of the International Molecular Exchange consortium, for which the IntAct database provides a shared curation and dissemination platform. The IMEx curation policy has always emphasised a fine-grained data and curation model, aiming to capture the relevant experimental detail essential for the interpretation of the provided molecular interaction data. Here, we present recent curation focus and progress, as well as a completely redeveloped website which presents IntAct data in a much more user-friendly and detailed way.


Subject(s)
Databases, Protein , Protein Interaction Maps/genetics , Software , Humans , Protein Interaction Mapping/methods
4.
Emerg Infect Dis ; 29(12): 2471-2481, 2023 Dec.
Article in English | MEDLINE | ID: mdl-37987585

ABSTRACT

Mycoplasma spp. are wall-less bacteria able to infect mammals and are classified as hemotropic (hemoplasma) and nonhemotropic. In aquatic mammals, hemoplasma have been reported in California sea lions (Zalophus californianus) and river dolphins (Inia spp.). We investigated Mycoplasma spp. in blood samples of West Indian manatees (Trichechus manatus), pinnipeds (5 species), and marine cetaceans (18 species) that stranded or were undergoing rehabilitation in Brazil during 2002-2022. We detected Mycoplasma in blood of 18/130 (14.8%) cetaceans and 3/18 (16.6%) pinnipeds. All tested manatees were PCR-negative for Mycoplasma. Our findings indicate that >2 different hemoplasma species are circulating in cetaceans. The sequences from pinnipeds were similar to previously described sequences. We also detected a nonhemotropic Mycoplasma in 2 Franciscana dolphins (Pontoporia blainvillei) that might be associated with microscopic lesions. Because certain hemoplasmas can cause disease and death in immunosuppressed mammals, the bacteria could have conservation implications for already endangered aquatic mammals.


Subject(s)
Caniformia , Dolphins , Mycoplasma Infections , Mycoplasma , Animals , Mycoplasma/genetics , Brazil/epidemiology , Mycoplasma Infections/epidemiology , Mycoplasma Infections/veterinary , Mycoplasma Infections/microbiology , Mammals , RNA, Ribosomal, 16S
5.
Nucleic Acids Res ; 49(6): 3156-3167, 2021 04 06.
Article in English | MEDLINE | ID: mdl-33677561

ABSTRACT

The EMBL-EBI Complex Portal is a knowledgebase of macromolecular complexes providing persistent stable identifiers. Entries are linked to literature evidence and provide details of complex membership, function, structure and complex-specific Gene Ontology annotations. Data are freely available and downloadable in HUPO-PSI community standards and missing entries can be requested for curation. In collaboration with Saccharomyces Genome Database and UniProt, the yeast complexome, a compendium of all known heteromeric assemblies from the model organism Saccharomyces cerevisiae, was curated. This expansion of knowledge and scope has led to a 50% increase in curated complexes compared to the previously published dataset, CYC2008. The yeast complexome is used as a reference resource for the analysis of complexes from large-scale experiments. Our analysis showed that genes coding for proteins in complexes tend to have more genetic interactions, are co-expressed with more genes, are more multifunctional, localize more often in the nucleus, and are more often involved in nucleic acid-related metabolic processes and processes where large machineries are the predominant functional drivers. A comparison to genetic interactions showed that about 40% of expanded co-complex pairs also have genetic interactions, suggesting strong functional links between complex members.


Subject(s)
Saccharomyces cerevisiae Proteins/metabolism , Saccharomyces cerevisiae/metabolism , Datasets as Topic , Gene Ontology , Knowledge Bases , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae Proteins/genetics
6.
Bioinformatics ; 37(20): 3684-3685, 2021 Oct 25.
Article in English | MEDLINE | ID: mdl-33961020

ABSTRACT

SUMMARY: IntAct App is a Cytoscape 3 application that grants in-depth access to IntAct's molecular interaction data. It build networks where nodes are interacting molecules (mainly proteins, but also genes, RNA, chemicals…) and edges represent evidence of interaction. Users can query a network by providing its molecules, identified by different fields and optionally include all their interacting partners in the resulting network. The app offers three visualizations: one only displaying interactions, another representing every evidence and the last one emphasizing evidence where mutated versions of proteins were used. Users can also filter networks and click on nodes and edges to access all their related details. Finally, the application supports automation of its main features via Cytoscape commands. AVAILABILITY AND IMPLEMENTATION: Implementation available at https://apps.cytoscape.org/apps/intactapp, while the source code is available at https://github.com/EBI-IntAct/IntactApp.

7.
Bioinformatics ; 36(24): 5712-5718, 2021 04 05.
Article in English | MEDLINE | ID: mdl-32637990

ABSTRACT

MOTIVATION: A large variety of molecular interactions occurs between biomolecular components in cells. When a molecular interaction results in a regulatory effect, exerted by one component onto a downstream component, a so-called 'causal interaction' takes place. Causal interactions constitute the building blocks in our understanding of larger regulatory networks in cells. These causal interactions and the biological processes they enable (e.g. gene regulation) need to be described with a careful appreciation of the underlying molecular reactions. A proper description of this information enables archiving, sharing and reuse by humans and for automated computational processing. Various representations of causal relationships between biological components are currently used in a variety of resources. RESULTS: Here, we propose a checklist that accommodates current representations, called the Minimum Information about a Molecular Interaction CAusal STatement (MI2CAST). This checklist defines both the required core information, as well as a comprehensive set of other contextual details valuable to the end user and relevant for reusing and reproducing causal molecular interaction information. The MI2CAST checklist can be used as reporting guidelines when annotating and curating causal statements, while fostering uniformity and interoperability of the data across resources. AVAILABILITY AND IMPLEMENTATION: The checklist together with examples is accessible at https://github.com/MI2CAST/MI2CAST. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Software , Causality , Humans
8.
J Cell Sci ; 132(17)2019 09 06.
Article in English | MEDLINE | ID: mdl-31391242

ABSTRACT

The muscle-specific RING-finger protein MuRF1 (also known as TRIM63) constitutes a bona fide ubiquitin ligase that routes proteins like several different myosin heavy chain proteins (MyHC) to proteasomal degradation during muscle atrophy. In two unbiased screens, we identified DCAF8 as a new MuRF1-binding partner. MuRF1 physically interacts with DCAF8 and both proteins localize to overlapping structures in muscle cells. Importantly, similar to what is seen for MuRF1, DCAF8 levels increase during atrophy, and the downregulation of either protein substantially impedes muscle wasting and MyHC degradation in C2C12 myotubes, a model system for muscle differentiation and atrophy. DCAF proteins typically serve as substrate receptors for cullin 4-type (Cul4) ubiquitin ligases (CRL), and we demonstrate that DCAF8 and MuRF1 associate with the subunits of such a protein complex. Because genetic downregulation of DCAF8 and inhibition of cullin activity also impair myotube atrophy in C2C12 cells, our data imply that the DCAF8 promotes muscle wasting by targeting proteins like MyHC as an integral substrate receptor of a Cul4A-containing ring ubiquitin ligase complex (CRL4A).This article has an associated First Person interview with the first author of the paper.


Subject(s)
Muscle Proteins/metabolism , Muscular Atrophy/metabolism , Tripartite Motif Proteins/metabolism , Ubiquitin-Protein Ligases/metabolism , Animals , COS Cells , Carrier Proteins , Chlorocebus aethiops , Humans , Mice , Muscular Atrophy/enzymology , Rats , Transfection
9.
Nucleic Acids Res ; 47(D1): D550-D558, 2019 01 08.
Article in English | MEDLINE | ID: mdl-30357405

ABSTRACT

The Complex Portal (www.ebi.ac.uk/complexportal) is a manually curated, encyclopaedic database that collates and summarizes information on stable, macromolecular complexes of known function. It captures complex composition, topology and function and links out to a large range of domain-specific resources that hold more detailed data, such as PDB or Reactome. We have made several significant improvements since our last update, including improving compliance to the FAIR data principles by providing complex-specific, stable identifiers that include versioning. Protein complexes are now available from 20 species for download in standards-compliant formats such as PSI-XML, MI-JSON and ComplexTAB or can be accessed via an improved REST API. A component-based JS front-end framework has been implemented to drive a new website and this has allowed the use of APIs from linked services to import and visualize information such as the 3D structure of protein complexes, its role in reactions and pathways and the co-expression of complex components in the tissues of multi-cellular organisms. A first draft of the complete complexome of Saccharomyces cerevisiae is now available to browse and download.


Subject(s)
Databases, Protein , Multiprotein Complexes/chemistry , Animals , Computer Graphics , Humans , Macromolecular Substances/chemistry , Mice , Multiprotein Complexes/metabolism , Nucleic Acids/chemistry , Protein Conformation
10.
Nucleic Acids Res ; 42(Database issue): D358-63, 2014 Jan.
Article in English | MEDLINE | ID: mdl-24234451

ABSTRACT

IntAct (freely available at http://www.ebi.ac.uk/intact) is an open-source, open data molecular interaction database populated by data either curated from the literature or from direct data depositions. IntAct has developed a sophisticated web-based curation tool, capable of supporting both IMEx- and MIMIx-level curation. This tool is now utilized by multiple additional curation teams, all of whom annotate data directly into the IntAct database. Members of the IntAct team supply appropriate levels of training, perform quality control on entries and take responsibility for long-term data maintenance. Recently, the MINT and IntAct databases decided to merge their separate efforts to make optimal use of limited developer resources and maximize the curation output. All data manually curated by the MINT curators have been moved into the IntAct database at EMBL-EBI and are merged with the existing IntAct dataset. Both IntAct and MINT are active contributors to the IMEx consortium (http://www.imexconsortium.org).


Subject(s)
Databases, Protein , Protein Interaction Mapping , Internet , Software
11.
PLoS Genet ; 9(3): e1003398, 2013 Mar.
Article in English | MEDLINE | ID: mdl-23555304

ABSTRACT

Essentially all biological processes depend on protein-protein interactions (PPIs). Timing of such interactions is crucial for regulatory function. Although circadian (~24-hour) clocks constitute fundamental cellular timing mechanisms regulating important physiological processes, PPI dynamics on this timescale are largely unknown. Here, we identified 109 novel PPIs among circadian clock proteins via a yeast-two-hybrid approach. Among them, the interaction of protein phosphatase 1 and CLOCK/BMAL1 was found to result in BMAL1 destabilization. We constructed a dynamic circadian PPI network predicting the PPI timing using circadian expression data. Systematic circadian phenotyping (RNAi and overexpression) suggests a crucial role for components involved in dynamic interactions. Systems analysis of a global dynamic network in liver revealed that interacting proteins are expressed at similar times likely to restrict regulatory interactions to specific phases. Moreover, we predict that circadian PPIs dynamically connect many important cellular processes (signal transduction, cell cycle, etc.) contributing to temporal organization of cellular physiology in an unprecedented manner.


Subject(s)
CLOCK Proteins , Circadian Clocks/genetics , Circadian Rhythm/genetics , Protein Interaction Maps/genetics , ARNTL Transcription Factors/genetics , ARNTL Transcription Factors/metabolism , CLOCK Proteins/genetics , CLOCK Proteins/metabolism , Cell Cycle/genetics , Cell Cycle/physiology , ErbB Receptors/metabolism , HEK293 Cells , Humans , Protein Phosphatase 1/genetics , Protein Phosphatase 1/metabolism , Signal Transduction
12.
Proteomics ; 15(8): 1390-404, 2015 Apr.
Article in English | MEDLINE | ID: mdl-25648416

ABSTRACT

Molecular interaction databases are essential resources that enable access to a wealth of information on associations between proteins and other biomolecules. Network graphs generated from these data provide an understanding of the relationships between different proteins in the cell, and network analysis has become a widespread tool supporting -omics analysis. Meaningfully representing this information remains far from trivial and different databases strive to provide users with detailed records capturing the experimental details behind each piece of interaction evidence. A targeted curation approach is necessary to transfer published data generated by primarily low-throughput techniques into interaction databases. In this review we present an example highlighting the value of both targeted curation and the subsequent effective visualization of detailed features of manually curated interaction information. We have curated interactions involving LRRK2, a protein of largely unknown function linked to familial forms of Parkinson's disease, and hosted the data in the IntAct database. This LRRK2-specific dataset was then used to produce different visualization examples highlighting different aspects of the data: the level of confidence in the interaction based on orthogonal evidence, those interactions found under close-to-native conditions, and the enzyme-substrate relationships in different in vitro enzymatic assays. Finally, pathway annotation taken from the Reactome database was overlaid on top of interaction networks to bring biological functional context to interaction maps.


Subject(s)
Protein Interaction Maps , Protein Serine-Threonine Kinases/physiology , Animals , Computer Graphics , Databases, Protein , Humans , Leucine-Rich Repeat Serine-Threonine Protein Kinase-2 , Molecular Sequence Annotation , Parkinson Disease/metabolism , Proteomics/methods , Software
13.
Nucleic Acids Res ; 41(3): 1496-507, 2013 Feb 01.
Article in English | MEDLINE | ID: mdl-23275563

ABSTRACT

The yeast two-hybrid (Y2H) system is the most widely applied methodology for systematic protein-protein interaction (PPI) screening and the generation of comprehensive interaction networks. We developed a novel Y2H interaction screening procedure using DNA microarrays for high-throughput quantitative PPI detection. Applying a global pooling and selection scheme to a large collection of human open reading frames, proof-of-principle Y2H interaction screens were performed for the human neurodegenerative disease proteins huntingtin and ataxin-1. Using systematic controls for unspecific Y2H results and quantitative benchmarking, we identified and scored a large number of known and novel partner proteins for both huntingtin and ataxin-1. Moreover, we show that this parallelized screening procedure and the global inspection of Y2H interaction data are uniquely suited to define specific PPI patterns and their alteration by disease-causing mutations in huntingtin and ataxin-1. This approach takes advantage of the specificity and flexibility of DNA microarrays and of the existence of solid-related statistical methods for the analysis of DNA microarray data, and allows a quantitative approach toward interaction screens in human and in model organisms.


Subject(s)
Oligonucleotide Array Sequence Analysis/methods , Two-Hybrid System Techniques , Ataxin-1 , Ataxins , Humans , Huntingtin Protein , Mutation , Nerve Tissue Proteins/genetics , Nerve Tissue Proteins/metabolism , Nuclear Proteins/genetics , Nuclear Proteins/metabolism , Open Reading Frames , Protein Interaction Maps , Yeasts/genetics
14.
Nucleic Acids Res ; 40(Database issue): D841-6, 2012 Jan.
Article in English | MEDLINE | ID: mdl-22121220

ABSTRACT

IntAct is an open-source, open data molecular interaction database populated by data either curated from the literature or from direct data depositions. Two levels of curation are now available within the database, with both IMEx-level annotation and less detailed MIMIx-compatible entries currently supported. As from September 2011, IntAct contains approximately 275,000 curated binary interaction evidences from over 5000 publications. The IntAct website has been improved to enhance the search process and in particular the graphical display of the results. New data download formats are also available, which will facilitate the inclusion of IntAct's data in the Semantic Web. IntAct is an active contributor to the IMEx consortium (http://www.imexconsortium.org). IntAct source code and data are freely available at http://www.ebi.ac.uk/intact.


Subject(s)
Databases, Protein , Protein Interaction Mapping , Computer Graphics , Genes , Internet , Molecular Sequence Annotation , Sequence Analysis, Protein , Software
15.
Database (Oxford) ; 20232023 10 11.
Article in English | MEDLINE | ID: mdl-37819683

ABSTRACT

In recent years, a huge amount of data on ncRNA interactions has been described in scientific papers and databases. Although considerable effort has been made to annotate the available knowledge in public repositories, there are still significant discrepancies in how different resources capture and interpret data on ncRNA functional and physical associations. In the present paper, we present a collection of microRNA-mRNA interactions annotated from the scientific literature following recognized standard criteria and focused on microRNAs, which regulate genes associated with rare diseases as a case study. The list of protein-coding genes with a known role in specific rare diseases was retrieved from the Genome England PanelApp, and associated microRNA-mRNA interactions were annotated in the IntAct database and compared with other datasets. RNAcentral identifiers were used for unambiguous, stable identification of ncRNAs. The information about the interaction was enhanced by a detailed description of the cell types and experimental conditions, providing a computer-interpretable summary of the published data, integrated with the huge amount of protein interactions already gathered in the database. Furthermore, for each interaction, the binding sites of the microRNA are precisely mapped on a well-defined mRNA transcript of the target gene. This information is crucial to conceive and design optimal microRNA mimics or inhibitors to interfere in vivo with a deregulated process. As these approaches become more feasible, high-quality, reliable networks of microRNA interactions are needed to help, for instance, in the selection of the best target to be inhibited and to predict potential secondary off-target effects. Database URL https://www.ebi.ac.uk/intact.


Subject(s)
MicroRNAs , Humans , MicroRNAs/genetics , MicroRNAs/metabolism , Rare Diseases/genetics , RNA, Untranslated , Databases, Factual , RNA, Messenger/genetics
16.
Nat Genet ; 55(3): 389-398, 2023 03.
Article in English | MEDLINE | ID: mdl-36823319

ABSTRACT

Interacting proteins tend to have similar functions, influencing the same organismal traits. Interaction networks can be used to expand the list of candidate trait-associated genes from genome-wide association studies. Here, we performed network-based expansion of trait-associated genes for 1,002 human traits showing that this recovers known disease genes or drug targets. The similarity of network expansion scores identifies groups of traits likely to share an underlying genetic and biological process. We identified 73 pleiotropic gene modules linked to multiple traits, enriched in genes involved in processes such as protein ubiquitination and RNA processing. In contrast to gene deletion studies, pleiotropy as defined here captures specifically multicellular-related processes. We show examples of modules linked to human diseases enriched in genes with known pathogenic variants that can be used to map targets of approved drugs for repurposing. Finally, we illustrate the use of network expansion scores to study genes at inflammatory bowel disease genome-wide association study loci, and implicate inflammatory bowel disease-relevant genes with strong functional and genetic support.


Subject(s)
Cell Biology , Cells , Disease , Genetic Association Studies , Genetic Pleiotropy , Genetic Association Studies/methods , Humans , Ubiquitination/genetics , RNA Processing, Post-Transcriptional/genetics , Cells/metabolism , Cells/pathology , Drug Repositioning/methods , Drug Repositioning/trends , Disease/genetics , Inflammatory Bowel Diseases/genetics , Inflammatory Bowel Diseases/pathology , Genome-Wide Association Study , Phenotype , Autoimmune Diseases/genetics , Autoimmune Diseases/pathology
17.
J Proteome Res ; 11(4): 2014-31, 2012 Apr 06.
Article in English | MEDLINE | ID: mdl-22385417

ABSTRACT

The advent of the "omics" era in biology research has brought new challenges and requires the development of novel strategies to answer previously intractable questions. Molecular interaction networks provide a framework to visualize cellular processes, but their complexity often makes their interpretation an overwhelming task. The inherently artificial nature of interaction detection methods and the incompleteness of currently available interaction maps call for a careful and well-informed utilization of this valuable data. In this tutorial, we aim to give an overview of the key aspects that any researcher needs to consider when working with molecular interaction data sets and we outline an example for interactome analysis. Using the molecular interaction database IntAct, the software platform Cytoscape, and its plugins BiNGO and clusterMaker, and taking as a starting point a list of proteins identified in a mass spectrometry-based proteomics experiment, we show how to build, visualize, and analyze a protein-protein interaction network.


Subject(s)
Computational Biology/methods , Database Management Systems , Protein Interaction Mapping/methods , Proteomics/methods , Animals , Cluster Analysis , Databases, Protein , High-Throughput Screening Assays/methods , Humans , Protein Interaction Maps , Proteomics/education , User-Computer Interface
18.
Methods Mol Biol ; 2449: 27-42, 2022.
Article in English | MEDLINE | ID: mdl-35507258

ABSTRACT

Molecular interaction databases aim to systematically capture and organize the experimental interaction information described in the scientific literature. These data can then be used to perform network analysis, to assign putative roles to uncharacterized proteins and to investigate their involvement in cellular pathways.This chapter gives a brief overview of publicly available molecular interaction databases and focuses on the members of the IMEx Consortium, on their curation policies and standard data formats. All of the goals achieved by IMEx databases over the last 15 years, the data types provided and the many different ways in which such data can be utilized by the research community, are described in detail. The IMEx databases curate molecular interaction data to the highest caliber, following a detailed curation model and supplying rich metadata by employing common curation rules and harmonized standards. The IMEx Consortium provides comprehensively annotated molecular interaction data integrated into a single, non-redundant, open access dataset.


Subject(s)
Protein Interaction Mapping , Proteins , Data Management , Databases, Chemical , Databases, Protein , Proteins/metabolism
19.
Biochim Biophys Acta ; 1804(4): 839-45, 2010 Apr.
Article in English | MEDLINE | ID: mdl-20036764

ABSTRACT

We have previously shown that glutaredoxin 2 (Grx2) from Saccharomyces cerevisiae localizes at 3 different subcellular compartments, cytosol, mitochondrial matrix and outer membrane, as the result of different postranslational processing of one single gene. Having set the mechanism responsible for this remarkable phenomenon, we have now aimed at defining whether this diversity of subcellular localizations correlates with differences in structure and function of the Grx2 isoforms. We have determined the N-terminal sequence of the soluble mitochondrial matrix Grx2 by mass spectrometry and have determined the exact cleavage site by Mitochondrial Processing Peptidase (MPP). As a consequence of this cleavage, the mitochondrial matrix Grx2 isoform possesses a basic tetrapeptide extension at the N-terminus compared to the cytosolic form. A functional relationship to this structural difference is that mitochondrial Grx2 displays a markedly higher activity in the catalysis of GSSG reduction by the mitochondrial dithiol dihydrolipoamide. We have prepared Grx2 mutants affected on key residues inside the presequence to direct the protein to one single cellular compartment; either the cytosol, the mitochondrial membrane or the matrix and have analyzed their functional phenotypes. Strains expressing Grx2 only in the cytosol are equally sensitive to H(2)O(2) as strains lacking the gene, whereas those expressing Grx2 exclusively in the mitochondrial matrix are more resistant. Mutations on key basic residues drastically affect the cellular fate of the protein, showing that evolutionary diversification of Grx2 structural and functional properties are strictly dependent on the sequence of the targeting signal peptide.


Subject(s)
Glutaredoxins/chemistry , Glutaredoxins/metabolism , Saccharomyces cerevisiae Proteins/chemistry , Saccharomyces cerevisiae Proteins/metabolism , Amino Acid Sequence , Base Sequence , Binding Sites/genetics , DNA Primers/genetics , DNA, Fungal/genetics , Glutaredoxins/genetics , Mitochondria/enzymology , Molecular Sequence Data , Mutagenesis, Site-Directed , Mutant Proteins/chemistry , Mutant Proteins/genetics , Mutant Proteins/metabolism , Protein Processing, Post-Translational , RNA, Fungal/genetics , RNA, Fungal/metabolism , RNA, Messenger/genetics , RNA, Messenger/metabolism , Saccharomyces cerevisiae/enzymology , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae Proteins/genetics , Subcellular Fractions/enzymology , Tandem Mass Spectrometry
20.
Biochim Biophys Acta Gene Regul Mech ; 1864(10): 194749, 2021 10.
Article in English | MEDLINE | ID: mdl-34425241

ABSTRACT

The domain of transcription regulation has been notoriously difficult to annotate in the Gene Ontology, partly because of the intricacies of gene regulation which involve molecular interactions with DNA as well as amongst protein complexes. The molecular function 'transcription coregulator activity' is a part of the biological process 'regulation of transcription, DNA-templated' that occurs in the cellular component 'chromatin'. It can mechanistically link sequence-specific DNA-binding transcription factor (dbTF) regulatory DNA target sites to coactivator and corepressor target sites through the molecular function 'cis-regulatory region sequence-specific DNA binding'. Many questions arise about transcription coregulators (coTF). Here, we asked how many unannotated, putative coregulators can be identified in protein complexes? Therefore, we mined the CORUM and hu.MAP protein complex databases with known and strongly presumed human transcription coregulators. In addition, we trawled the BioGRID and IntAct molecular interaction databases for interactors of the known 1457 human dbTFs annotated by the GREEKC and GO consortia. This yielded 1093 putative transcription factor coregulator complex subunits, of which 954 interact directly with a dbTF. This substantially expands the set of coTFs that could be annotated to 'transcription coregulator activity' and sets the stage for renewed annotation and wet-lab research efforts. To this end, we devised a prioritisation score based on existing GO annotations of already curated transcription coregulators as well as interactome representation. Since all the proteins that we mined are parts of protein complexes, we propose to concomitantly engage in annotation of the putative transcription coregulator-containing complexes in the Complex Portal database.


Subject(s)
DNA-Binding Proteins/metabolism , Transcription Factors/metabolism , Base Sequence , DNA/chemistry , Data Mining , Databases, Genetic , Gene Expression Regulation , Humans , Protein Interaction Mapping , Protein Subunits/metabolism , Transcription, Genetic
SELECTION OF CITATIONS
SEARCH DETAIL