RESUMO
Global insights into cellular organization and genome function require comprehensive understanding of the interactome networks that mediate genotype-phenotype relationships1,2. Here we present a human 'all-by-all' reference interactome map of human binary protein interactions, or 'HuRI'. With approximately 53,000 protein-protein interactions, HuRI has approximately four times as many such interactions as there are high-quality curated interactions from small-scale studies. The integration of HuRI with genome3, transcriptome4 and proteome5 data enables cellular function to be studied within most physiological or pathological cellular contexts. We demonstrate the utility of HuRI in identifying the specific subcellular roles of protein-protein interactions. Inferred tissue-specific networks reveal general principles for the formation of cellular context-specific functions and elucidate potential molecular mechanisms that might underlie tissue-specific phenotypes of Mendelian diseases. HuRI is a systematic proteome-wide reference that links genomic variation to phenotypic outcomes.
Assuntos
Proteoma/metabolismo , Espaço Extracelular/metabolismo , Humanos , Especificidade de Órgãos , Mapeamento de Interação de ProteínasRESUMO
How do aberrations in widely expressed genes lead to tissue-selective hereditary diseases? Previous attempts to answer this question were limited to testing a few candidate mechanisms. To answer this question at a larger scale, we developed "Tissue Risk Assessment of Causality by Expression" (TRACE), a machine learning approach to predict genes that underlie tissue-selective diseases and selectivity-related features. TRACE utilized 4,744 biologically interpretable tissue-specific gene features that were inferred from heterogeneous omics datasets. Application of TRACE to 1,031 disease genes uncovered known and novel selectivity-related features, the most common of which was previously overlooked. Next, we created a catalog of tissue-associated risks for 18,927 protein-coding genes (https://netbio.bgu.ac.il/trace/). As proof-of-concept, we prioritized candidate disease genes identified in 48 rare-disease patients. TRACE ranked the verified disease gene among the patient's candidate genes significantly better than gene prioritization methods that rank by gene constraint or tissue expression. Thus, tissue selectivity combined with machine learning enhances genetic and clinical understanding of hereditary diseases.
Assuntos
Aprendizado de Máquina , Doenças Raras , Humanos , Doenças Raras/genética , Medição de Risco , CausalidadeRESUMO
MOTIVATION: Differential network analysis, designed to highlight network changes between conditions, is an important paradigm in network biology. However, differential network analysis methods have been typically designed to compare between two conditions and were rarely applied to multiple protein interaction networks (interactomes). Importantly, large-scale benchmarks for their evaluation have been lacking. RESULTS: Here, we present a framework for assessing the ability of differential network analysis of multiple human tissue interactomes to highlight tissue-selective processes and disorders. For this, we created a benchmark of 6499 curated tissue-specific Gene Ontology biological processes. We applied five methods, including four differential network analysis methods, to construct weighted interactomes for 34 tissues. Rigorous assessment of this benchmark revealed that differential analysis methods perform well in revealing tissue-selective processes (AUCs of 0.82-0.9). Next, we applied differential network analysis to illuminate the genes underlying tissue-selective hereditary disorders. For this, we curated a dataset of 1305 tissue-specific hereditary disorders and their manifesting tissues. Focusing on subnetworks containing the top 1% differential interactions in disease-relevant tissue interactomes revealed significant enrichment for disorder-causing genes in 18.6% of the cases, with a significantly high success rate for blood, nerve, muscle and heart diseases. SUMMARY: Altogether, we offer a framework that includes expansive manually curated datasets of tissue-selective processes and disorders to be used as benchmarks or to illuminate tissue-selective processes and genes. Our results demonstrate that differential analysis of multiple human tissue interactomes is a powerful tool for highlighting processes and genes with tissue-selective functionality and clinical impact. AVAILABILITY AND IMPLEMENTATION: Datasets are available as part of the Supplementary data. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Fenômenos Biológicos , Mapas de Interação de Proteínas , Ontologia Genética , Redes Reguladoras de Genes , HumanosRESUMO
ResponseNet v.3 is an enhanced version of ResponseNet, a web server that is designed to highlight signaling and regulatory pathways connecting user-defined proteins and genes by using the ResponseNet network optimization approach (http://netbio.bgu.ac.il/respnet). Users run ResponseNet by defining source and target sets of proteins, genes and/or microRNAs, and by specifying a molecular interaction network (interactome). The output of ResponseNet is a sparse, high-probability interactome subnetwork that connects the two sets, thereby revealing additional molecules and interactions that are involved in the studied condition. In recent years, massive efforts were invested in profiling the transcriptomes of human tissues, enabling the inference of human tissue interactomes. ResponseNet v.3 expands ResponseNet2.0 by harnessing â¼11,600 RNA-sequenced human tissue profiles made available by the Genotype-Tissue Expression consortium, to support context-specific analysis of 44 human tissues. Thus, ResponseNet v.3 allows users to illuminate the signaling and regulatory pathways potentially active in the context of a specific tissue, and to compare them with active pathways in other tissues. In the era of precision medicine, such analyses open the door for tissue- and patient-specific analyses of pathways and diseases.
Assuntos
Genoma Humano/genética , MicroRNAs/genética , Mapas de Interação de Proteínas , Proteínas/metabolismo , Software , Bases de Dados de Ácidos Nucleicos , Bases de Dados de Proteínas , Redes Reguladoras de Genes/genética , Humanos , InternetRESUMO
DifferentialNet is a novel database that provides users with differential interactome analysis of human tissues (http://netbio.bgu.ac.il/diffnet/). Users query DifferentialNet by protein, and retrieve its differential protein-protein interactions (PPIs) per tissue via an interactive graphical interface. To compute differential PPIs, we integrated available data of experimentally detected PPIs with RNA-sequencing profiles of tens of human tissues gathered by the Genotype-Tissue Expression consortium (GTEx) and by the Human Protein Atlas (HPA). We associated each PPI with a score that reflects whether its corresponding genes were expressed similarly across tissues, or were up- or down-regulated in the selected tissue. By this, users can identify tissue-specific interactions, filter out PPIs that are relatively stable across tissues, and highlight PPIs that show relative changes across tissues. The differential PPIs can be used to identify tissue-specific processes and to decipher tissue-specific phenotypes. Moreover, they unravel processes that are tissue-wide yet tailored to the specific demands of each tissue.
Assuntos
Bases de Dados de Proteínas , Mapeamento de Interação de Proteínas/métodos , Proteínas/química , Software , Atlas como Assunto , Osso e Ossos/química , Osso e Ossos/metabolismo , Encéfalo/metabolismo , Feminino , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Internet , Rim/química , Rim/metabolismo , Pulmão/química , Pulmão/metabolismo , Masculino , Músculo Esquelético/química , Músculo Esquelético/metabolismo , Especificidade de Órgãos , Ovário/química , Ovário/metabolismo , Fenótipo , Próstata/química , Próstata/metabolismo , Proteínas/metabolismoRESUMO
The Saccharomyces cerevisiae kinase/adenosine triphosphatase Rio1 regulates rDNA transcription and segregation, pre-rRNA processing and small ribosomal subunit maturation. Other roles are unknown. When overexpressed, human ortholog RIOK1 drives tumor growth and metastasis. Likewise, RIOK1 promotes 40S ribosomal subunit biogenesis and has not been characterized globally. We show that Rio1 manages directly and via a series of regulators, an essential signaling network at the protein, chromatin and RNA levels. Rio1 orchestrates growth and division depending on resource availability, in parallel to the nutrient-activated Tor1 kinase. To define the Rio1 network, we identified its physical interactors, profiled its target genes/transcripts, mapped its chromatin-binding sites and integrated our data with yeast's protein-protein and protein-DNA interaction catalogs using network computation. We experimentally confirmed network components and localized Rio1 also to mitochondria and vacuoles. Via its network, Rio1 commands protein synthesis (ribosomal gene expression, assembly and activity) and turnover (26S proteasome expression), and impinges on metabolic, energy-production and cell-cycle programs. We find that Rio1 activity is conserved to humans and propose that pathological RIOK1 may fuel promiscuous transcription, ribosome production, chromosomal instability, unrestrained metabolism and proliferation; established contributors to cancer. Our study will advance the understanding of numerous processes, here revealed to depend on Rio1 activity.
Assuntos
Ciclo Celular/genética , Metabolismo Energético/genética , Proteínas Serina-Treonina Quinases/metabolismo , Proteínas de Saccharomyces cerevisiae/metabolismo , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Cromatina/metabolismo , Segregação de Cromossomos/genética , Mitocôndrias/genética , Fosfatidilinositol 3-Quinases/metabolismo , RNA Fúngico/genética , Subunidades Ribossômicas Menores de Eucariotos/metabolismo , Transcrição Gênica/genéticaRESUMO
RSRC1, whose polymorphism is associated with altered brain function in schizophrenia, is a member of the serine and arginine rich-related protein family. Through homozygosity mapping and whole exome sequencing we show that RSRC1 mutation causes an autosomal recessive syndrome of intellectual disability, aberrant behaviour, hypotonia and mild facial dysmorphism with normal brain MRI. Further, we show that RSRC1 is ubiquitously expressed, and that the RSRC1 mutation triggers nonsense-mediated mRNA decay of the RSRC1 transcript in patients' fibroblasts. Short hairpin RNA (shRNA)-mediated lentiviral silencing and overexpression of RSRC1 in SH-SY5Y cells demonstrated that RSRC1 has a role in alternative splicing and transcription regulation. Transcriptome profiling of RSRC1-silenced cells unravelled specific differentially expressed genes previously associated with intellectual disability, hypotonia and schizophrenia, relevant to the disease phenotype. Protein-protein interaction network modelling suggested possible intermediate interactions by which RSRC1 affects gene-specific differential expression. Patient-derived induced pluripotent stem cells, differentiated into neural progenitor cells, showed expression dynamics similar to the RSRC1-silenced SH-SY5Y model. Notably, patient neural progenitor cells had 9.6-fold downregulated expression of IGFBP3, whose brain expression is affected by MECP2, aberrant in Rett syndrome. Interestingly, Igfbp3-null mice have behavioural impairment, abnormal synaptic function and monoaminergic neurotransmission, likely correlating with the disease phenotype.
Assuntos
Processamento Alternativo/genética , Deficiências do Desenvolvimento/genética , Regulação para Baixo/genética , Proteína 3 de Ligação a Fator de Crescimento Semelhante à Insulina/metabolismo , Deficiência Intelectual/genética , Proteínas Nucleares/genética , Animais , Diferenciação Celular/genética , Linhagem Celular Transformada , Criança , Pré-Escolar , Consanguinidade , Deficiências do Desenvolvimento/complicações , Feminino , Seguimentos , Ontologia Genética , Humanos , Lactente , Deficiência Intelectual/complicações , Masculino , Camundongos , Camundongos Knockout , Células-Tronco Pluripotentes/metabolismo , RNA Interferente Pequeno/genética , RNA Interferente Pequeno/metabolismoRESUMO
Knowledge of the molecular interactions of human proteins within tissues is important for identifying their tissue-specific roles and for shedding light on tissue phenotypes. However, many protein-protein interactions (PPIs) have no tissue-contexts. The TissueNet database bridges this gap by associating experimentally-identified PPIs with human tissues that were shown to express both pair-mates. Users can select a protein and a tissue, and obtain a network view of the query protein and its tissue-associated PPIs. TissueNet v.2 is an updated version of the TissueNet database previously featured in NAR. It includes over 40 human tissues profiled via RNA-sequencing or protein-based assays. Users can select their preferred expression data source and interactively set the expression threshold for determining tissue-association. The output of TissueNet v.2 emphasizes qualitative and quantitative features of query proteins and their PPIs. The tissue-specificity view highlights tissue-specific and globally-expressed proteins, and the quantitative view highlights proteins that were differentially expressed in the selected tissue relative to all other tissues. Together, these views allow users to quickly assess the unique versus global functionality of query proteins. Thus, TissueNet v.2 offers an extensive, quantitative and user-friendly interface to study the roles of human proteins across tissues. TissueNet v.2 is available at http://netbio.bgu.ac.il/tissuenet.
Assuntos
Biologia Computacional/métodos , Bases de Dados de Proteínas , Mapeamento de Interação de Proteínas/métodos , Software , Humanos , Especificidade de ÓrgãosRESUMO
The identification of the molecular pathways active in specific contexts, such as disease states or drug responses, often requires an extensive view of the potential interactions between a subset of proteins. This view is not easily obtained: it requires the integration of context-specific protein list or expression data with up-to-date data of protein interactions that are typically spread across multiple databases. The MyProteinNet web server allows users to easily create such context-sensitive protein interaction networks. Users can automatically gather and consolidate data from up to 11 different databases to create a generic protein interaction network (interactome). They can score the interactions based on reliability and filter them by user-defined contexts including molecular expression and protein annotation. The output of MyProteinNet includes the generic and filtered interactome files, together with a summary of their network attributes. MyProteinNet is particularly geared toward building human tissue interactomes, by maintaining tissue expression profiles from multiple resources. The ability of MyProteinNet to facilitate the construction of up-to-date, context-specific interactomes and its applicability to 11 different organisms and to tens of human tissues, make it a powerful tool in meaningful analysis of protein networks. MyProteinNet is available at http://netbio.bgu.ac.il/myproteinnet.
Assuntos
Mapeamento de Interação de Proteínas/métodos , Software , Bases de Dados de Proteínas , Perfilação da Expressão Gênica , Ontologia Genética , Humanos , Internet , Anotação de Sequência MolecularRESUMO
As challenges to food security increase, the demand for lead genes for improving crop production is growing. However, genetic screens of plant mutants typically yield very low frequencies of desired phenotypes. Here, we present a powerful computational approach for selecting candidate genes for screening insertion mutants. We combined ranking of Arabidopsis thaliana regulatory genes according to their expression in response to multiple abiotic stresses (Multiple Stress [MST] score), with stress-responsive RNA co-expression network analysis to select candidate multiple stress regulatory (MSTR) genes. Screening of 62 T-DNA insertion mutants defective in candidate MSTR genes, for abiotic stress germination phenotypes yielded a remarkable hit rate of up to 62%; this gene discovery rate is 48-fold greater than that of other large-scale insertional mutant screens. Moreover, the MST score of these genes could be used to prioritize them for screening. To evaluate the contribution of the co-expression analysis, we screened 64 additional mutant lines of MST-scored genes that did not appear in the RNA co-expression network. The screening of these MST-scored genes yielded a gene discovery rate of 36%, which is much higher than that of classic mutant screens but not as high as when picking candidate genes from the co-expression network. The MSTR co-expression network that we created, AraSTressRegNet is publicly available at http://netbio.bgu.ac.il/arnet. This systems biology-based screening approach combining gene ranking and network analysis could be generally applicable to enhancing identification of genes regulating additional processes in plants and other organisms provided that suitable transcriptome data are available.
Assuntos
Arabidopsis/genética , Expressão Gênica , Redes Reguladoras de Genes , Genes de Plantas , Estresse Fisiológico/genética , Mutagênese Insercional , Análise de Sequência com Séries de OligonucleotídeosRESUMO
Genome sequencing and transcriptomic profiling are two widely used approaches for the identification of human disease pathways. However, each approach typically provides a limited view of disease pathways: Genome sequencing can identify disease-related mutations but rarely reveals their mode-of-action, while transcriptomic assays do not reveal the series of events that lead to the transcriptomic change. ResponseNet is an integrative network-optimization approach that we developed to fill these gaps by highlighting major signaling and regulatory molecular interaction paths that connect disease-related mutations and genes. The ResponseNet web-server provides a user-friendly interface to ResponseNet. Specifically, users can upload weighted lists of proteins and genes and obtain a sparse, weighted, molecular interaction subnetwork connecting them, that is biased toward regulatory and signaling pathways. ResponseNet2.0 enhances the functionality of the ResponseNet web-server in two important ways. First, it supports analysis of human data by offering a human interactome composed of proteins, genes and micro-RNAs. Second, it offers a new informative view of the output, including a randomization analysis, to help users assess the biological relevance of the output subnetwork. ResponseNet2.0 is available at http://netbio.bgu.ac.il/respnet .
Assuntos
Redes Reguladoras de Genes , Mapeamento de Interação de Proteínas , Transdução de Sinais , Software , Doença/genética , Humanos , InternetRESUMO
Knowledge of protein-protein interactions (PPIs) is important for identifying the functions of proteins and the processes they are involved in. Although data of human PPIs are easily accessible through several public databases, these databases do not specify the human tissues in which these PPIs take place. The TissueNet database of human tissue PPIs (http://netbio.bgu.ac.il/tissuenet/) associates each interaction with human tissues that express both pair mates. This was achieved by integrating current data of experimentally detected PPIs with extensive data of gene and protein expression across 16 main human tissues. Users can query TissueNet using a protein and retrieve its PPI partners per tissue, or using a PPI and retrieve the tissues expressing both pair mates. The graphical representation of the output highlights tissue-specific and tissue-wide PPIs. Thus, TissueNet provides a unique platform for assessing the roles of human proteins and their interactions across tissues.
Assuntos
Bases de Dados de Proteínas , Mapeamento de Interação de Proteínas , Perfilação da Expressão Gênica , Humanos , Internet , Mapas de Interação de Proteínas , Interface Usuário-ComputadorRESUMO
The sensitivity of the protein-folding environment to chaperone disruption can be highly tissue-specific. Yet, the organization of the chaperone system across physiological human tissues has received little attention. Through computational analyses of large-scale tissue transcriptomes, we unveil that the chaperone system is composed of core elements that are uniformly expressed across tissues, and variable elements that are differentially expressed to fit with tissue-specific requirements. We demonstrate via a proteomic analysis that the muscle-specific signature is functional and conserved. Core chaperones are significantly more abundant across tissues and more important for cell survival than variable chaperones. Together with variable chaperones, they form tissue-specific functional networks. Analysis of human organ development and aging brain transcriptomes reveals that these functional networks are established in development and decline with age. In this work, we expand the known functional organization of de novo versus stress-inducible eukaryotic chaperones into a layered core-variable architecture in multi-cellular organisms.