Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 41
Filter
Add more filters

Publication year range
1.
Nat Rev Genet ; 21(3): 137-150, 2020 03.
Article in English | MEDLINE | ID: mdl-31913361

ABSTRACT

Hundreds of heritable traits and diseases that are caused by germline aberrations in ubiquitously expressed genes manifest in a remarkably limited number of cell types and tissues across the body. Unravelling mechanisms that govern their tissue-specific manifestations is critical for our understanding of disease aetiologies and may direct efforts to develop treatments. Owing to recent advances in high-throughput technologies and open resources, data and tools are now available to approach this enigmatic phenomenon at large scales, both computationally and experimentally. Here, we discuss the large prevalence of tissue-selective traits and diseases, describe common molecular mechanisms underlying their tissue-selective manifestation and present computational strategies and publicly available resources for elucidating the molecular basis of their genotype-phenotype relationships.


Subject(s)
Genetic Predisposition to Disease , Databases, Genetic , Genotype , Humans , Infant, Newborn , Phenotype
2.
Nucleic Acids Res ; 51(W1): W478-W483, 2023 07 05.
Article in English | MEDLINE | ID: mdl-37207335

ABSTRACT

The distinct functions and phenotypes of human tissues and cells derive from the activity of biological processes that varies in a context-dependent manner. Here, we present the Process Activity (ProAct) webserver that estimates the preferential activity of biological processes in tissues, cells, and other contexts. Users can upload a differential gene expression matrix measured across contexts or cells, or use a built-in matrix of differential gene expression in 34 human tissues. Per context, ProAct associates gene ontology (GO) biological processes with estimated preferential activity scores, which are inferred from the input matrix. ProAct visualizes these scores across processes, contexts, and process-associated genes. ProAct also offers potential cell-type annotations for cell subsets, by inferring them from the preferential activity of 2001 cell-type-specific processes. Thus, ProAct output can highlight the distinct functions of tissues and cell types in various contexts, and can enhance cell-type annotation efforts. The ProAct webserver is available at https://netbio.bgu.ac.il/ProAct/.


Subject(s)
Biological Phenomena , Gene Expression Profiling , Software , Humans , Gene Ontology , Phenotype , Organ Specificity , Internet
3.
Mol Syst Biol ; 19(8): e11407, 2023 08 08.
Article in English | MEDLINE | ID: mdl-37232043

ABSTRACT

How do aberrations in widely expressed genes lead to tissue-selective hereditary diseases? Previous attempts to answer this question were limited to testing a few candidate mechanisms. To answer this question at a larger scale, we developed "Tissue Risk Assessment of Causality by Expression" (TRACE), a machine learning approach to predict genes that underlie tissue-selective diseases and selectivity-related features. TRACE utilized 4,744 biologically interpretable tissue-specific gene features that were inferred from heterogeneous omics datasets. Application of TRACE to 1,031 disease genes uncovered known and novel selectivity-related features, the most common of which was previously overlooked. Next, we created a catalog of tissue-associated risks for 18,927 protein-coding genes (https://netbio.bgu.ac.il/trace/). As proof-of-concept, we prioritized candidate disease genes identified in 48 rare-disease patients. TRACE ranked the verified disease gene among the patient's candidate genes significantly better than gene prioritization methods that rank by gene constraint or tissue expression. Thus, tissue selectivity combined with machine learning enhances genetic and clinical understanding of hereditary diseases.


Subject(s)
Machine Learning , Rare Diseases , Humans , Rare Diseases/genetics , Risk Assessment , Causality
4.
Proc Natl Acad Sci U S A ; 118(40)2021 10 05.
Article in English | MEDLINE | ID: mdl-34593629

ABSTRACT

Approximately 40% of human messenger RNAs (mRNAs) contain upstream open reading frames (uORFs) in their 5' untranslated regions. Some of these uORF sequences, thought to attenuate scanning ribosomes or lead to mRNA degradation, were recently shown to be translated, although the function of the encoded peptides remains unknown. Here, we show a uORF-encoded peptide that exhibits kinase inhibitory functions. This uORF, upstream of the protein kinase C-eta (PKC-η) main ORF, encodes a peptide (uPEP2) containing the typical PKC pseudosubstrate motif present in all PKCs that autoinhibits their kinase activity. We show that uPEP2 directly binds to and selectively inhibits the catalytic activity of novel PKCs but not of classical or atypical PKCs. The endogenous deletion of uORF2 or its overexpression in MCF-7 cells revealed that the endogenously translated uPEP2 reduces the protein levels of PKC-η and other novel PKCs and restricts cell proliferation. Functionally, treatment of breast cancer cells with uPEP2 diminished cell survival and their migration and synergized with chemotherapy by interfering with the response to DNA damage. Furthermore, in a xenograft of MDA-MB-231 breast cancer tumor in mice models, uPEP2 suppressed tumor progression, invasion, and metastasis. Tumor histology showed reduced proliferation, enhanced cell death, and lower protein expression levels of novel PKCs along with diminished phosphorylation of PKC substrates. Hence, our study demonstrates that uORFs may encode biologically active peptides beyond their role as translation regulators of their downstream ORFs. Together, we point to a unique function of a uORF-encoded peptide as a kinase inhibitor, pertinent to cancer therapy.


Subject(s)
Peptides/pharmacology , Protein Kinase C/antagonists & inhibitors , Protein Kinase Inhibitors/pharmacology , Amino Acid Sequence , Cell Line, Tumor , Humans , Open Reading Frames , Peptides/chemistry , Protein Kinase C/metabolism , Protein Kinase Inhibitors/chemistry , Substrate Specificity
5.
Bioinformatics ; 38(6): 1584-1592, 2022 03 04.
Article in English | MEDLINE | ID: mdl-35015838

ABSTRACT

MOTIVATION: The distinct functionalities of human tissues and cell types underlie complex phenotype-genotype relationships, yet often remain elusive. Harnessing the multitude of bulk and single-cell human transcriptomes while focusing on processes can help reveal these distinct functionalities. RESULTS: The Tissue-Process Activity (TiPA) method aims to identify processes that are preferentially active or under-expressed in specific contexts, by comparing the expression levels of process genes between contexts. We tested TiPA on 1579 tissue-specific processes and bulk tissue transcriptomes, finding that it performed better than another method. Next, we used TiPA to ask whether the activity of certain processes could underlie the tissue-specific manifestation of 1233 hereditary diseases. We found that 21% of the disease-causing genes indeed participated in such processes, thereby illuminating their genotype-phenotype relationships. Lastly, we applied TiPA to single-cell transcriptomes of 108 human cell types, revealing that process activities often match cell-type identities and can thus aid annotation efforts. Hence, differential activity of processes can highlight the distinct functionality of tissues and cells in a robust and meaningful manner. AVAILABILITY AND IMPLEMENTATION: TiPA code is available in GitHub (https://github.com/moranshar/TiPA). In addition, all data are available as part of the Supplementary Material. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Biological Phenomena , Transcriptome , Humans
6.
Bioinformatics ; 36(9): 2821-2828, 2020 05 01.
Article in English | MEDLINE | ID: mdl-31960892

ABSTRACT

MOTIVATION: Differential network analysis, designed to highlight network changes between conditions, is an important paradigm in network biology. However, differential network analysis methods have been typically designed to compare between two conditions and were rarely applied to multiple protein interaction networks (interactomes). Importantly, large-scale benchmarks for their evaluation have been lacking. RESULTS: Here, we present a framework for assessing the ability of differential network analysis of multiple human tissue interactomes to highlight tissue-selective processes and disorders. For this, we created a benchmark of 6499 curated tissue-specific Gene Ontology biological processes. We applied five methods, including four differential network analysis methods, to construct weighted interactomes for 34 tissues. Rigorous assessment of this benchmark revealed that differential analysis methods perform well in revealing tissue-selective processes (AUCs of 0.82-0.9). Next, we applied differential network analysis to illuminate the genes underlying tissue-selective hereditary disorders. For this, we curated a dataset of 1305 tissue-specific hereditary disorders and their manifesting tissues. Focusing on subnetworks containing the top 1% differential interactions in disease-relevant tissue interactomes revealed significant enrichment for disorder-causing genes in 18.6% of the cases, with a significantly high success rate for blood, nerve, muscle and heart diseases. SUMMARY: Altogether, we offer a framework that includes expansive manually curated datasets of tissue-selective processes and disorders to be used as benchmarks or to illuminate tissue-selective processes and genes. Our results demonstrate that differential analysis of multiple human tissue interactomes is a powerful tool for highlighting processes and genes with tissue-selective functionality and clinical impact. AVAILABILITY AND IMPLEMENTATION: Datasets are available as part of the Supplementary data. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Biological Phenomena , Protein Interaction Maps , Gene Ontology , Gene Regulatory Networks , Humans
7.
Nucleic Acids Res ; 47(W1): W242-W247, 2019 07 02.
Article in English | MEDLINE | ID: mdl-31114913

ABSTRACT

ResponseNet v.3 is an enhanced version of ResponseNet, a web server that is designed to highlight signaling and regulatory pathways connecting user-defined proteins and genes by using the ResponseNet network optimization approach (http://netbio.bgu.ac.il/respnet). Users run ResponseNet by defining source and target sets of proteins, genes and/or microRNAs, and by specifying a molecular interaction network (interactome). The output of ResponseNet is a sparse, high-probability interactome subnetwork that connects the two sets, thereby revealing additional molecules and interactions that are involved in the studied condition. In recent years, massive efforts were invested in profiling the transcriptomes of human tissues, enabling the inference of human tissue interactomes. ResponseNet v.3 expands ResponseNet2.0 by harnessing ∼11,600 RNA-sequenced human tissue profiles made available by the Genotype-Tissue Expression consortium, to support context-specific analysis of 44 human tissues. Thus, ResponseNet v.3 allows users to illuminate the signaling and regulatory pathways potentially active in the context of a specific tissue, and to compare them with active pathways in other tissues. In the era of precision medicine, such analyses open the door for tissue- and patient-specific analyses of pathways and diseases.


Subject(s)
Genome, Human/genetics , MicroRNAs/genetics , Protein Interaction Maps , Proteins/metabolism , Software , Databases, Nucleic Acid , Databases, Protein , Gene Regulatory Networks/genetics , Humans , Internet
8.
PLoS Genet ; 14(5): e1007327, 2018 05.
Article in English | MEDLINE | ID: mdl-29723191

ABSTRACT

A longstanding puzzle in human genetics is what limits the clinical manifestation of hundreds of hereditary diseases to certain tissues, while their causal genes are expressed throughout the human body. A general conception is that tissue-selective disease phenotypes emerge when masking factors operate in unaffected tissues, but are specifically absent or insufficient in disease-manifesting tissues. Although this conception has critical impact on the understanding of disease manifestation, it was never challenged in a systematic manner across a variety of hereditary diseases and affected tissues. Here, we address this gap in our understanding via rigorous analysis of the susceptibility of over 30 tissues to 112 tissue-selective hereditary diseases. We focused on the roles of paralogs of causal genes, which are presumably capable of compensating for their aberration. We show for the first time at large-scale via quantitative analysis of omics datasets that, preferentially in the disease-manifesting tissues, paralogs are under-expressed relative to causal genes in more than half of the diseases. This was observed for several susceptible tissues and for causal genes with varying number of paralogs, suggesting that imbalanced expression of paralogs increases tissue susceptibility. While for many diseases this imbalance stemmed from up-regulation of the causal gene in the disease-manifesting tissue relative to other tissues, it was often combined with down-regulation of its paralog. Notably in roughly 20% of the cases, this imbalance stemmed only from significant down-regulation of the paralog. Thus, dosage relationships between paralogs appear as important, yet currently under-appreciated, modifiers of disease manifestation.


Subject(s)
Gene Expression Profiling , Genes, Duplicate , Genetic Diseases, Inborn/genetics , Genetic Predisposition to Disease/genetics , Organ Specificity/genetics , Gene Dosage , Gene Duplication , Humans
9.
Bioinformatics ; 35(17): 3028-3037, 2019 09 01.
Article in English | MEDLINE | ID: mdl-30649201

ABSTRACT

MOTIVATION: The effectiveness of drugs tends to vary between patients. One of the well-known reasons for this phenomenon is genetic polymorphisms in drug target genes among patients. Here, we propose that differences in expression levels of drug target genes across individuals can also contribute to this phenomenon. RESULTS: To explore this hypothesis, we analyzed the expression variability of protein-coding genes, and particularly drug target genes, across individuals. For this, we developed a novel variability measure, termed local coefficient of variation (LCV), which ranks the expression variability of each gene relative to genes with similar expression levels. Unlike commonly used methods, LCV neutralizes expression levels biases without imposing any distribution over the variation and is robust to data incompleteness. Application of LCV to RNA-sequencing profiles of 19 human tissues and to target genes of 1076 approved drugs revealed that drug target genes were significantly more variable than protein-coding genes. Analysis of 113 drugs with available effectiveness scores showed that drugs targeting highly variable genes tended to be less effective in the population. Furthermore, comparison of approved drugs to drugs that were withdrawn from the market showed that withdrawn drugs targeted significantly more variable genes than approved drugs. Last, upon analyzing gender differences we found that the variability of drug target genes was similar between men and women. Altogether, our results suggest that expression variability of drug target genes could contribute to the variable responsiveness and effectiveness of drugs, and is worth considering during drug treatment and development. AVAILABILITY AND IMPLEMENTATION: LCV is available as a python script in GitHub (https://github.com/eyalsim/LCV). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Gene Expression , Humans , Sequence Analysis, RNA
10.
Nucleic Acids Res ; 46(D1): D522-D526, 2018 01 04.
Article in English | MEDLINE | ID: mdl-29069447

ABSTRACT

DifferentialNet is a novel database that provides users with differential interactome analysis of human tissues (http://netbio.bgu.ac.il/diffnet/). Users query DifferentialNet by protein, and retrieve its differential protein-protein interactions (PPIs) per tissue via an interactive graphical interface. To compute differential PPIs, we integrated available data of experimentally detected PPIs with RNA-sequencing profiles of tens of human tissues gathered by the Genotype-Tissue Expression consortium (GTEx) and by the Human Protein Atlas (HPA). We associated each PPI with a score that reflects whether its corresponding genes were expressed similarly across tissues, or were up- or down-regulated in the selected tissue. By this, users can identify tissue-specific interactions, filter out PPIs that are relatively stable across tissues, and highlight PPIs that show relative changes across tissues. The differential PPIs can be used to identify tissue-specific processes and to decipher tissue-specific phenotypes. Moreover, they unravel processes that are tissue-wide yet tailored to the specific demands of each tissue.


Subject(s)
Databases, Protein , Protein Interaction Mapping/methods , Proteins/chemistry , Software , Atlases as Topic , Bone and Bones/chemistry , Bone and Bones/metabolism , Brain/metabolism , Female , High-Throughput Nucleotide Sequencing , Humans , Internet , Kidney/chemistry , Kidney/metabolism , Lung/chemistry , Lung/metabolism , Male , Muscle, Skeletal/chemistry , Muscle, Skeletal/metabolism , Organ Specificity , Ovary/chemistry , Ovary/metabolism , Phenotype , Prostate/chemistry , Prostate/metabolism , Proteins/metabolism
11.
Nucleic Acids Res ; 46(15): 7586-7611, 2018 09 06.
Article in English | MEDLINE | ID: mdl-30011030

ABSTRACT

The Saccharomyces cerevisiae kinase/adenosine triphosphatase Rio1 regulates rDNA transcription and segregation, pre-rRNA processing and small ribosomal subunit maturation. Other roles are unknown. When overexpressed, human ortholog RIOK1 drives tumor growth and metastasis. Likewise, RIOK1 promotes 40S ribosomal subunit biogenesis and has not been characterized globally. We show that Rio1 manages directly and via a series of regulators, an essential signaling network at the protein, chromatin and RNA levels. Rio1 orchestrates growth and division depending on resource availability, in parallel to the nutrient-activated Tor1 kinase. To define the Rio1 network, we identified its physical interactors, profiled its target genes/transcripts, mapped its chromatin-binding sites and integrated our data with yeast's protein-protein and protein-DNA interaction catalogs using network computation. We experimentally confirmed network components and localized Rio1 also to mitochondria and vacuoles. Via its network, Rio1 commands protein synthesis (ribosomal gene expression, assembly and activity) and turnover (26S proteasome expression), and impinges on metabolic, energy-production and cell-cycle programs. We find that Rio1 activity is conserved to humans and propose that pathological RIOK1 may fuel promiscuous transcription, ribosome production, chromosomal instability, unrestrained metabolism and proliferation; established contributors to cancer. Our study will advance the understanding of numerous processes, here revealed to depend on Rio1 activity.


Subject(s)
Cell Cycle/genetics , Energy Metabolism/genetics , Protein Serine-Threonine Kinases/metabolism , Saccharomyces cerevisiae Proteins/metabolism , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae/metabolism , Chromatin/metabolism , Chromosome Segregation/genetics , Mitochondria/genetics , Phosphatidylinositol 3-Kinases/metabolism , RNA, Fungal/genetics , Ribosome Subunits, Small, Eukaryotic/metabolism , Transcription, Genetic/genetics
12.
Brain ; 141(4): 961-970, 2018 04 01.
Article in English | MEDLINE | ID: mdl-29522154

ABSTRACT

RSRC1, whose polymorphism is associated with altered brain function in schizophrenia, is a member of the serine and arginine rich-related protein family. Through homozygosity mapping and whole exome sequencing we show that RSRC1 mutation causes an autosomal recessive syndrome of intellectual disability, aberrant behaviour, hypotonia and mild facial dysmorphism with normal brain MRI. Further, we show that RSRC1 is ubiquitously expressed, and that the RSRC1 mutation triggers nonsense-mediated mRNA decay of the RSRC1 transcript in patients' fibroblasts. Short hairpin RNA (shRNA)-mediated lentiviral silencing and overexpression of RSRC1 in SH-SY5Y cells demonstrated that RSRC1 has a role in alternative splicing and transcription regulation. Transcriptome profiling of RSRC1-silenced cells unravelled specific differentially expressed genes previously associated with intellectual disability, hypotonia and schizophrenia, relevant to the disease phenotype. Protein-protein interaction network modelling suggested possible intermediate interactions by which RSRC1 affects gene-specific differential expression. Patient-derived induced pluripotent stem cells, differentiated into neural progenitor cells, showed expression dynamics similar to the RSRC1-silenced SH-SY5Y model. Notably, patient neural progenitor cells had 9.6-fold downregulated expression of IGFBP3, whose brain expression is affected by MECP2, aberrant in Rett syndrome. Interestingly, Igfbp3-null mice have behavioural impairment, abnormal synaptic function and monoaminergic neurotransmission, likely correlating with the disease phenotype.


Subject(s)
Alternative Splicing/genetics , Developmental Disabilities/genetics , Down-Regulation/genetics , Insulin-Like Growth Factor Binding Protein 3/metabolism , Intellectual Disability/genetics , Nuclear Proteins/genetics , Animals , Cell Differentiation/genetics , Cell Line, Transformed , Child , Child, Preschool , Consanguinity , Developmental Disabilities/complications , Female , Follow-Up Studies , Gene Ontology , Humans , Infant , Intellectual Disability/complications , Male , Mice , Mice, Knockout , Pluripotent Stem Cells/metabolism , RNA, Small Interfering/genetics , RNA, Small Interfering/metabolism
13.
Nucleic Acids Res ; 45(D1): D427-D431, 2017 01 04.
Article in English | MEDLINE | ID: mdl-27899616

ABSTRACT

Knowledge of the molecular interactions of human proteins within tissues is important for identifying their tissue-specific roles and for shedding light on tissue phenotypes. However, many protein-protein interactions (PPIs) have no tissue-contexts. The TissueNet database bridges this gap by associating experimentally-identified PPIs with human tissues that were shown to express both pair-mates. Users can select a protein and a tissue, and obtain a network view of the query protein and its tissue-associated PPIs. TissueNet v.2 is an updated version of the TissueNet database previously featured in NAR. It includes over 40 human tissues profiled via RNA-sequencing or protein-based assays. Users can select their preferred expression data source and interactively set the expression threshold for determining tissue-association. The output of TissueNet v.2 emphasizes qualitative and quantitative features of query proteins and their PPIs. The tissue-specificity view highlights tissue-specific and globally-expressed proteins, and the quantitative view highlights proteins that were differentially expressed in the selected tissue relative to all other tissues. Together, these views allow users to quickly assess the unique versus global functionality of query proteins. Thus, TissueNet v.2 offers an extensive, quantitative and user-friendly interface to study the roles of human proteins across tissues. TissueNet v.2 is available at http://netbio.bgu.ac.il/tissuenet.


Subject(s)
Computational Biology/methods , Databases, Protein , Protein Interaction Mapping/methods , Software , Humans , Organ Specificity
14.
PLoS Genet ; 12(12): e1006531, 2016 Dec.
Article in English | MEDLINE | ID: mdl-28036392

ABSTRACT

Safeguarding the proteome is central to the health of the cell. In multi-cellular organisms, the composition of the proteome, and by extension, protein-folding requirements, varies between cells. In agreement, chaperone network composition differs between tissues. Here, we ask how chaperone expression is regulated in a cell type-specific manner and whether cellular differentiation affects chaperone expression. Our bioinformatics analyses show that the myogenic transcription factor HLH-1 (MyoD) can bind to the promoters of chaperone genes expressed or required for the folding of muscle proteins. To test this experimentally, we employed HLH-1 myogenic potential to genetically modulate cellular differentiation of Caenorhabditis elegans embryonic cells by ectopically expressing HLH-1 in all cells of the embryo and monitoring chaperone expression. We found that HLH-1-dependent myogenic conversion specifically induced the expression of putative HLH-1-regulated chaperones in differentiating muscle cells. Moreover, disrupting the putative HLH-1-binding sites on ubiquitously expressed daf-21(Hsp90) and muscle-enriched hsp-12.2(sHsp) promoters abolished their myogenic-dependent expression. Disrupting HLH-1 function in muscle cells reduced the expression of putative HLH-1-regulated chaperones and compromised muscle proteostasis during and after embryogenesis. In turn, we found that modulating the expression of muscle chaperones disrupted the folding and assembly of muscle proteins and thus, myogenesis. Moreover, muscle-specific over-expression of the DNAJB6 homolog DNJ-24, a limb-girdle muscular dystrophy-associated chaperone, disrupted the muscle chaperone network and exposed synthetic motility defects. We propose that cellular differentiation could establish a proteostasis network dedicated to the folding and maintenance of the muscle proteome. Such cell-specific proteostasis networks can explain the selective vulnerability that many diseases of protein misfolding exhibit even when the misfolded protein is ubiquitously expressed.


Subject(s)
Caenorhabditis elegans Proteins/genetics , Caenorhabditis elegans/genetics , DNA-Binding Proteins/genetics , HSP90 Heat-Shock Proteins/genetics , Heat-Shock Proteins/genetics , Myogenic Regulatory Factors/genetics , Animals , Binding Sites , Caenorhabditis elegans/growth & development , Caenorhabditis elegans Proteins/biosynthesis , Caenorhabditis elegans Proteins/metabolism , Cell Differentiation/genetics , DNA-Binding Proteins/metabolism , Embryonic Development/genetics , Gene Expression Regulation, Developmental , HSP40 Heat-Shock Proteins/genetics , HSP40 Heat-Shock Proteins/metabolism , HSP90 Heat-Shock Proteins/metabolism , Heat-Shock Proteins/biosynthesis , Molecular Chaperones/biosynthesis , Molecular Chaperones/genetics , Molecular Chaperones/metabolism , Muscle Cells/metabolism , Muscle Development/genetics , Muscle Proteins , Myogenic Regulatory Factors/metabolism , Nerve Tissue Proteins/genetics , Nerve Tissue Proteins/metabolism , Nuclear Proteins , Promoter Regions, Genetic , Transcription Factors
15.
Bioinformatics ; 33(12): 1907-1909, 2017 Jun 15.
Article in English | MEDLINE | ID: mdl-28165111

ABSTRACT

SUMMARY: Network motifs are small topological patterns that recur in a network significantly more often than expected by chance. Their identification emerged as a powerful approach for uncovering the design principles underlying complex networks. However, available tools for network motif analysis typically require download and execution of computationally intensive software on a local computer. We present MotifNet, the first open-access web-server for network motif analysis. MotifNet allows researchers to analyze integrated networks, where nodes and edges may be labeled, and to search for motifs of up to eight nodes. The output motifs are presented graphically and the user can interactively filter them by their significance, number of instances, node and edge labels, and node identities, and view their instances. MotifNet also allows the user to distinguish between motifs that are centered on specific nodes and motifs that recur in distinct parts of the network. AVAILABILITY AND IMPLEMENTATION: MotifNet is freely available at http://netbio.bgu.ac.il/motifnet . The website was implemented using ReactJs and supports all major browsers. The server interface was implemented in Python with data stored on a MySQL database. CONTACT: estiyl@bgu.ac.il or michaluz@cs.bgu.ac.il. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Computational Biology/methods , Software , Databases, Factual , Internet
16.
PLoS Comput Biol ; 13(1): e1005221, 2017 01.
Article in English | MEDLINE | ID: mdl-28135269

ABSTRACT

Protein phosphorylation underlies cellular response pathways across eukaryotes and is governed by the opposing actions of phosphorylating kinases and de-phosphorylating phosphatases. While kinases and phosphatases have been extensively studied, their organization and the mechanisms by which they balance each other are not well understood. To address these questions we performed quantitative analyses of large-scale 'omics' datasets from yeast, fly, plant, mouse and human. We uncovered an asymmetric balance of a previously-hidden scale: Each organism contained many different kinase genes, and these were balanced by a small set of highly abundant phosphatase proteins. Kinases were much more responsive to perturbations at the gene and protein levels. In addition, kinases had diverse scales of phenotypic impact when manipulated. Phosphatases, in contrast, were stable, highly robust and flatly organized, with rather uniform impact downstream. We validated aspects of this organization experimentally in nematode, and supported additional aspects by theoretic analysis of the dynamics of protein phosphorylation. Our analyses explain the empirical bias in the protein phosphorylation field toward characterization and therapeutic targeting of kinases at the expense of phosphatases. We show quantitatively and broadly that this is not only a historical bias, but stems from wide-ranging differences in their organization and impact. The asymmetric balance between these opposing regulators of protein phosphorylation is also common to opposing regulators of two other post-translational modification systems, suggesting its fundamental value.


Subject(s)
Evolution, Molecular , Gene Expression Regulation, Enzymologic/physiology , Phosphoric Monoester Hydrolases/genetics , Phosphoric Monoester Hydrolases/metabolism , Phosphotransferases/genetics , Phosphotransferases/metabolism , Animals , Arabidopsis/genetics , Arabidopsis/metabolism , Drosophila melanogaster/genetics , Drosophila melanogaster/metabolism , Enzyme Activation/genetics , Genetic Variation/genetics , Mice , Phosphoric Monoester Hydrolases/classification , Phosphorylation , Phosphotransferases/classification , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae/metabolism , Species Specificity , Yeasts
17.
Nucleic Acids Res ; 43(W1): W258-63, 2015 Jul 01.
Article in English | MEDLINE | ID: mdl-25990735

ABSTRACT

The identification of the molecular pathways active in specific contexts, such as disease states or drug responses, often requires an extensive view of the potential interactions between a subset of proteins. This view is not easily obtained: it requires the integration of context-specific protein list or expression data with up-to-date data of protein interactions that are typically spread across multiple databases. The MyProteinNet web server allows users to easily create such context-sensitive protein interaction networks. Users can automatically gather and consolidate data from up to 11 different databases to create a generic protein interaction network (interactome). They can score the interactions based on reliability and filter them by user-defined contexts including molecular expression and protein annotation. The output of MyProteinNet includes the generic and filtered interactome files, together with a summary of their network attributes. MyProteinNet is particularly geared toward building human tissue interactomes, by maintaining tissue expression profiles from multiple resources. The ability of MyProteinNet to facilitate the construction of up-to-date, context-specific interactomes and its applicability to 11 different organisms and to tens of human tissues, make it a powerful tool in meaningful analysis of protein networks. MyProteinNet is available at http://netbio.bgu.ac.il/myproteinnet.


Subject(s)
Protein Interaction Mapping/methods , Software , Databases, Protein , Gene Expression Profiling , Gene Ontology , Humans , Internet , Molecular Sequence Annotation
18.
PLoS Genet ; 10(3): e1004239, 2014 Mar.
Article in English | MEDLINE | ID: mdl-24603726

ABSTRACT

Cancer is an evolutionary process in which cells acquire new transformative, proliferative and metastatic capabilities. A full understanding of cancer requires learning the dynamics of the cancer evolutionary process. We present here a large-scale analysis of the dynamics of this evolutionary process within tumors, with a focus on breast cancer. We show that the cancer evolutionary process differs greatly from organismal (germline) evolution. Organismal evolution is dominated by purifying selection (that removes mutations that are harmful to fitness). In contrast, in the cancer evolutionary process the dominance of purifying selection is much reduced, allowing for a much easier detection of the signals of positive selection (adaptation). We further show that, as a group, genes that are globally expressed across human tissues show a very strong signal of positive selection within tumors. Indeed, known cancer genes are enriched for global expression patterns. Yet, positive selection is prevalent even on globally expressed genes that have not yet been associated with cancer, suggesting that globally expressed genes are enriched for yet undiscovered cancer related functions. We find that the increased positive selection on globally expressed genes within tumors is not due to their expression in the tissue relevant to the cancer. Rather, such increased adaptation is likely due to globally expressed genes being enriched in important housekeeping and essential functions. Thus, our results suggest that tumor adaptation is most often mediated through somatic changes to those genes that are important for the most basic cellular functions. Together, our analysis reveals the uniqueness of the cancer evolutionary process and the particular importance of globally expressed genes in driving cancer initiation and progression.


Subject(s)
Breast Neoplasms/genetics , Carcinogenesis/genetics , Evolution, Molecular , Selection, Genetic , Breast Neoplasms/pathology , Female , Gene Expression Regulation, Neoplastic , Humans , Mutation
19.
Plant Biotechnol J ; 13(4): 501-13, 2015 May.
Article in English | MEDLINE | ID: mdl-25370817

ABSTRACT

As challenges to food security increase, the demand for lead genes for improving crop production is growing. However, genetic screens of plant mutants typically yield very low frequencies of desired phenotypes. Here, we present a powerful computational approach for selecting candidate genes for screening insertion mutants. We combined ranking of Arabidopsis thaliana regulatory genes according to their expression in response to multiple abiotic stresses (Multiple Stress [MST] score), with stress-responsive RNA co-expression network analysis to select candidate multiple stress regulatory (MSTR) genes. Screening of 62 T-DNA insertion mutants defective in candidate MSTR genes, for abiotic stress germination phenotypes yielded a remarkable hit rate of up to 62%; this gene discovery rate is 48-fold greater than that of other large-scale insertional mutant screens. Moreover, the MST score of these genes could be used to prioritize them for screening. To evaluate the contribution of the co-expression analysis, we screened 64 additional mutant lines of MST-scored genes that did not appear in the RNA co-expression network. The screening of these MST-scored genes yielded a gene discovery rate of 36%, which is much higher than that of classic mutant screens but not as high as when picking candidate genes from the co-expression network. The MSTR co-expression network that we created, AraSTressRegNet is publicly available at http://netbio.bgu.ac.il/arnet. This systems biology-based screening approach combining gene ranking and network analysis could be generally applicable to enhancing identification of genes regulating additional processes in plants and other organisms provided that suitable transcriptome data are available.


Subject(s)
Arabidopsis/genetics , Gene Expression , Gene Regulatory Networks , Genes, Plant , Stress, Physiological/genetics , Mutagenesis, Insertional , Oligonucleotide Array Sequence Analysis
20.
PLoS Comput Biol ; 10(6): e1003632, 2014 Jun.
Article in English | MEDLINE | ID: mdl-24921629

ABSTRACT

An open question in human genetics is what underlies the tissue-specific manifestation of hereditary diseases, which are caused by genomic aberrations that are present in cells across the human body. Here we analyzed this phenomenon for over 300 hereditary diseases by using comparative network analysis. We created an extensive resource of protein expression and interactions in 16 main human tissues, by integrating recent data of gene and protein expression across tissues with data of protein-protein interactions (PPIs). The resulting tissue interaction networks (interactomes) shared a large fraction of their proteins and PPIs, and only a small fraction of them were tissue-specific. Applying this resource to hereditary diseases, we first show that most of the disease-causing genes are widely expressed across tissues, yet, enigmatically, cause disease phenotypes in few tissues only. Upon testing for factors that could lead to tissue-specific vulnerability, we find that disease-causing genes tend to have elevated transcript levels and increased number of tissue-specific PPIs in their disease tissues compared to unaffected tissues. We demonstrate through several examples that these tissue-specific PPIs can highlight disease mechanisms, and thus, owing to their small number, provide a powerful filter for interrogating disease etiologies. As two thirds of the hereditary diseases are associated with these factors, comparative tissue analysis offers a meaningful and efficient framework for enhancing the understanding of the molecular basis of hereditary diseases.


Subject(s)
Genetic Diseases, Inborn/genetics , Genetic Diseases, Inborn/metabolism , Protein Interaction Mapping/methods , Computational Biology , Databases, Genetic , Gene Expression Profiling , Genetic Predisposition to Disease , Genomics , Humans , Protein Interaction Mapping/statistics & numerical data , Protein Interaction Maps , Proteomics , Tissue Distribution
SELECTION OF CITATIONS
SEARCH DETAIL