RESUMO
We used clinical tissue from lethal metastatic castration-resistant prostate cancer (CRPC) patients obtained at rapid autopsy to evaluate diverse genomic, transcriptomic, and phosphoproteomic datasets for pathway analysis. Using Tied Diffusion through Interacting Events (TieDIE), we integrated differentially expressed master transcriptional regulators, functionally mutated genes, and differentially activated kinases in CRPC tissues to synthesize a robust signaling network consisting of druggable kinase pathways. Using MSigDB hallmark gene sets, six major signaling pathways with phosphorylation of several key residues were significantly enriched in CRPC tumors after incorporation of phosphoproteomic data. Individual autopsy profiles developed using these hallmarks revealed clinically relevant pathway information potentially suitable for patient stratification and targeted therapies in late stage prostate cancer. Here, we describe phosphorylation-based cancer hallmarks using integrated personalized signatures (pCHIPS) that shed light on the diversity of activated signaling pathways in metastatic CRPC while providing an integrative, pathway-based reference for drug prioritization in individual patients.
Assuntos
Fosfoproteínas/análise , Neoplasias de Próstata Resistentes à Castração/química , Proteoma/análise , Algoritmos , Humanos , Masculino , Medicina de Precisão , Neoplasias de Próstata Resistentes à Castração/metabolismo , Transdução de Sinais , TranscriptomaRESUMO
It remains unclear whether causal, rather than merely correlational, relationships in molecular networks can be inferred in complex biological settings. Here we describe the HPN-DREAM network inference challenge, which focused on learning causal influences in signaling networks. We used phosphoprotein data from cancer cell lines as well as in silico data from a nonlinear dynamical model. Using the phosphoprotein data, we scored more than 2,000 networks submitted by challenge participants. The networks spanned 32 biological contexts and were scored in terms of causal validity with respect to unseen interventional data. A number of approaches were effective, and incorporating known biology was generally advantageous. Additional sub-challenges considered time-course prediction and visualization. Our results suggest that learning causal relationships may be feasible in complex settings such as disease states. Furthermore, our scoring approach provides a practical way to empirically assess inferred molecular networks in a causal sense.
Assuntos
Causalidade , Redes Reguladoras de Genes , Neoplasias/genética , Mapeamento de Interação de Proteínas/métodos , Software , Biologia de Sistemas , Algoritmos , Biologia Computacional , Simulação por Computador , Perfilação da Expressão Gênica , Humanos , Modelos Biológicos , Transdução de Sinais , Células Tumorais CultivadasRESUMO
Summary: We present pyNBS: a modularized Python 2.7 implementation of the network-based stratification (NBS) algorithm for stratifying tumor somatic mutation profiles into molecularly and clinically relevant subtypes. In addition to release of the software, we benchmark its key parameters and provide a compact cancer reference network that increases the significance of tumor stratification using the NBS algorithm. The structure of the code exposes key steps of the algorithm to foster further collaborative development. Availability and implementation: The package, along with examples and data, can be downloaded and installed from the URL https://github.com/idekerlab/pyNBS.
Assuntos
Mutação , Neoplasias/genética , Software , Algoritmos , Humanos , Análise de Sequência de DNA/estatística & dados numéricosRESUMO
Network propagation is an important and widely used algorithm in systems biology, with applications in protein function prediction, disease gene prioritization, and patient stratification. However, up to this point it has required significant expertise to run. Here we extend the popular network analysis program Cytoscape to perform network propagation as an integrated function. Such integration greatly increases the access to network propagation by putting it in the hands of biologists and linking it to the many other types of network analysis and visualization available through Cytoscape. We demonstrate the power and utility of the algorithm by identifying mutations conferring resistance to Vemurafenib.
Assuntos
Algoritmos , Software , Biologia de Sistemas/métodos , Animais , Resistencia a Medicamentos Antineoplásicos , Indóis , Modelos Biológicos , Mutação , Mapeamento de Interação de Proteínas/métodos , Sulfonamidas , VemurafenibRESUMO
We present a novel regularization scheme called The Generalized Elastic Net (GELnet) that incorporates gene pathway information into feature selection. The proposed formulation is applicable to a wide variety of problems in which the interpretation of predictive features using known molecular interactions is desired. The method naturally steers solutions toward sets of mechanistically interlinked genes. Using experiments on synthetic data, we demonstrate that pathway-guided results maintain, and often improve, the accuracy of predictors even in cases where the full gene network is unknown. We apply the method to predict the drug response of breast cancer cell lines. GELnet is able to reveal genetic determinants of sensitivity and resistance for several compounds. In particular, for an EGFR/HER2 inhibitor, it finds a possible trans-differentiation resistance mechanism missed by the corresponding pathway agnostic approach.
Assuntos
Mapeamento Cromossômico/métodos , Modelos Genéticos , Reconhecimento Automatizado de Padrão/métodos , Mapeamento de Interação de Proteínas/métodos , Proteoma/genética , Transdução de Sinais/genética , Animais , Simulação por Computador , HumanosRESUMO
MOTIVATION: Identifying the cellular wiring that connects genomic perturbations to transcriptional changes in cancer is essential to gain a mechanistic understanding of disease initiation, progression and ultimately to predict drug response. We have developed a method called Tied Diffusion Through Interacting Events (TieDIE) that uses a network diffusion approach to connect genomic perturbations to gene expression changes characteristic of cancer subtypes. The method computes a subnetwork of protein-protein interactions, predicted transcription factor-to-target connections and curated interactions from literature that connects genomic and transcriptomic perturbations. RESULTS: Application of TieDIE to The Cancer Genome Atlas and a breast cancer cell line dataset identified key signaling pathways, with examples impinging on MYC activity. Interlinking genes are predicted to correspond to essential components of cancer signaling and may provide a mechanistic explanation of tumor character and suggest subtype-specific drug targets. AVAILABILITY: Software is available from the Stuart lab's wiki: https://sysbiowiki.soe.ucsc.edu/tiedie. CONTACT: jstuart@ucsc.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Regulação Neoplásica da Expressão Gênica , Redes Reguladoras de Genes , Neoplasias da Mama/classificação , Neoplasias da Mama/genética , Neoplasias da Mama/metabolismo , Linhagem Celular Tumoral , Feminino , Perfilação da Expressão Gênica , Genômica , Humanos , Neoplasias/genética , Mapeamento de Interação de Proteínas , Transdução de Sinais , Software , Fatores de Transcrição/metabolismoRESUMO
Many disease-causing genetic variants converge on common biological functions and pathways. Precisely how to incorporate pathway knowledge in genetic association studies is not yet clear, however. Previous approaches employ a two-step approach, in which a regular association test is first performed to identify variants associated with the disease phenotype, followed by a test for functional enrichment within the genes implicated by those variants. Here we introduce a concise one-step approach, Hierarchical Genetic Analysis (Higana), which directly computes phenotype associations against each function in the large hierarchy of biological functions documented by the Gene Ontology. Using this approach, we identify risk genes and functions for Chronic Obstructive Pulmonary Disease (COPD), highlighting microtubule transport, muscle adaptation, and nicotine receptor signaling pathways. Microtubule transport has not been previously linked to COPD, as it integrates genetic variants spread over numerous genes. All associations validate strongly in a second COPD cohort.
Assuntos
Predisposição Genética para Doença , Doença Pulmonar Obstrutiva Crônica , Humanos , Doença Pulmonar Obstrutiva Crônica/genética , Estudos de Associação Genética , Estudo de Associação Genômica Ampla , Polimorfismo de Nucleotídeo ÚnicoRESUMO
Biological networks can substantially boost power to identify disease genes in genome-wide association studies. To explore different network GWAS methods, we challenged students of a UC San Diego graduate level bioinformatics course, Network Biology and Biomedicine, to explore and improve such algorithms during a four-week-long classroom competition. Here, we report the many creative solutions and share our experiences in conducting classroom crowd science as both a research and pedagogical tool.
Assuntos
Biologia Computacional/educação , Crowdsourcing/métodos , Estudo de Associação Genômica Ampla/métodos , Educação de Pós-Graduação/métodos , HumanosRESUMO
We present an accessible, fast, and customizable network propagation system for pathway boosting and interpretation of genome-wide association studies. This system-NAGA (Network Assisted Genomic Association)-taps the NDEx biological network resource to gain access to thousands of protein networks and select those most relevant and performative for a specific association study. The method works efficiently, completing genome-wide analysis in under 5 minutes on a modern laptop computer. We show that NAGA recovers many known disease genes from analysis of schizophrenia genetic data, and it substantially boosts associations with previously unappreciated genes such as amyloid beta precursor. On this and seven other gene-disease association tasks, NAGA outperforms conventional approaches in recovery of known disease genes and replicability of results. Protein interactions associated with disease are visualized and annotated in Cytoscape, which, in addition to standard programmatic interfaces, allows for downstream analysis.
RESUMO
Precision cancer medicine promises to tailor clinical decisions to patients using genomic information. Indeed, successes of drugs targeting genetic alterations in tumors, such as imatinib that targets BCR-ABL in chronic myelogenous leukemia, have demonstrated the power of this approach. However, biological systems are complex, and patients may differ not only by the specific genetic alterations in their tumor, but also by more subtle interactions among such alterations. Systems biology and more specifically, network analysis, provides a framework for advancing precision medicine beyond clinical actionability of individual mutations. Here we discuss applications of network analysis to study tumor biology, early methods for N-of-1 tumor genome analysis, and the path for such tools to the clinic.
Assuntos
Oncologia/estatística & dados numéricos , Neoplasias/epidemiologia , Medicina de Precisão/estatística & dados numéricos , Algoritmos , Suscetibilidade a Doenças , Genômica/métodos , Humanos , Oncologia/normas , Neoplasias/etiologia , Neoplasias/metabolismo , Neoplasias/terapia , Redes Neurais de Computação , Medicina de Precisão/normas , Prognóstico , Biologia de SistemasRESUMO
Gene networks are rapidly growing in size and number, raising the question of which networks are most appropriate for particular applications. Here, we evaluate 21 human genome-wide interaction networks for their ability to recover 446 disease gene sets identified through literature curation, gene expression profiling, or genome-wide association studies. While all networks have some ability to recover disease genes, we observe a wide range of performance with STRING, ConsensusPathDB, and GIANT networks having the best performance overall. A general tendency is that performance scales with network size, suggesting that new interaction discovery currently outweighs the detrimental effects of false positives. Correcting for size, we find that the DIP network provides the highest efficiency (value per interaction). Based on these results, we create a parsimonious composite network with both high efficiency and performance. This work provides a benchmark for selection of molecular networks in human disease research.
Assuntos
Redes Reguladoras de Genes , Predisposição Genética para Doença , Algoritmos , Biologia Computacional , Genoma Humano , HumanosRESUMO
We introduce a novel method called Prophetic Granger Causality (PGC) for inferring gene regulatory networks (GRNs) from protein-level time series data. The method uses an L1-penalized regression adaptation of Granger Causality to model protein levels as a function of time, stimuli, and other perturbations. When combined with a data-independent network prior, the framework outperformed all other methods submitted to the HPN-DREAM 8 breast cancer network inference challenge. Our investigations reveal that PGC provides complementary information to other approaches, raising the performance of ensemble learners, while on its own achieves moderate performance. Thus, PGC serves as a valuable new tool in the bioinformatics toolkit for analyzing temporal datasets. We investigate the general and cell-specific interactions predicted by our method and find several novel interactions, demonstrating the utility of the approach in charting new tumor wiring.