Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 213
Filter
Add more filters

Country/Region as subject
Publication year range
1.
Cell ; 165(4): 963-75, 2016 May 05.
Article in English | MEDLINE | ID: mdl-27087444

ABSTRACT

Non-coding RNAs are ubiquitous, but the discovery of new RNA gene sequences far outpaces the research on the structure and functional interactions of these RNA gene sequences. We mine the evolutionary sequence record to derive precise information about the function and structure of RNAs and RNA-protein complexes. As in protein structure prediction, we use maximum entropy global probability models of sequence co-variation to infer evolutionarily constrained nucleotide-nucleotide interactions within RNA molecules and nucleotide-amino acid interactions in RNA-protein complexes. The predicted contacts allow all-atom blinded 3D structure prediction at good accuracy for several known RNA structures and RNA-protein complexes. For unknown structures, we predict contacts in 160 non-coding RNA families. Beyond 3D structure prediction, evolutionary couplings help identify important functional interactions-e.g., at switch points in riboswitches and at a complex nucleation site in HIV. Aided by increasing sequence accumulation, evolutionary coupling analysis can accelerate the discovery of functional interactions and 3D structures involving RNA.


Subject(s)
Nucleic Acid Conformation , RNA, Untranslated/chemistry , Entropy , Evolution, Molecular , Models, Molecular , RNA Folding , RNA, Untranslated/genetics , RNA, Untranslated/metabolism , RNA-Binding Proteins/chemistry , RNA-Binding Proteins/metabolism , Ribosomes/metabolism
2.
Cell ; 167(1): 158-170.e12, 2016 Sep 22.
Article in English | MEDLINE | ID: mdl-27662088

ABSTRACT

Protein flexibility ranges from simple hinge movements to functional disorder. Around half of all human proteins contain apparently disordered regions with little 3D or functional information, and many of these proteins are associated with disease. Building on the evolutionary couplings approach previously successful in predicting 3D states of ordered proteins and RNA, we developed a method to predict the potential for ordered states for all apparently disordered proteins with sufficiently rich evolutionary information. The approach is highly accurate (79%) for residue interactions as tested in more than 60 known disordered regions captured in a bound or specific condition. Assessing the potential for structure of more than 1,000 apparently disordered regions of human proteins reveals a continuum of structural order with at least 50% with clear propensity for three- or two-dimensional states. Co-evolutionary constraints reveal hitherto unseen structures of functional importance in apparently disordered proteins.


Subject(s)
Intrinsically Disordered Proteins/chemistry , Directed Molecular Evolution/methods , Genomics , Humans , Intrinsically Disordered Proteins/genetics , Protein Structure, Secondary , Protein Structure, Tertiary , Proteome/chemistry , Proteome/genetics
3.
Cell ; 166(3): 766-778, 2016 Jul 28.
Article in English | MEDLINE | ID: mdl-27453469

ABSTRACT

The ability to reliably and reproducibly measure any protein of the human proteome in any tissue or cell type would be transformative for understanding systems-level properties as well as specific pathways in physiology and disease. Here, we describe the generation and verification of a compendium of highly specific assays that enable quantification of 99.7% of the 20,277 annotated human proteins by the widely accessible, sensitive, and robust targeted mass spectrometric method selected reaction monitoring, SRM. This human SRMAtlas provides definitive coordinates that conclusively identify the respective peptide in biological samples. We report data on 166,174 proteotypic peptides providing multiple, independent assays to quantify any human protein and numerous spliced variants, non-synonymous mutations, and post-translational modifications. The data are freely accessible as a resource at http://www.srmatlas.org/, and we demonstrate its utility by examining the network response to inhibition of cholesterol synthesis in liver cells and to docetaxel in prostate cancer lines.


Subject(s)
Databases, Protein , Proteome , Access to Information , Antineoplastic Agents/therapeutic use , Cell Line, Tumor , Cholesterol/biosynthesis , Docetaxel , Female , Humans , Internet , Liver/drug effects , Male , Mutation , Prostatic Neoplasms/drug therapy , RNA Splicing , Taxoids/therapeutic use
4.
Cell ; 163(2): 506-19, 2015 Oct 08.
Article in English | MEDLINE | ID: mdl-26451490

ABSTRACT

Invasive lobular carcinoma (ILC) is the second most prevalent histologic subtype of invasive breast cancer. Here, we comprehensively profiled 817 breast tumors, including 127 ILC, 490 ductal (IDC), and 88 mixed IDC/ILC. Besides E-cadherin loss, the best known ILC genetic hallmark, we identified mutations targeting PTEN, TBX3, and FOXA1 as ILC enriched features. PTEN loss associated with increased AKT phosphorylation, which was highest in ILC among all breast cancer subtypes. Spatially clustered FOXA1 mutations correlated with increased FOXA1 expression and activity. Conversely, GATA3 mutations and high expression characterized luminal A IDC, suggesting differential modulation of ER activity in ILC and IDC. Proliferation and immune-related signatures determined three ILC transcriptional subtypes associated with survival differences. Mixed IDC/ILC cases were molecularly classified as ILC-like and IDC-like revealing no true hybrid features. This multidimensional molecular atlas sheds new light on the genetic bases of ILC and provides potential clinical options.


Subject(s)
Breast Neoplasms/genetics , Breast Neoplasms/pathology , Carcinoma, Lobular/genetics , Carcinoma, Lobular/pathology , Antigens, CD , Breast Neoplasms/metabolism , Cadherins/chemistry , Cadherins/genetics , Cadherins/metabolism , Carcinoma, Ductal, Breast/genetics , Carcinoma, Ductal, Breast/pathology , Carcinoma, Lobular/metabolism , Female , Hepatocyte Nuclear Factor 3-alpha/chemistry , Hepatocyte Nuclear Factor 3-alpha/genetics , Hepatocyte Nuclear Factor 3-alpha/metabolism , Humans , Models, Molecular , Mutation , Oligonucleotide Array Sequence Analysis , Oncogene Protein v-akt/metabolism , Transcriptome
5.
Nature ; 625(7994): 377-384, 2024 Jan.
Article in English | MEDLINE | ID: mdl-38057668

ABSTRACT

Cytokines mediate cell-cell communication in the immune system and represent important therapeutic targets1-3. A myriad of studies have highlighted their central role in immune function4-13, yet we lack a global view of the cellular responses of each immune cell type to each cytokine. To address this gap, we created the Immune Dictionary, a compendium of single-cell transcriptomic profiles of more than 17 immune cell types in response to each of 86 cytokines (>1,400 cytokine-cell type combinations) in mouse lymph nodes in vivo. A cytokine-centric view of the dictionary revealed that most cytokines induce highly cell-type-specific responses. For example, the inflammatory cytokine interleukin-1ß induces distinct gene programmes in almost every cell type. A cell-type-centric view of the dictionary identified more than 66 cytokine-driven cellular polarization states across immune cell types, including previously uncharacterized states such as an interleukin-18-induced polyfunctional natural killer cell state. Based on this dictionary, we developed companion software, Immune Response Enrichment Analysis, for assessing cytokine activities and immune cell polarization from gene expression data, and applied it to reveal cytokine networks in tumours following immune checkpoint blockade therapy. Our dictionary generates new hypotheses for cytokine functions, illuminates pleiotropic effects of cytokines, expands our knowledge of activation states of each immune cell type, and provides a framework to deduce the roles of specific cytokines and cell-cell communication networks in any immune response.


Subject(s)
Cytokines , Immunity , Single-Cell Analysis , Animals , Mice , Cell Communication/drug effects , Cytokines/immunology , Gene Expression Profiling , Gene Expression Regulation , Immune Checkpoint Inhibitors/pharmacology , Immune Checkpoint Inhibitors/therapeutic use , Immunity/drug effects , Interleukin-18/immunology , Interleukin-1beta/immunology , Killer Cells, Natural/immunology , Lymph Nodes/cytology , Lymph Nodes/immunology , Neoplasms/immunology , Neoplasms/therapy , Signal Transduction/drug effects , Software
6.
Nature ; 622(7984): 818-825, 2023 Oct.
Article in English | MEDLINE | ID: mdl-37821700

ABSTRACT

Effective pandemic preparedness relies on anticipating viral mutations that are able to evade host immune responses to facilitate vaccine and therapeutic design. However, current strategies for viral evolution prediction are not available early in a pandemic-experimental approaches require host polyclonal antibodies to test against1-16, and existing computational methods draw heavily from current strain prevalence to make reliable predictions of variants of concern17-19. To address this, we developed EVEscape, a generalizable modular framework that combines fitness predictions from a deep learning model of historical sequences with biophysical and structural information. EVEscape quantifies the viral escape potential of mutations at scale and has the advantage of being applicable before surveillance sequencing, experimental scans or three-dimensional structures of antibody complexes are available. We demonstrate that EVEscape, trained on sequences available before 2020, is as accurate as high-throughput experimental scans at anticipating pandemic variation for SARS-CoV-2 and is generalizable to other viruses including influenza, HIV and understudied viruses with pandemic potential such as Lassa and Nipah. We provide continually revised escape scores for all current strains of SARS-CoV-2 and predict probable further mutations to forecast emerging strains as a tool for continuing vaccine development ( evescape.org ).


Subject(s)
Evolution, Molecular , Forecasting , Immune Evasion , Mutation , Pandemics , Viruses , Humans , Drug Design , HIV Infections , Immune Evasion/genetics , Immune Evasion/immunology , Influenza, Human , Lassa virus , Nipah Virus , SARS-CoV-2/genetics , SARS-CoV-2/immunology , Viral Vaccines/immunology , Viruses/genetics , Viruses/immunology
7.
Nature ; 606(7914): 576-584, 2022 06.
Article in English | MEDLINE | ID: mdl-35385861

ABSTRACT

SARS-CoV-2 can cause acute respiratory distress and death in some patients1. Although severe COVID-19 is linked to substantial inflammation, how SARS-CoV-2 triggers inflammation is not clear2. Monocytes and macrophages are sentinel cells that sense invasive infection to form inflammasomes that activate caspase-1 and gasdermin D, leading to inflammatory death (pyroptosis) and the release of potent inflammatory mediators3. Here we show that about 6% of blood monocytes of patients with COVID-19 are infected with SARS-CoV-2. Monocyte infection depends on the uptake of antibody-opsonized virus by Fcγ receptors. The plasma of vaccine recipients does not promote antibody-dependent monocyte infection. SARS-CoV-2 begins to replicate in monocytes, but infection is aborted, and infectious virus is not detected in the supernatants of cultures of infected monocytes. Instead, infected cells undergo pyroptosis mediated by activation of NLRP3 and AIM2 inflammasomes, caspase-1 and gasdermin D. Moreover, tissue-resident macrophages, but not infected epithelial and endothelial cells, from lung autopsies from patients with COVID-19 have activated inflammasomes. Taken together, these findings suggest that antibody-mediated SARS-CoV-2 uptake by monocytes and macrophages triggers inflammatory cell death that aborts the production of infectious virus but causes systemic inflammation that contributes to COVID-19 pathogenesis.


Subject(s)
COVID-19 , Inflammation , Monocytes , Receptors, IgG , SARS-CoV-2 , COVID-19/virology , Caspase 1/metabolism , DNA-Binding Proteins , Humans , Inflammasomes/metabolism , Inflammation/metabolism , Inflammation/virology , Monocytes/metabolism , Monocytes/virology , NLR Family, Pyrin Domain-Containing 3 Protein , Phosphate-Binding Proteins , Pore Forming Cytotoxic Proteins , Receptors, IgG/metabolism
8.
Nat Methods ; 21(3): 531-540, 2024 Mar.
Article in English | MEDLINE | ID: mdl-38279009

ABSTRACT

Analysis across a growing number of single-cell perturbation datasets is hampered by poor data interoperability. To facilitate development and benchmarking of computational methods, we collect a set of 44 publicly available single-cell perturbation-response datasets with molecular readouts, including transcriptomics, proteomics and epigenomics. We apply uniform quality control pipelines and harmonize feature annotations. The resulting information resource, scPerturb, enables development and testing of computational methods, and facilitates comparison and integration across datasets. We describe energy statistics (E-statistics) for quantification of perturbation effects and significance testing, and demonstrate E-distance as a general distance measure between sets of single-cell expression profiles. We illustrate the application of E-statistics for quantifying similarity and efficacy of perturbations. The perturbation-response datasets and E-statistics computation software are publicly available at scperturb.org. This work provides an information resource for researchers working with single-cell perturbation data and recommendations for experimental design, including optimal cell counts and read depth.


Subject(s)
Proteomics , Software , Gene Expression Profiling/methods , Epigenomics , Single-Cell Analysis
9.
Cell ; 149(7): 1607-21, 2012 Jun 22.
Article in English | MEDLINE | ID: mdl-22579045

ABSTRACT

We show that amino acid covariation in proteins, extracted from the evolutionary sequence record, can be used to fold transmembrane proteins. We use this technique to predict previously unknown 3D structures for 11 transmembrane proteins (with up to 14 helices) from their sequences alone. The prediction method (EVfold_membrane) applies a maximum entropy approach to infer evolutionary covariation in pairs of sequence positions within a protein family and then generates all-atom models with the derived pairwise distance constraints. We benchmark the approach with blinded de novo computation of known transmembrane protein structures from 23 families, demonstrating unprecedented accuracy of the method for large transmembrane proteins. We show how the method can predict oligomerization, functional sites, and conformational changes in transmembrane proteins. With the rapid rise in large-scale sequencing, more accurate and more comprehensive information on evolutionary constraints can be decoded from genetic variation, greatly expanding the repertoire of transmembrane proteins amenable to modeling by this method.


Subject(s)
Algorithms , Membrane Proteins/chemistry , Membrane Proteins/genetics , Amino Acid Sequence , Animals , Conserved Sequence , Evolution, Molecular , Humans , Models, Molecular , Protein Conformation , Protein Structure, Secondary , Sequence Alignment , Structural Homology, Protein
10.
Cell ; 149(3): 693-707, 2012 Apr 27.
Article in English | MEDLINE | ID: mdl-22541438

ABSTRACT

Small RNA-mediated gene regulation during development causes long-lasting changes in cellular phenotypes. To determine whether small RNAs of the adult brain can regulate memory storage, a process that requires stable and long-lasting changes in the functional state of neurons, we generated small RNA libraries from the Aplysia CNS. In these libraries, we discovered an unexpectedly abundant expression of a 28 nucleotide sized class of piRNAs in brain, which had been thought to be germline specific. These piRNAs have unique biogenesis patterns, predominant nuclear localization, and robust sensitivity to serotonin, a modulatory transmitter that is important for memory. We find that the Piwi/piRNA complex facilitates serotonin-dependent methylation of a conserved CpG island in the promoter of CREB2, the major inhibitory constraint of memory in Aplysia, leading to enhanced long-term synaptic facilitation. These findings provide a small RNA-mediated gene regulatory mechanism for establishing stable long-term changes in neurons for the persistence of memory.


Subject(s)
Epigenomics , Memory , Neuronal Plasticity , Neurons/physiology , RNA, Small Interfering/metabolism , Animals , Aplysia/metabolism , Base Sequence , Gene Expression Regulation , Humans , Molecular Sequence Data , Nerve Tissue Proteins/metabolism , Synapses/metabolism
11.
Nature ; 569(7755): 275-279, 2019 05.
Article in English | MEDLINE | ID: mdl-30996345

ABSTRACT

Drosophila Lgl and its mammalian homologues, LLGL1 and LLGL2, are scaffolding proteins that regulate the establishment of apical-basal polarity in epithelial cells1,2. Whereas Lgl functions as a tumour suppressor in Drosophila1, the roles of mammalian LLGL1 and LLGL2 in cancer are unclear. The majority (about 75%) of breast cancers express oestrogen receptors (ERs)3, and patients with these tumours receive endocrine treatment4. However, the development of resistance to endocrine therapy and metastatic progression are leading causes of death for patients with ER+ disease4. Here we report that, unlike LLGL1, LLGL2 is overexpressed in ER+ breast cancer and promotes cell proliferation under nutrient stress. LLGL2 regulates cell surface levels of a leucine transporter, SLC7A5, by forming a trimeric complex with SLC7A5 and a regulator of membrane fusion, YKT6, to promote leucine uptake and cell proliferation. The oestrogen receptor targets LLGL2 expression. Resistance to endocrine treatment in breast cancer cells was associated with SLC7A5- and LLGL2-dependent adaption to nutrient stress. SLC7A5 was necessary and sufficient to confer resistance to tamoxifen treatment, identifying SLC7A5 as a potential therapeutic target for overcoming resistance to endocrine treatments in breast cancer. Thus, LLGL2 functions as a promoter of tumour growth and not as a tumour suppressor in ER+ breast cancer. Beyond breast cancer, adaptation to nutrient stress is critically important5, and our findings identify an unexpected role for LLGL2 in this process.


Subject(s)
Breast Neoplasms/metabolism , Cytoskeletal Proteins/metabolism , Leucine/metabolism , Receptors, Estrogen/metabolism , Animals , Breast Neoplasms/pathology , Cell Line, Tumor , Cell Proliferation/drug effects , Estrogens/pharmacology , Female , Humans , Large Neutral Amino Acid-Transporter 1/metabolism , Mice , R-SNARE Proteins/metabolism
12.
Mol Cell Proteomics ; 22(8): 100602, 2023 08.
Article in English | MEDLINE | ID: mdl-37343696

ABSTRACT

Treatment and relevant targets for breast cancer (BC) remain limited, especially for triple-negative BC (TNBC). We identified 6091 proteins of 76 human BC cell lines using data-independent acquisition (DIA). Integrating our proteomic findings with prior multi-omics datasets, we found that including proteomics data improved drug sensitivity predictions and provided insights into the mechanisms of action. We subsequently profiled the proteomic changes in nine cell lines (five TNBC and four non-TNBC) treated with EGFR/AKT/mTOR inhibitors. In TNBC, metabolism pathways were dysregulated after EGFR/mTOR inhibitor treatment, while RNA modification and cell cycle pathways were affected by AKT inhibitor. This systematic multi-omics and in-depth analysis of the proteome of BC cells can help prioritize potential therapeutic targets and provide insights into adaptive resistance in TNBC.


Subject(s)
Signal Transduction , Triple Negative Breast Neoplasms , Humans , Proto-Oncogene Proteins c-akt/metabolism , Proteomics , Cell Proliferation , Cell Line, Tumor , Drug Resistance, Neoplasm/genetics , Triple Negative Breast Neoplasms/metabolism , ErbB Receptors/metabolism
13.
Nucleic Acids Res ; 50(D1): D687-D692, 2022 01 07.
Article in English | MEDLINE | ID: mdl-34788843

ABSTRACT

The Reactome Knowledgebase (https://reactome.org), an Elixir core resource, provides manually curated molecular details across a broad range of physiological and pathological biological processes in humans, including both hereditary and acquired disease processes. The processes are annotated as an ordered network of molecular transformations in a single consistent data model. Reactome thus functions both as a digital archive of manually curated human biological processes and as a tool for discovering functional relationships in data such as gene expression profiles or somatic mutation catalogs from tumor cells. Recent curation work has expanded our annotations of normal and disease-associated signaling processes and of the drugs that target them, in particular infections caused by the SARS-CoV-1 and SARS-CoV-2 coronaviruses and the host response to infection. New tools support better simultaneous analysis of high-throughput data from multiple sources and the placement of understudied ('dark') proteins from analyzed datasets in the context of Reactome's manually curated pathways.


Subject(s)
Antiviral Agents/pharmacology , Knowledge Bases , Proteins/metabolism , COVID-19/metabolism , Data Curation , Genome, Human , Host-Pathogen Interactions , Humans , Proteins/genetics , Signal Transduction , Software
14.
J Proteome Res ; 22(9): 2847-2859, 2023 09 01.
Article in English | MEDLINE | ID: mdl-37555633

ABSTRACT

The ongoing pandemic of the coronavirus disease 2019 (COVID-19) caused by the severe acute respiratory syndrome coronavirus 2 still has limited treatment options. Our understanding of the molecular dysregulations that occur in response to infection remains incomplete. We developed a web application COVIDpro (https://www.guomics.com/covidPro/) that includes proteomics data obtained from 41 original studies conducted in 32 hospitals worldwide, involving 3077 patients and covering 19 types of clinical specimens, predominantly plasma and serum. The data set encompasses 53 protein expression matrices, comprising a total of 5434 samples and 14,403 unique proteins. We identified a panel of proteins that exhibit significant dysregulation, enabling the classification of COVID-19 patients into severe and non-severe disease categories. The proteomic signatures achieved promising results in distinguishing severe cases, with a mean area under the curve of 0.87 and accuracy of 0.80 across five independent test sets. COVIDpro serves as a valuable resource for testing hypotheses and exploring potential targets for novel treatments in COVID-19 patients.


Subject(s)
COVID-19 , Humans , Proteomics , SARS-CoV-2
15.
Nucleic Acids Res ; 49(D1): D1083-D1093, 2021 01 08.
Article in English | MEDLINE | ID: mdl-33196823

ABSTRACT

CellMiner Cross-Database (CellMinerCDB, discover.nci.nih.gov/cellminercdb) allows integration and analysis of molecular and pharmacological data within and across cancer cell line datasets from the National Cancer Institute (NCI), Broad Institute, Sanger/MGH and MD Anderson Cancer Center (MDACC). We present CellMinerCDB 1.2 with updates to datasets from NCI-60, Broad Cancer Cell Line Encyclopedia and Sanger/MGH, and the addition of new datasets, including NCI-ALMANAC drug combination, MDACC Cell Line Project proteomic, NCI-SCLC DNA copy number and methylation data, and Broad methylation, genetic dependency and metabolomic datasets. CellMinerCDB (v1.2) includes several improvements over the previously published version: (i) new and updated datasets; (ii) support for pattern comparisons and multivariate analyses across data sources; (iii) updated annotations with drug mechanism of action information and biologically relevant multigene signatures; (iv) analysis speedups via caching; (v) a new dataset download feature; (vi) improved visualization of subsets of multiple tissue types; (vii) breakdown of univariate associations by tissue type; and (viii) enhanced help information. The curation and common annotations (e.g. tissues of origin and identifiers) provided here across pharmacogenomic datasets increase the utility of the individual datasets to address multiple researcher question types, including data reproducibility, biomarker discovery and multivariate analysis of drug activity.


Subject(s)
Computational Biology/methods , Databases, Factual , Neoplasms/metabolism , Pharmacogenetics/methods , Proteomics/methods , Cell Line, Tumor , Data Curation/methods , Data Mining/methods , Drug Therapy/methods , Genomics/methods , Humans , Internet , Neoplasms/drug therapy , Neoplasms/genetics
16.
Nucleic Acids Res ; 49(W1): W535-W540, 2021 07 02.
Article in English | MEDLINE | ID: mdl-33999203

ABSTRACT

Since 1992 PredictProtein (https://predictprotein.org) is a one-stop online resource for protein sequence analysis with its main site hosted at the Luxembourg Centre for Systems Biomedicine (LCSB) and queried monthly by over 3,000 users in 2020. PredictProtein was the first Internet server for protein predictions. It pioneered combining evolutionary information and machine learning. Given a protein sequence as input, the server outputs multiple sequence alignments, predictions of protein structure in 1D and 2D (secondary structure, solvent accessibility, transmembrane segments, disordered regions, protein flexibility, and disulfide bridges) and predictions of protein function (functional effects of sequence variation or point mutations, Gene Ontology (GO) terms, subcellular localization, and protein-, RNA-, and DNA binding). PredictProtein's infrastructure has moved to the LCSB increasing throughput; the use of MMseqs2 sequence search reduced runtime five-fold (apparently without lowering performance of prediction methods); user interface elements improved usability, and new prediction methods were added. PredictProtein recently included predictions from deep learning embeddings (GO and secondary structure) and a method for the prediction of proteins and residues binding DNA, RNA, or other proteins. PredictProtein.org aspires to provide reliable predictions to computational and experimental biologists alike. All scripts and methods are freely available for offline execution in high-throughput settings.


Subject(s)
Protein Conformation , Software , Binding Sites , Coronavirus Nucleocapsid Proteins/chemistry , DNA-Binding Proteins/chemistry , Phosphoproteins/chemistry , Protein Structure, Secondary , Proteins/chemistry , Proteins/physiology , RNA-Binding Proteins/chemistry , Sequence Alignment , Sequence Analysis, Protein
17.
Nucleic Acids Res ; 48(D1): D489-D497, 2020 01 08.
Article in English | MEDLINE | ID: mdl-31647099

ABSTRACT

Pathway Commons (https://www.pathwaycommons.org) is an integrated resource of publicly available information about biological pathways including biochemical reactions, assembly of biomolecular complexes, transport and catalysis events and physical interactions involving proteins, DNA, RNA, and small molecules (e.g. metabolites and drug compounds). Data is collected from multiple providers in standard formats, including the Biological Pathway Exchange (BioPAX) language and the Proteomics Standards Initiative Molecular Interactions format, and then integrated. Pathway Commons provides biologists with (i) tools to search this comprehensive resource, (ii) a download site offering integrated bulk sets of pathway data (e.g. tables of interactions and gene sets), (iii) reusable software libraries for working with pathway information in several programming languages (Java, R, Python and Javascript) and (iv) a web service for programmatically querying the entire dataset. Visualization of pathways is supported using the Systems Biological Graphical Notation (SBGN). Pathway Commons currently contains data from 22 databases with 4794 detailed human biochemical processes (i.e. pathways) and ∼2.3 million interactions. To enhance the usability of this large resource for end-users, we develop and maintain interactive web applications and training materials that enable pathway exploration and advanced analysis.


Subject(s)
Databases, Factual , Metabolic Networks and Pathways , Software , Genome, Human , Genomics/methods , Humans , Metabolomics/methods
18.
PLoS Comput Biol ; 16(7): e1007909, 2020 07.
Article in English | MEDLINE | ID: mdl-32667922

ABSTRACT

Cancer cells have genetic alterations that often directly affect intracellular protein signaling processes allowing them to bypass control mechanisms for cell death, growth and division. Cancer drugs targeting these alterations often work initially, but resistance is common. Combinations of targeted drugs may overcome or prevent resistance, but their selection requires context-specific knowledge of signaling pathways including complex interactions such as feedback loops and crosstalk. To infer quantitative pathway models, we collected a rich dataset on a melanoma cell line: Following perturbation with 54 drug combinations, we measured 124 (phospho-)protein levels and phenotypic response (cell growth, apoptosis) in a time series from 10 minutes to 67 hours. From these data, we trained time-resolved mathematical models that capture molecular interactions and the coupling of molecular levels to cellular phenotype, which in turn reveal the main direct or indirect molecular responses to each drug. Systematic model simulations identified novel combinations of drugs predicted to reduce the survival of melanoma cells, with partial experimental verification. This particular application of perturbation biology demonstrates the potential impact of combining time-resolved data with modeling for the discovery of new combinations of cancer drugs.


Subject(s)
Antineoplastic Agents/pharmacology , Melanoma , Phosphoproteins , Cell Line, Tumor , Cell Survival/drug effects , Drug Therapy, Combination , Humans , Models, Biological , Phosphoproteins/analysis , Phosphoproteins/metabolism , Signal Transduction/drug effects , Systems Biology
19.
Nature ; 517(7533): 205-8, 2015 Jan 08.
Article in English | MEDLINE | ID: mdl-25337874

ABSTRACT

The gastrointestinal tracts of mammals are colonized by hundreds of microbial species that contribute to health, including colonization resistance against intestinal pathogens. Many antibiotics destroy intestinal microbial communities and increase susceptibility to intestinal pathogens. Among these, Clostridium difficile, a major cause of antibiotic-induced diarrhoea, greatly increases morbidity and mortality in hospitalized patients. Which intestinal bacteria provide resistance to C. difficile infection and their in vivo inhibitory mechanisms remain unclear. Here we correlate loss of specific bacterial taxa with development of infection, by treating mice with different antibiotics that result in distinct microbiota changes and lead to varied susceptibility to C. difficile. Mathematical modelling augmented by analyses of the microbiota of hospitalized patients identifies resistance-associated bacteria common to mice and humans. Using these platforms, we determine that Clostridium scindens, a bile acid 7α-dehydroxylating intestinal bacterium, is associated with resistance to C. difficile infection and, upon administration, enhances resistance to infection in a secondary bile acid dependent fashion. Using a workflow involving mouse models, clinical studies, metagenomic analyses, and mathematical modelling, we identify a probiotic candidate that corrects a clinically relevant microbiome deficiency. These findings have implications for the rational design of targeted antimicrobials as well as microbiome-based diagnostics and therapeutics for individuals at risk of C. difficile infection.


Subject(s)
Bile Acids and Salts/metabolism , Clostridioides difficile/physiology , Disease Susceptibility/microbiology , Intestinal Mucosa/metabolism , Intestines/microbiology , Microbiota/physiology , Animals , Anti-Bacterial Agents/pharmacology , Biological Evolution , Clostridioides difficile/drug effects , Clostridium/metabolism , Colitis/metabolism , Colitis/microbiology , Colitis/prevention & control , Colitis/therapy , Feces/microbiology , Female , Humans , Intestines/drug effects , Metagenome/genetics , Mice , Mice, Inbred C57BL , Microbiota/drug effects , Microbiota/genetics , Symbiosis
20.
Bioinformatics ; 35(9): 1582-1584, 2019 05 01.
Article in English | MEDLINE | ID: mdl-30304492

ABSTRACT

SUMMARY: Coevolutionary sequence analysis has become a commonly used technique for de novo prediction of the structure and function of proteins, RNA, and protein complexes. We present the EVcouplings framework, a fully integrated open-source application and Python package for coevolutionary analysis. The framework enables generation of sequence alignments, calculation and evaluation of evolutionary couplings (ECs), and de novo prediction of structure and mutation effects. The combination of an easy to use, flexible command line interface and an underlying modular Python package makes the full power of coevolutionary analyses available to entry-level and advanced users. AVAILABILITY AND IMPLEMENTATION: https://github.com/debbiemarkslab/evcouplings.


Subject(s)
Sequence Analysis , Software , Proteins , RNA , Sequence Alignment
SELECTION OF CITATIONS
SEARCH DETAIL