Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 164
Filter
Add more filters

Country/Region as subject
Publication year range
1.
Cell ; 145(3): 447-58, 2011 Apr 29.
Article in English | MEDLINE | ID: mdl-21529716

ABSTRACT

Random X inactivation represents a paradigm for monoallelic gene regulation during early ES cell differentiation. In mice, the choice of X chromosome to inactivate in XX cells is ensured by monoallelic regulation of Xist RNA via its antisense transcription unit Tsix/Xite. Homologous pairing events have been proposed to underlie asymmetric Tsix expression, but direct evidence has been lacking owing to their dynamic and transient nature. Here we investigate the live-cell dynamics and outcome of Tsix pairing in differentiating mouse ES cells. We find an overall increase in genome dynamics including the Xics during early differentiation. During pairing, however, Xic loci show markedly reduced movements. Upon separation, Tsix expression becomes transiently monoallelic, providing a window of opportunity for monoallelic Xist upregulation. Our findings reveal the spatiotemporal choreography of the X chromosomes during early differentiation and indicate a direct role for pairing in facilitating symmetry-breaking and monoallelic regulation of Xist during random X inactivation.


Subject(s)
Cell Differentiation , Chromosome Pairing , Embryonic Stem Cells/metabolism , X Chromosome Inactivation , X Chromosome/metabolism , Animals , Embryonic Stem Cells/cytology , Female , Mice , RNA, Long Noncoding , RNA, Untranslated/genetics , Time-Lapse Imaging
3.
Cell ; 141(6): 956-69, 2010 Jun 11.
Article in English | MEDLINE | ID: mdl-20550932

ABSTRACT

During X chromosome inactivation (XCI), Xist RNA coats and silences one of the two X chromosomes in female cells. Little is known about how XCI spreads across the chromosome, although LINE-1 elements have been proposed to play a role. Here we show that LINEs participate in creating a silent nuclear compartment into which genes become recruited. A subset of young LINE-1 elements, however, is expressed during XCI, rather than being silenced. We demonstrate that such LINE expression requires the specific heterochromatic state induced by Xist. These LINEs often lie within escape-prone regions of the X chromosome, but close to genes that are subject to XCI, and are associated with putative endo-siRNAs. LINEs may thus facilitate XCI at different levels, with silent LINEs participating in assembly of a heterochromatic nuclear compartment induced by Xist, and active LINEs participating in local propagation of XCI into regions that would otherwise be prone to escape.


Subject(s)
Heterochromatin/metabolism , Long Interspersed Nucleotide Elements , X Chromosome Inactivation , Animals , Cell Line , Embryonic Stem Cells/metabolism , Female , Humans , Mice , RNA, Long Noncoding , RNA, Untranslated/metabolism , Transcription, Genetic , X Chromosome/metabolism
4.
BMC Bioinformatics ; 25(1): 199, 2024 May 24.
Article in English | MEDLINE | ID: mdl-38789933

ABSTRACT

BACKGROUND: Computational models in systems biology are becoming more important with the advancement of experimental techniques to query the mechanistic details responsible for leading to phenotypes of interest. In particular, Boolean models are well fit to describe the complexity of signaling networks while being simple enough to scale to a very large number of components. With the advance of Boolean model inference techniques, the field is transforming from an artisanal way of building models of moderate size to a more automatized one, leading to very large models. In this context, adapting the simulation software for such increases in complexity is crucial. RESULTS: We present two new developments in the continuous time Boolean simulators: MaBoSS.MPI, a parallel implementation of MaBoSS which can exploit the computational power of very large CPU clusters, and MaBoSS.GPU, which can use GPU accelerators to perform these simulations. CONCLUSION: These implementations enable simulation and exploration of the behavior of very large models, thus becoming a valuable analysis tool for the systems biology community.


Subject(s)
Computer Simulation , Software , Systems Biology/methods , Computational Biology/methods , Algorithms , Computer Graphics
5.
Bioinformatics ; 39(6)2023 06 01.
Article in English | MEDLINE | ID: mdl-37289551

ABSTRACT

MOTIVATION: Mathematical models of biological processes altered in cancer are built using the knowledge of complex networks of signaling pathways, detailing the molecular regulations inside different cell types, such as tumor cells, immune and other stromal cells. If these models mainly focus on intracellular information, they often omit a description of the spatial organization among cells and their interactions, and with the tumoral microenvironment. RESULTS: We present here a model of tumor cell invasion simulated with PhysiBoSS, a multiscale framework, which combines agent-based modeling and continuous time Markov processes applied on Boolean network models. With this model, we aim to study the different modes of cell migration and to predict means to block it by considering not only spatial information obtained from the agent-based simulation but also intracellular regulation obtained from the Boolean model.Our multiscale model integrates the impact of gene mutations with the perturbation of the environmental conditions and allows the visualization of the results with 2D and 3D representations. The model successfully reproduces single and collective migration processes and is validated on published experiments on cell invasion. In silico experiments are suggested to search for possible targets that can block the more invasive tumoral phenotypes. AVAILABILITY AND IMPLEMENTATION: https://github.com/sysbio-curie/Invasion_model_PhysiBoSS.


Subject(s)
Models, Biological , Models, Theoretical , Humans , Computer Simulation , Signal Transduction/genetics , Neoplasm Invasiveness , Tumor Microenvironment
6.
Bioinformatics ; 38(10): 2963-2964, 2022 05 13.
Article in English | MEDLINE | ID: mdl-35561190

ABSTRACT

SUMMARY: We developed BIODICA, an integrated computational environment for application of independent component analysis (ICA) to bulk and single-cell molecular profiles, interpretation of the results in terms of biological functions and correlation with metadata. The computational core is the novel Python package stabilized-ica which provides interface to several ICA algorithms, a stabilization procedure, meta-analysis and component interpretation tools. BIODICA is equipped with a user-friendly graphical user interface, allowing non-experienced users to perform the ICA-based omics data analysis. The results are provided in interactive ways, thus facilitating communication with biology experts. AVAILABILITY AND IMPLEMENTATION: BIODICA is implemented in Java, Python and JavaScript. The source code is freely available on GitHub under the MIT and the GNU LGPL licenses. BIODICA is supported on all major operating systems. URL: https://sysbio-curie.github.io/biodica-environment/.


Subject(s)
Algorithms , Software , Computational Biology/methods , Metadata
7.
PLoS Comput Biol ; 17(1): e1007900, 2021 01.
Article in English | MEDLINE | ID: mdl-33507915

ABSTRACT

The study of response to cancer treatments has benefited greatly from the contribution of different omics data but their interpretation is sometimes difficult. Some mathematical models based on prior biological knowledge of signaling pathways facilitate this interpretation but often require fitting of their parameters using perturbation data. We propose a more qualitative mechanistic approach, based on logical formalism and on the sole mapping and interpretation of omics data, and able to recover differences in sensitivity to gene inhibition without model training. This approach is showcased by the study of BRAF inhibition in patients with melanomas and colorectal cancers who experience significant differences in sensitivity despite similar omics profiles. We first gather information from literature and build a logical model summarizing the regulatory network of the mitogen-activated protein kinase (MAPK) pathway surrounding BRAF, with factors involved in the BRAF inhibition resistance mechanisms. The relevance of this model is verified by automatically assessing that it qualitatively reproduces response or resistance behaviors identified in the literature. Data from over 100 melanoma and colorectal cancer cell lines are then used to validate the model's ability to explain differences in sensitivity. This generic model is transformed into personalized cell line-specific logical models by integrating the omics information of the cell lines as constraints of the model. The use of mutations alone allows personalized models to correlate significantly with experimental sensitivities to BRAF inhibition, both from drug and CRISPR targeting, and even better with the joint use of mutations and RNA, supporting multi-omics mechanistic models. A comparison of these untrained models with learning approaches highlights similarities in interpretation and complementarity depending on the size of the datasets. This parsimonious pipeline, which can easily be extended to other biological questions, makes it possible to explore the mechanistic causes of the response to treatment, on an individualized basis.


Subject(s)
Colorectal Neoplasms , Melanoma , Patient-Specific Modeling , Proto-Oncogene Proteins B-raf/antagonists & inhibitors , Antineoplastic Agents/pharmacology , Antineoplastic Agents/therapeutic use , CRISPR-Cas Systems , Cell Line, Tumor , Colorectal Neoplasms/genetics , Colorectal Neoplasms/metabolism , Colorectal Neoplasms/therapy , Computational Biology , Genetic Therapy , Humans , Machine Learning , Melanoma/genetics , Melanoma/metabolism , Melanoma/therapy , Signal Transduction/drug effects , Transcriptome/drug effects
8.
Int J Cancer ; 148(8): 1895-1909, 2021 04 15.
Article in English | MEDLINE | ID: mdl-33368296

ABSTRACT

Single-nucleotide polymorphisms (SNPs) in over 180 loci have been associated with breast cancer (BC) through genome-wide association studies involving mostly unselected population-based case-control series. Some of them modify BC risk of women carrying a BRCA1 or BRCA2 (BRCA1/2) mutation and may also explain BC risk variability in BC-prone families with no BRCA1/2 mutation. Here, we assessed the contribution of SNPs of the iCOGS array in GENESIS consisting of BC cases with no BRCA1/2 mutation and a sister with BC, and population controls. Genotyping data were available for 1281 index cases, 731 sisters with BC, 457 unaffected sisters and 1272 controls. In addition to the standard SNP-level analysis using index cases and controls, we performed pedigree-based association tests to capture transmission information in the sibships. We also performed gene- and pathway-level analyses to maximize the power to detect associations with lower-frequency SNPs or those with modest effect sizes. While SNP-level analyses identified 18 loci, gene-level analyses identified 112 genes. Furthermore, 31 Kyoto Encyclopedia of Genes and Genomes and 7 Atlas of Cancer Signaling Network pathways were highlighted (false discovery rate of 5%). Using results from the "index case-control" analysis, we built pathway-derived polygenic risk scores (PRS) and assessed their performance in the population-based CECILE study and in a data set composed of GENESIS-affected sisters and CECILE controls. Although these PRS had poor predictive value in the general population, they performed better than a PRS built using our SNP-level findings, and we found that the joint effect of family history and PRS needs to be considered in risk prediction models.


Subject(s)
Breast Neoplasms/genetics , Genetic Predisposition to Disease/genetics , Mutation , Polymorphism, Single Nucleotide , Signal Transduction/genetics , BRCA1 Protein/genetics , BRCA1 Protein/metabolism , BRCA2 Protein/genetics , BRCA2 Protein/metabolism , Breast Neoplasms/diagnosis , Breast Neoplasms/metabolism , Case-Control Studies , Female , Gene Regulatory Networks/genetics , Genetic Testing/methods , Genome-Wide Association Study/methods , Humans , Protein Interaction Maps/genetics , ROC Curve , Siblings
9.
Brief Bioinform ; 20(4): 1238-1249, 2019 07 19.
Article in English | MEDLINE | ID: mdl-29237040

ABSTRACT

Mathematical models can serve as a tool to formalize biological knowledge from diverse sources, to investigate biological questions in a formal way, to test experimental hypotheses, to predict the effect of perturbations and to identify underlying mechanisms. We present a pipeline of computational tools that performs a series of analyses to explore a logical model's properties. A logical model of initiation of the metastatic process in cancer is used as a transversal example. We start by analysing the structure of the interaction network constructed from the literature or existing databases. Next, we show how to translate this network into a mathematical object, specifically a logical model, and how robustness analyses can be applied to it. We explore the visualization of the stable states, defined as specific attractors of the model, and match them to cellular fates or biological read-outs. With the different tools we present here, we explain how to assign to each solution of the model a probability and how to identify genetic interactions using mutant phenotype probabilities. Finally, we connect the model to relevant experimental data: we present how some data analyses can direct the construction of the network, and how the solutions of a mathematical model can also be compared with experimental data, with a particular focus on high-throughput data in cancer biology. A step-by-step tutorial is provided as a Supplementary Material and all models, tools and scripts are provided on an accompanying website: https://github.com/sysbio-curie/Logical_modelling_pipeline.


Subject(s)
Models, Biological , Signal Transduction , Computational Biology/methods , Computer Simulation , Databases, Factual , Disease , Epistasis, Genetic , Gene Regulatory Networks , Humans , Logistic Models , Mathematical Concepts , Metabolic Networks and Pathways , Mutation , Neoplasm Metastasis/genetics , Neoplasm Metastasis/pathology , Neoplasm Metastasis/physiopathology , Software , Systems Biology/statistics & numerical data
10.
Brief Bioinform ; 20(2): 701-716, 2019 03 25.
Article in English | MEDLINE | ID: mdl-29726961

ABSTRACT

Cancer initiation and progression are associated with multiple molecular mechanisms. The knowledge of these mechanisms is expanding and should be converted into guidelines for tackling the disease. Here, we discuss the formalization of biological knowledge into a comprehensive resource: the Atlas of Cancer Signalling Network (ACSN) and the Google Maps-based tool NaviCell, which supports map navigation. The application of ACSN for omics data visualization, in the context of signalling maps, is possible via the NaviCell Web Service module and through the NaviCom tool. It allows generation of network-based molecular portraits of cancer using multilevel omics data. We review how these resources and tools are applied for cancer preclinical studies. Structural analysis of the maps together with omics data helps to rationalize the synergistic effects of drugs and allows design of complex disease stage-specific druggable interventions. The use of ACSN modules and maps as signatures of biological functions can help in cancer data analysis and interpretation. In addition, they empowered finding of associations between perturbations in particular molecular mechanisms and the risk to develop a specific type of cancer. These approaches are helpful, among others, to study the interplay between molecular mechanisms of cancer. It opens an opportunity to decipher how gene interactions govern the hallmarks of cancer in specific contexts. We discuss a perspective to develop a flexible methodology and a pipeline to enable systematic omics data analysis in the context of signalling network maps, for stratifying patients and suggesting interventions points and drug repositioning in cancer and other diseases.


Subject(s)
Atlases as Topic , Neoplasms/metabolism , Signal Transduction , Computational Biology/methods , Humans , Neoplasms/genetics
11.
Brief Bioinform ; 20(2): 659-670, 2019 03 25.
Article in English | MEDLINE | ID: mdl-29688273

ABSTRACT

The Disease Maps Project builds on a network of scientific and clinical groups that exchange best practices, share information and develop systems biomedicine tools. The project aims for an integrated, highly curated and user-friendly platform for disease-related knowledge. The primary focus of disease maps is on interconnected signaling, metabolic and gene regulatory network pathways represented in standard formats. The involvement of domain experts ensures that the key disease hallmarks are covered and relevant, up-to-date knowledge is adequately represented. Expert-curated and computer readable, disease maps may serve as a compendium of knowledge, allow for data-supported hypothesis generation or serve as a scaffold for the generation of predictive mathematical models. This article summarizes the 2nd Disease Maps Community meeting, highlighting its important topics and outcomes. We outline milestones on the roadmap for the future development of disease maps, including creating and maintaining standardized disease maps; sharing parts of maps that encode common human disease mechanisms; providing technical solutions for complexity management of maps; and Web tools for in-depth exploration of such maps. A dedicated discussion was focused on mathematical modeling approaches, as one of the main goals of disease map development is the generation of mathematically interpretable representations to predict disease comorbidity or drug response and to suggest drug repositioning, altogether supporting clinical decisions.


Subject(s)
Gene Regulatory Networks , Genetic Predisposition to Disease , Computational Biology , Humans , Models, Statistical , Translational Research, Biomedical
12.
Bioinformatics ; 36(8): 2620-2622, 2020 04 15.
Article in English | MEDLINE | ID: mdl-31904823

ABSTRACT

MOTIVATION: CellDesigner is a well-established biological map editor used in many large-scale scientific efforts. However, the interoperability between the Systems Biology Graphical Notation (SBGN) Markup Language (SBGN-ML) and the CellDesigner's proprietary Systems Biology Markup Language (SBML) extension formats remains a challenge due to the proprietary extensions used in CellDesigner files. RESULTS: We introduce a library named cd2sbgnml and an associated web service for bidirectional conversion between CellDesigner's proprietary SBML extension and SBGN-ML formats. We discuss the functionality of the cd2sbgnml converter, which was successfully used for the translation of comprehensive large-scale diagrams such as the RECON Human Metabolic network and the complete Atlas of Cancer Signalling Network, from the CellDesigner file format into SBGN-ML. AVAILABILITY AND IMPLEMENTATION: The cd2sbgnml conversion library and the web service were developed in Java, and distributed under the GNU Lesser General Public License v3.0. The sources along with a set of examples are available on GitHub (https://github.com/sbgn/cd2sbgnml and https://github.com/sbgn/cd2sbgnml-webservice, respectively). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Software , Systems Biology , Humans , Metabolic Networks and Pathways , Signal Transduction
13.
PLoS Comput Biol ; 16(2): e1007652, 2020 02.
Article in English | MEDLINE | ID: mdl-32069277

ABSTRACT

English Wikipedia, containing more than five millions articles, has approximately eleven thousands web pages devoted to proteins or genes most of which were generated by the Gene Wiki project. These pages contain information about interactions between proteins and their functional relationships. At the same time, they are interconnected with other Wikipedia pages describing biological functions, diseases, drugs and other topics curated by independent, not coordinated collective efforts. Therefore, Wikipedia contains a directed network of protein functional relations or physical interactions embedded into the global network of the encyclopedia terms, which defines hidden (indirect) functional proximity between proteins. We applied the recently developed reduced Google Matrix (REGOMAX) algorithm in order to extract the network of hidden functional connections between proteins in Wikipedia. In this network we discovered tight communities which reflect areas of interest in molecular biology or medicine and can be considered as definitions of biological functions shaped by collective intelligence. Moreover, by comparing two snapshots of Wikipedia graph (from years 2013 and 2017), we studied the evolution of the network of direct and hidden protein connections. We concluded that the hidden connections are more dynamic compared to the direct ones and that the size of the hidden interaction communities grows with time. We recapitulate the results of Wikipedia protein community analysis and annotation in the form of an interactive online map, which can serve as a portal to the Gene Wiki project.


Subject(s)
Biological Phenomena , Computational Biology/methods , Protein Interaction Mapping , Proteins/chemistry , Search Engine , Algorithms , Cluster Analysis , Databases, Genetic , Internet , Markov Chains , Probability
14.
Nucleic Acids Res ; 47(5): 2205-2215, 2019 03 18.
Article in English | MEDLINE | ID: mdl-30657980

ABSTRACT

MicroRNAs play important roles in many biological processes. Their aberrant expression can have oncogenic or tumor suppressor function directly participating to carcinogenesis, malignant transformation, invasiveness and metastasis. Indeed, miRNA profiles can distinguish not only between normal and cancerous tissue but they can also successfully classify different subtypes of a particular cancer. Here, we focus on a particular class of transcripts encoding polycistronic miRNA genes that yields multiple miRNA components. We describe 'clustered MiRNA Master Regulator Analysis (ClustMMRA)', a fully redesigned release of the MMRA computational pipeline (MiRNA Master Regulator Analysis), developed to search for clustered miRNAs potentially driving cancer molecular subtyping. Genomically clustered miRNAs are frequently co-expressed to target different components of pro-tumorigenic signaling pathways. By applying ClustMMRA to breast cancer patient data, we identified key miRNA clusters driving the phenotype of different tumor subgroups. The pipeline was applied to two independent breast cancer datasets, providing statistically concordant results between the two analyses. We validated in cell lines the miR-199/miR-214 as a novel cluster of miRNAs promoting the triple negative breast cancer (TNBC) phenotype through its control of proliferation and EMT.


Subject(s)
Epithelial-Mesenchymal Transition/genetics , MicroRNAs/genetics , Multigene Family/genetics , Triple Negative Breast Neoplasms/genetics , Triple Negative Breast Neoplasms/pathology , Cell Line, Tumor , Cell Proliferation , Datasets as Topic , Gene Silencing , Humans , Neoplasm Invasiveness/genetics , Reproducibility of Results , Triple Negative Breast Neoplasms/classification
15.
BMC Bioinformatics ; 21(1): 241, 2020 Jun 11.
Article in English | MEDLINE | ID: mdl-32527218

ABSTRACT

BACKGROUND: Solutions to stochastic Boolean models are usually estimated by Monte Carlo simulations, but as the state space of these models can be enormous, there is an inherent uncertainty about the accuracy of Monte Carlo estimates and whether simulations have reached all attractors. Moreover, these models have timescale parameters (transition rates) that the probability values of stationary solutions depend on in complex ways, raising the necessity of parameter sensitivity analysis. We address these two issues by an exact calculation method for this class of models. RESULTS: We show that the stationary probability values of the attractors of stochastic (asynchronous) continuous time Boolean models can be exactly calculated. The calculation does not require Monte Carlo simulations, instead it uses graph theoretical and matrix calculation methods previously applied in the context of chemical kinetics. In this version of the asynchronous updating framework the states of a logical model define a continuous time Markov chain and for a given initial condition the stationary solution is fully defined by the right and left nullspace of the master equation's kinetic matrix. We use topological sorting of the state transition graph and the dependencies between the nullspaces and the kinetic matrix to derive the stationary solution without simulations. We apply this calculation to several published Boolean models to analyze the under-explored question of the effect of transition rates on the stationary solutions and show they can be sensitive to parameter changes. The analysis distinguishes processes robust or, alternatively, sensitive to parameter values, providing both methodological and biological insights. CONCLUSION: Up to an intermediate size (the biggest model analyzed is 23 nodes) stochastic Boolean models can be efficiently solved by an exact matrix method, without using Monte Carlo simulations. Sensitivity analysis with respect to the model's timescale parameters often reveals a small subset of all parameters that primarily determine the stationary probability of attractor states.


Subject(s)
Models, Biological , Monte Carlo Method , Stochastic Processes
16.
Genome Res ; 27(2): 259-268, 2017 02.
Article in English | MEDLINE | ID: mdl-27965291

ABSTRACT

Super-enhancers (SEs) are key transcriptional drivers of cellular, developmental, and disease states in mammals, yet the conservational and regulatory features of these enhancer elements in nonmammalian vertebrates are unknown. To define SEs in zebrafish and enable sequence and functional comparisons to mouse and human SEs, we used genome-wide histone H3 lysine 27 acetylation (H3K27ac) occupancy as a primary SE delineator. Our study determined the set of SEs in pluripotent state cells and adult zebrafish tissues and revealed both similarities and differences between zebrafish and mammalian SEs. Although the total number of SEs was proportional to the genome size, the genomic distribution of zebrafish SEs differed from that of the mammalian SEs. Despite the evolutionary distance separating zebrafish and mammals and the low overall SE sequence conservation, ∼42% of zebrafish SEs were located in close proximity to orthologs that also were associated with SEs in mouse and human. Compared to their nonassociated counterparts, higher sequence conservation was revealed for those SEs that have maintained orthologous gene associations. Functional dissection of two of these SEs identified conserved sequence elements and tissue-specific expression patterns, while chromatin accessibility analyses predicted transcription factors governing the function of pluripotent state zebrafish SEs. Our zebrafish annotations and comparative studies show the extent of SE usage and their conservation across vertebrates, permitting future gene regulatory studies in several tissues.


Subject(s)
Chromatin/genetics , Conserved Sequence/genetics , Enhancer Elements, Genetic , Zebrafish/genetics , Acetylation , Animals , Embryonic Development/genetics , Gene Expression Regulation, Developmental , Genomics , Histones/genetics , Humans , Mice , Transcription Factors/genetics
17.
Bioinformatics ; 35(7): 1188-1196, 2019 04 01.
Article in English | MEDLINE | ID: mdl-30169736

ABSTRACT

MOTIVATION: Due to the complexity and heterogeneity of multicellular biological systems, mathematical models that take into account cell signalling, cell population behaviour and the extracellular environment are particularly helpful. We present PhysiBoSS, an open source software which combines intracellular signalling using Boolean modelling (MaBoSS) and multicellular behaviour using agent-based modelling (PhysiCell). RESULTS: PhysiBoSS provides a flexible and computationally efficient framework to explore the effect of environmental and genetic alterations of individual cells at the population level, bridging the critical gap from single-cell genotype to single-cell phenotype and emergent multicellular behaviour. PhysiBoSS thus becomes very useful when studying heterogeneous population response to treatment, mutation effects, different modes of invasion or isomorphic morphogenesis events. To concretely illustrate a potential use of PhysiBoSS, we studied heterogeneous cell fate decisions in response to TNF treatment. We explored the effect of different treatments and the behaviour of several resistant mutants. We highlighted the importance of spatial information on the population dynamics by considering the effect of competition for resources like oxygen. AVAILABILITY AND IMPLEMENTATION: PhysiBoSS is freely available on GitHub (https://github.com/sysbio-curie/PhysiBoSS), with a Docker image (https://hub.docker.com/r/gletort/physiboss/). It is distributed as open source under the BSD 3-clause license. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Models, Genetic , Signal Transduction , Software , Genotype , Humans , Signal Transduction/genetics , Systems Analysis
18.
Bioinformatics ; 35(21): 4307-4313, 2019 11 01.
Article in English | MEDLINE | ID: mdl-30938767

ABSTRACT

MOTIVATION: Matrix factorization (MF) methods are widely used in order to reduce dimensionality of transcriptomic datasets to the action of few hidden factors (metagenes). MF algorithms have never been compared based on the between-datasets reproducibility of their outputs in similar independent datasets. Lack of this knowledge might have a crucial impact when generalizing the predictions made in a study to others. RESULTS: We systematically test widely used MF methods on several transcriptomic datasets collected from the same cancer type (14 colorectal, 8 breast and 4 ovarian cancer transcriptomic datasets). Inspired by concepts of evolutionary bioinformatics, we design a novel framework based on Reciprocally Best Hit (RBH) graphs in order to benchmark the MF methods for their ability to produce generalizable components. We show that a particular protocol of application of independent component analysis (ICA), accompanied by a stabilization procedure, leads to a significant increase in the between-datasets reproducibility. Moreover, we show that the signals detected through this method are systematically more interpretable than those of other standard methods. We developed a user-friendly tool for performing the Stabilized ICA-based RBH meta-analysis. We apply this methodology to the study of colorectal cancer (CRC) for which 14 independent transcriptomic datasets can be collected. The resulting RBH graph maps the landscape of interconnected factors associated to biological processes or to technological artifacts. These factors can be used as clinical biomarkers or robust and tumor-type specific transcriptomic signatures of tumoral cells or tumoral microenvironment. Their intensities in different samples shed light on the mechanistic basis of CRC molecular subtyping. AVAILABILITY AND IMPLEMENTATION: The RBH construction tool is available from http://goo.gl/DzpwYp. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Transcriptome , Algorithms , Breast Neoplasms , Gene Expression Profiling , Humans , Reproducibility of Results , Tumor Microenvironment
19.
Nat Immunol ; 9(6): 650-7, 2008 Jun.
Article in English | MEDLINE | ID: mdl-18454150

ABSTRACT

Interleukin 17 (IL-17)-producing T helper 17 cells (T(H)-17 cells) have been described as a T helper cell subset distinct from T helper type 1 (T(H)1) and T(H)2 cells, with specific functions in antimicrobial defense and autoimmunity. The factors driving human T(H)-17 differentiation remain controversial. Using a systematic approach combining experimental and computational methods, we show here that transforming growth factor-beta, interleukin 23 (IL-23) and proinflammatory cytokines (IL-1beta and IL-6) were all essential for human T(H)-17 differentiation. However, individual T(H)-17 cell-derived cytokines, such as IL-17, IL-21, IL-22 and IL-6, as well as the global T(H)-17 cytokine profile, were differentially modulated by T(H)-17-promoting cytokines. Transforming growth factor-beta was critical, and its absence induced a shift from a T(H)-17 profile to a T(H)1-like profile. Our results shed new light on the regulation of human T(H)-17 differentiation and provide a framework for the global analysis of T helper responses.


Subject(s)
Cell Differentiation/immunology , Interleukin-17/biosynthesis , Interleukin-23/metabolism , T-Lymphocytes, Helper-Inducer/cytology , Transforming Growth Factor beta/metabolism , Cytokines/metabolism , Humans , T-Lymphocytes, Helper-Inducer/immunology
20.
Entropy (Basel) ; 22(3)2020 Mar 04.
Article in English | MEDLINE | ID: mdl-33286070

ABSTRACT

Multidimensional datapoint clouds representing large datasets are frequently characterized by non-trivial low-dimensional geometry and topology which can be recovered by unsupervised machine learning approaches, in particular, by principal graphs. Principal graphs approximate the multivariate data by a graph injected into the data space with some constraints imposed on the node mapping. Here we present ElPiGraph, a scalable and robust method for constructing principal graphs. ElPiGraph exploits and further develops the concept of elastic energy, the topological graph grammar approach, and a gradient descent-like optimization of the graph topology. The method is able to withstand high levels of noise and is capable of approximating data point clouds via principal graph ensembles. This strategy can be used to estimate the statistical significance of complex data features and to summarize them into a single consensus principal graph. ElPiGraph deals efficiently with large datasets in various fields such as biology, where it can be used for example with single-cell transcriptomic or epigenomic datasets to infer gene expression dynamics and recover differentiation landscapes.

SELECTION OF CITATIONS
SEARCH DETAIL