RESUMEN
Recent advances in biological technologies, such as multi-way chromosome conformation capture (3C), require development of methods for analysis of multi-way interactions. Hypergraphs are mathematically tractable objects that can be utilized to precisely represent and analyze multi-way interactions. Here we present the Hypergraph Analysis Toolbox (HAT), a software package for visualization and analysis of multi-way interactions in complex systems.
Asunto(s)
Cromosomas , Programas InformáticosRESUMEN
Mutations in the melanocortin 4 receptor (MC4R) result in hyperphagia and obesity and are the most common cause of monogenic obesity in humans. Preclinical rodent studies have determined that the critical role of the MC4R in controlling feeding can be mapped in part to its expression in the paraventricular nucleus of the hypothalamus (paraventricular nucleus [PVN]), where it regulates the activity of anorexic neural circuits. Despite the critical role of PVN MC4R neurons in regulating feeding, the in vivo neuronal activity of these cells remains largely unstudied, and the network activity of PVN MC4R neurons has not been determined. Here, we utilize in vivo single-cell endomicroscopic and mathematical approaches to determine the activity and network dynamics of PVN MC4R neurons in response to changes in energy state and pharmacological manipulation of central melanocortin receptors. We determine that PVN MC4R neurons exhibit both quantitative and qualitative changes in response to fasting and refeeding. Pharmacological stimulation of MC4R with the therapeutic MC4R agonist setmelanotide rapidly increases basal PVN MC4R activity, while stimulation of melanocortin 3 receptor (MC3R) inhibits PVN MC4R activity. Finally, we find that distinct PVN MC4R neuronal ensembles encode energy deficit and energy surfeit and that energy surfeit is associated with enhanced network connections within PVN MC4R neurons. These findings provide valuable insight into the neural dynamics underlying hunger and energy surfeit.
Asunto(s)
Conducta Alimentaria/fisiología , Núcleo Hipotalámico Paraventricular/fisiología , Receptor de Melanocortina Tipo 4/metabolismo , Animales , Masculino , Ratones , Microscopía Fluorescente , Red Nerviosa , Imagen Óptica , Núcleo Hipotalámico Paraventricular/citología , Receptor de Melanocortina Tipo 3/agonistas , Análisis de la Célula IndividualRESUMEN
The mammalian adaptive immune system has evolved over millions of years to become an incredibly effective defense against foreign antigens. The adaptive immune system's humoral response creates plasma B cells and memory B cells, each with their own immunological objectives. The affinity maturation process is widely viewed as a heuristic to solve the global optimization problem of finding B cells with high affinity to the antigen. However, memory B cells appear to be purposely selected earlier in the affinity maturation process and have lower affinity. We propose that this memory B cell selection process may be an approximate solution to two optimization problems: optimizing for affinity to similar antigens in the future despite mutations or other minor differences, and optimizing to warm start the generation of plasma B cells in the future. We use simulations to provide evidence for our hypotheses, taking into account data showing that certain B cell mutations are more likely than others. Our findings are consistent with memory B cells having high-affinity to mutated antigens, but do not provide strong evidence that memory B cells will be more useful than selected naive B cells for seeding the secondary germinal centers.
Asunto(s)
Centro Germinal , Memoria Inmunológica , Inmunidad Adaptativa , Animales , Antígenos , Linfocitos BRESUMEN
The ability to measure human aging from molecular profiles has practical implications in many fields, including disease prevention and treatment, forensics, and extension of life. Although chronological age has been linked to changes in DNA methylation, the methylome has not yet been used to measure and compare human aging rates. Here, we build a quantitative model of aging using measurements at more than 450,000 CpG markers from the whole blood of 656 human individuals, aged 19 to 101. This model measures the rate at which an individual's methylome ages, which we show is impacted by gender and genetic variants. We also show that differences in aging rates help explain epigenetic drift and are reflected in the transcriptome. Moreover, we show how our aging model is upheld in other human tissues and reveals an advanced aging rate in tumor tissue. Our model highlights specific components of the aging process and provides a quantitative readout for studying the role of methylation in age-related disease.
Asunto(s)
Envejecimiento/genética , Metilación de ADN , Genoma Humano , Adulto , Anciano , Anciano de 80 o más Años , Epigénesis Genética , Femenino , Estudio de Asociación del Genoma Completo , Humanos , Masculino , Persona de Mediana Edad , Modelos Genéticos , Fenotipo , Análisis de Secuencia de ADN , Transcriptoma , Adulto JovenRESUMEN
This work presents a mathematical study of tissue dynamics. We combine within-cell genome dynamics and diffusion between cells, so that the synthesis of the two gives rise to the emergence of function, akin to establishing "tissue homeostasis." We introduce two concepts, monotonicity and a weak version of hardwiring. These together are sufficient for global convergence of the tissue dynamics.
Asunto(s)
Fenómenos Fisiológicos Celulares , Modelos Biológicos , Algoritmos , Comunicación Celular , Células/metabolismo , Difusión , Genoma , Homeostasis , MorfogénesisRESUMEN
The day we understand the time evolution of subcellular events at a level of detail comparable to physical systems governed by Newton's laws of motion seems far away. Even so, quantitative approaches to cellular dynamics add to our understanding of cell biology. With data-guided frameworks we can develop better predictions about, and methods for, control over specific biological processes and system-wide cell behavior. Here we describe an approach for optimizing the use of transcription factors (TFs) in cellular reprogramming, based on a device commonly used in optimal control. We construct an approximate model for the natural evolution of a cell-cycle-synchronized population of human fibroblasts, based on data obtained by sampling the expression of 22,083 genes at several time points during the cell cycle. To arrive at a model of moderate complexity, we cluster gene expression based on division of the genome into topologically associating domains (TADs) and then model the dynamics of TAD expression levels. Based on this dynamical model and additional data, such as known TF binding sites and activity, we develop a methodology for identifying the top TF candidates for a specific cellular reprogramming task. Our data-guided methodology identifies a number of TFs previously validated for reprogramming and/or natural differentiation and predicts some potentially useful combinations of TFs. Our findings highlight the immense potential of dynamical models, mathematics, and data-guided methodologies for improving strategies for control over biological processes.
Asunto(s)
Algoritmos , Reprogramación Celular/genética , Biología Computacional/métodos , Fibroblastos/citología , Factores de Transcripción/genética , Sitios de Unión/genética , Ciclo Celular/genética , Diferenciación Celular , Células Cultivadas , Reprogramación Celular/fisiología , Perfilación de la Expresión Génica , Genoma Humano/genética , Humanos , Modelos GenéticosRESUMEN
Motivation: The availability of powerful analysis tools will further understanding of genome organization and its relationship to phenotype in dynamical settings. Results: The 4D Nucleome Analysis Toolbox (NAT) is a user-friendly and powerful MATLAB toolbox for time series analysis of genome-wide chromosome conformation capture (Hi-C) data and gene expression (RNA-seq). NAT can load and normalize data, define topologically associating domains, analyse translocations, produce visualization, and study time course data. We provide examples that include time series data sets and karyotypically abnormal cell lines demonstrating the flexibility of NAT. Availability and implementation: https://github.com/laseaman/4D_Nucleome_Analysis_Toolbox. Contact: indikar@umich.edu.
Asunto(s)
Cariotipo Anormal , Cromosomas Humanos , Perfilación de la Expresión Génica/métodos , Genómica/métodos , Programas Informáticos , Regulación de la Expresión Génica , Humanos , Análisis de Secuencia de ARN/métodosRESUMEN
The human genome is dynamic in structure, complicating researcher's attempts at fully understanding it. Time series "Fluorescent in situ Hybridization" (FISH) imaging has increased our ability to observe genome structure, but due to cell type and experimental variability this data is often noisy and difficult to analyze. Furthermore, computational analysis techniques are needed for homolog discrimination and canonical framework detection, in the case of time-series images. In this paper we introduce novel ideas for nucleus imaging analysis, present findings extracted using dynamic genome imaging, and propose an objective algorithm for high-throughput, time-series FISH imaging. While a canonical framework could not be detected beyond statistical significance in the analyzed dataset, a mathematical framework for detection has been outlined with extension to 3D image analysis.
Asunto(s)
Proteínas CLOCK/genética , Núcleo Celular/metabolismo , Fibroblastos/metabolismo , Genoma Humano , Ensayos Analíticos de Alto Rendimiento , Imagenología Tridimensional/métodos , Algoritmos , Proteínas CLOCK/metabolismo , Línea Celular , Núcleo Celular/ultraestructura , Ritmo Circadiano/genética , Fibroblastos/ultraestructura , Regulación de la Expresión Génica , Humanos , Imagenología Tridimensional/instrumentación , Hibridación Fluorescente in Situ , Factores Reguladores Miogénicos/genética , Factores Reguladores Miogénicos/metabolismoRESUMEN
The 4D organization of the interphase nucleus, or the 4D Nucleome (4DN), reflects a dynamical interaction between 3D genome structure and function and its relationship to phenotype. We present initial analyses of the human 4DN, capturing genome-wide structure using chromosome conformation capture and 3D imaging, and function using RNA-sequencing. We introduce a quantitative index that measures underlying topological stability of a genomic region. Our results show that structural features of genomic regions correlate with function with surprising persistence over time. Furthermore, constructing genome-wide gene-level contact maps aided in identifying gene pairs with high potential for coregulation and colocalization in a manner consistent with expression via transcription factories. We additionally use 2D phase planes to visualize patterns in 4DN data. Finally, we evaluated gene pairs within a circadian gene module using 3D imaging, and found periodicity in the movement of clock circadian regulator and period circadian clock 2 relative to each other that followed a circadian rhythm and entrained with their expression.
Asunto(s)
Núcleo Celular/metabolismo , Genoma Humano , Interfase , Ritmo Circadiano/genética , Redes Reguladoras de Genes , HumanosRESUMEN
MOTIVATION: Topological domains have been proposed as the backbone of interphase chromosome structure. They are regions of high local contact frequency separated by sharp boundaries. Genes within a domain often have correlated transcription. In this paper, we present a computational efficient spectral algorithm to identify topological domains from chromosome conformation data (Hi-C data). We consider the genome as a weighted graph with vertices defined by loci on a chromosome and the edge weights given by interaction frequency between two loci. Laplacian-based graph segmentation is then applied iteratively to obtain the domains at the given compactness level. Comparison with algorithms in the literature shows the advantage of the proposed strategy. RESULTS: An efficient algorithm is presented to identify topological domains from the Hi-C matrix. AVAILABILITY AND IMPLEMENTATION: The Matlab source code and illustrative examples are available at http://bionetworks.ccmb.med.umich.edu/ CONTACT: : indikar@med.umich.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Asunto(s)
Algoritmos , Cromosomas/ultraestructura , Transcripción Genética , Modelos Teóricos , Lenguajes de ProgramaciónRESUMEN
A state-dependent dynamic network is a collection of elements that interact through a network, whose geometry evolves as the state of the elements changes over time. The genome is an intriguing example of a state-dependent network, where chromosomal geometry directly relates to genomic activity, which in turn strongly correlates with geometry. Here we examine various aspects of a genomic state-dependent dynamic network. In particular, we elaborate on one of the important ramifications of viewing genomic networks as being state-dependent, namely, their controllability during processes of genomic reorganization such as in cell differentiation.
Asunto(s)
Diferenciación Celular/genética , Redes Reguladoras de Genes , Modelos Genéticos , Animales , Diferenciación Celular/fisiología , Retroalimentación Fisiológica , Factor de Transcripción GATA1/genética , Factor de Transcripción GATA1/metabolismo , Hematopoyesis/genética , Hematopoyesis/fisiología , Humanos , Modelos Biológicos , Proteína MioD/genética , Proteína MioD/metabolismoRESUMEN
Biomarkers enable objective monitoring of a given cell or state in a biological system and are widely used in research, biomanufacturing, and clinical practice. However, identifying appropriate biomarkers that are both robustly measurable and capture a state accurately remains challenging. We present a framework for biomarker identification based upon observability guided sensor selection. Our methods, Dynamic Sensor Selection (DSS) and Structure-Guided Sensor Selection (SGSS), utilize temporal models and experimental data, offering a template for applying observability theory to data from biological systems. Unlike conventional methods that assume well-known, fixed dynamics, DSS adaptively select biomarkers or sensors that maximize observability while accounting for the time-varying nature of biological systems. Additionally, SGSS incorporates structural information and diverse data to identify sensors which are resilient against inaccuracies in our model of the underlying system. We validate our approaches by performing estimation on high dimensional systems derived from temporal gene expression data from partial observations. Our algorithms reliably identify known biomarkers and uncover new ones within our datasets. Additionally, integrating chromosome conformation and gene expression data addresses noise and uncertainty, enhancing the reliability of our biomarker selection approach for the genome.
RESUMEN
The advent of advanced robotic platforms and workflow automation tools has revolutionized the landscape of biological research, offering unprecedented levels of precision, reproducibility, and versatility in experimental design. In this work, we present an automated and modular workflow for exploring cell behavior in two-dimensional culture systems. By integrating the BioAssemblyBot® (BAB) robotic platform and the BioApps™ workflow automater with live-cell fluorescence microscopy, our workflow facilitates execution and analysis of in vitro migration and proliferation assays. Robotic assistance and automation allow for the precise and reproducible creation of highly customizable cell-free zones (CFZs), or wounds, in cell monolayers and "hands-free," schedulable integration with real-time monitoring systems for cellular dynamics. CFZs are designed as computer-aided design models and recreated in confluent cell layers by the BAB 3D-Bioprinting tool. The dynamics of migration and proliferation are evaluated in individual cells using live-cell fluorescence microscopy and an in-house pipeline for image processing and single-cell tracking. Our robotics-assisted approach outperforms manual scratch assays with enhanced reproducibility, adaptability, and precision. The incorporation of automation further facilitates increased flexibility in wound geometry and allows for many experimental conditions to be analyzed in parallel. Unlike traditional cell migration assays, our workflow offers an adjustable platform that can be tailored to a wide range of applications with high-throughput capability. The key features of this system, including its scalability, versatility, and the ability to maintain a high degree of experimental control, position it as a valuable tool for researchers across various disciplines.
RESUMEN
Unraveling the nature of genetic interactions is crucial to obtaining a more complete picture of complex diseases. It is thought that gene-gene interactions play an important role in the etiology of cancer, cardiovascular, and immune-mediated disease. Interactions among genes are defined as phenotypic effects that differ from those observed for independent contributions of each gene, usually detected by univariate logistic regression methods. Using a multivariate extension of linkage disequilibrium (LD), we have developed a new method, based on distances between sample covariance matrices for groups of single nucleotide polymorphisms (SNPs), to test for interaction effects of two groups of genes associated with a disease phenotype. Since a disease-associated interacting locus will often be in LD with more than one marker in the region, a method that examines a set of markers in a region collectively can offer greater power than traditional methods. Our method effectively identifies interaction effects in simulated data, as well as in data on the genetic contributions to the risk for graft-versus-host disease following hematopoietic stem cell transplantation.
Asunto(s)
Epistasis Genética , Predisposición Genética a la Enfermedad , Desequilibrio de Ligamiento , Modelos Genéticos , Polimorfismo de Nucleótido Simple , Estudios de Casos y Controles , Simulación por Computador , Enfermedad Injerto contra Huésped/genética , Trasplante de Células Madre Hematopoyéticas/efectos adversos , Humanos , Modelos Logísticos , Modelos Estadísticos , Análisis MultivarianteRESUMEN
Adipose tissue macrophage (ATM) infiltration is associated with adipose tissue dysfunction and insulin resistance in mice and humans. Recent single-cell data highlight increased ATM heterogeneity in obesity but do not provide a spatial context for ATM phenotype dynamics. We integrated single-cell RNA-Seq, spatial transcriptomics, and imaging of murine adipose tissue in a time course study of diet-induced obesity. Overall, proinflammatory immune cells were predominant in early obesity, whereas nonresident antiinflammatory ATMs predominated in chronic obesity. A subset of these antiinflammatory ATMs were transcriptomically intermediate between monocytes and mature lipid-associated macrophages (LAMs) and were consistent with a LAM precursor (pre-LAM). Pre-LAMs were spatially associated with early obesity crown-like structures (CLSs), which indicate adipose tissue dysfunction. Spatial data showed colocalization of ligand-receptor transcripts related to lipid signaling among monocytes, pre-LAMs, and LAMs, including Apoe, Lrp1, Lpl, and App. Pre-LAM expression of these ligands in early obesity suggested signaling to LAMs in the CLS microenvironment. Our results refine understanding of ATM diversity and provide insight into the dynamics of the LAM lineage during development of metabolic disease.
Asunto(s)
Tejido Adiposo , Obesidad , Humanos , Ratones , Animales , Tejido Adiposo/metabolismo , Obesidad/metabolismo , Macrófagos/metabolismo , Dieta , LípidosRESUMEN
Deciphering the non-trivial interactions and mechanisms driving the evolution of time-varying complex networks (TVCNs) plays a crucial role in designing optimal control strategies for such networks or enhancing their causal predictive capabilities. In this paper, we advance the science of TVCNs by providing a mathematical framework through which we can gauge how local changes within a complex weighted network affect its global properties. More precisely, we focus on unraveling unknown geometric properties of a network and determine its implications on detecting phase transitions within the dynamics of a TVCN. In this vein, we aim at elaborating a novel and unified approach that can be used to depict the relationship between local interactions in a complex network and its global kinetics. We propose a geometric-inspired framework to characterize the network's state and detect a phase transition between different states, to infer the TVCN's dynamics. A phase of a TVCN is determined by its Forman-Ricci curvature property. Numerical experiments show the usefulness of the proposed curvature formalism to detect the transition between phases within artificially generated networks. Furthermore, we demonstrate the effectiveness of the proposed framework in identifying the phase transition phenomena governing the training and learning processes of artificial neural networks. Moreover, we exploit this approach to investigate the phase transition phenomena in cellular re-programming by interpreting the dynamics of Hi-C matrices as TVCNs and observing singularity trends in the curvature network entropy. Finally, we demonstrate that this curvature formalism can detect a political change. Specifically, our framework can be applied to the US Senate data to detect a political change in the United States of America after the 1994 election, as discussed by political scientists.
RESUMEN
Next-generation DNA sequencing platforms provide exciting new possibilities for in vitro genetic analysis of functional nucleic acids. However, the size of the resulting data sets presents computational and analytical challenges. We present an open-source software package that employs a locality-sensitive hashing algorithm to enumerate all unique sequences in an entire Illumina sequencing run (â¼ 10(8) sequences). The algorithm results in quasilinear time processing of entire Illumina lanes (â¼ 10(7) sequences) on a desktop computer in minutes. To facilitate visual analysis of sequencing data, the software produces three-dimensional scatter plots similar in concept to Sewall Wright and John Maynard Smith's adaptive or fitness landscape. The software also contains functions that are particularly useful for doped selections such as mutation frequency analysis, information content calculation, multivariate statistical functions (including principal component analysis), sequence distance metrics, sequence searches and sequence comparisons across multiple Illumina data sets. Source code, executable files and links to sample data sets are available at http://www.sourceforge.net/projects/sewal.
Asunto(s)
Gráficos por Computador , Análisis de Secuencia de ADN/métodos , Programas Informáticos , Análisis de Componente PrincipalRESUMEN
Although the importance of chromosome organization during mitosis is clear, it remains to be determined whether the nucleus assumes other functionally relevant chromosomal topologies. We have previously shown that homologous chromosomes have a tendency to associate during hematopoiesis according to their distribution of coregulated genes, suggesting cell-specific nuclear organization. Here, using the mathematical approaches of distance matrices and coupled oscillators, we model the dynamic relationship between gene expression and chromosomal associations during the differentiation of a multipotential hematopoietic progenitor. Our analysis reveals dramatic changes in total genomic order: Commitment of the progenitor results in an initial increase in entropy at both the level of gene coregulation and chromosomal organization, which we suggest represents a phase transition, followed by a progressive decline in entropy during differentiation. The stabilization of a highly ordered state in the differentiated cell types results in lineage-specific chromosomal topologies and is related to the emergence of coherence-or self-organization-between chromosomal associations and coordinate gene regulation. We discuss how these observations may be generally relevant to cell fate decisions encountered by progenitor/stem cells.
Asunto(s)
Linaje de la Célula/genética , Cromosomas/genética , Regulación de la Expresión Génica , Diferenciación Celular/genéticaRESUMEN
Increased adipose tissue macrophages (ATMs) correlate with metabolic dysfunction in humans and are causal in development of insulin resistance in mice. Recent bulk and single-cell transcriptomics studies reveal a wide spectrum of gene expression signatures possible for macrophages that depends on context, but the signatures of human ATM subtypes are not well defined in obesity and diabetes. We profiled 3 prominent ATM subtypes from human adipose tissue in obesity and determined their relationship to type 2 diabetes. Visceral adipose tissue (VAT) and s.c. adipose tissue (SAT) samples were collected from diabetic and nondiabetic obese participants to evaluate cellular content and gene expression. VAT CD206+CD11c- ATMs were increased in diabetic participants, were scavenger receptor-rich with low intracellular lipids, secreted proinflammatory cytokines, and diverged significantly from 2 CD11c+ ATM subtypes, which were lipid-laden, were lipid antigen presenting, and overlapped with monocyte signatures. Furthermore, diabetic VAT was enriched for CD206+CD11c- ATM and inflammatory signatures, scavenger receptors, and MHC II antigen presentation genes. VAT immunostaining found CD206+CD11c- ATMs concentrated in vascularized lymphoid clusters adjacent to CD206-CD11c+ ATMs, while CD206+CD11c+ were distributed between adipocytes. Our results show ATM subtype-specific profiles that uniquely contribute to the phenotypic variation in obesity.