RESUMO
Eukaryotic genomes are pervasively transcribed by RNA polymerase II. Yet, the molecular and biological implications of such a phenomenon are still largely puzzling. Here, we describe noncoding RNA transcription upstream of the Arabidopsis thaliana DOG1 gene, which governs salt stress responses and is a key regulator of seed dormancy. We find that expression of the DOG1 gene is induced by salt stress, thereby causing a delay in seed germination. We uncover extensive transcriptional activity on the promoter of the DOG1 gene, which produces a variety of lncRNAs. These lncRNAs, named PUPPIES, are co-directionally transcribed and extend into the DOG1 coding region. We show that PUPPIES RNAs respond to salt stress and boost DOG1 expression, resulting in delayed germination. This positive role of pervasive PUPPIES transcription on DOG1 gene expression is associated with augmented pausing of RNA polymerase II, slower transcription and higher transcriptional burst size. These findings highlight the positive role of upstream co-directional transcription in controlling transcriptional dynamics of downstream genes.
Assuntos
Proteínas de Arabidopsis , Arabidopsis , RNA Longo não Codificante , Animais , Arabidopsis/metabolismo , Proteínas de Arabidopsis/metabolismo , Regulação da Expressão Gênica de Plantas , Germinação/genética , Mutação , RNA Polimerase II/genética , RNA Polimerase II/metabolismo , RNA Longo não Codificante/metabolismoRESUMO
Quantitative Systems Pharmacology (QSP) models are increasingly being applied for target discovery and dose selection in immuno-oncology (IO). Typical application involves virtual trial, a simulation of a virtual population of hundreds of model instances with model inputs reflecting individual variability. While the structure of the model and initial parameterisation are based on literature describing the underlying biology, calibration of the virtual population by existing clinical data is frequently required to create tumour and patient population specific model instances. Since comparison of a virtual trial with clinical output requires hundreds of large-scale, non-linear model evaluations, the inference of a virtual population is computationally expensive, frequently becoming a bottleneck. Here, we present novel approach to virtual population inference in IO using emulation of the QSP model and an objective function based on Kolmogorov-Smirnov statistics to maximise congruence of simulated and observed clinical tumour size distributions. We sample the parameter space of a QSP IO model to collect a set of tumour growth time profiles. We evaluate performance of several machine learning approaches in interpolating these time profiles and create a surrogate model, which computes tumor growth profiles faster than the original model and allows examination of tens of millions of virtual patients. We use the surrogate model to infer a virtual population maximising congruence with the waterfall plot of a pembrolizumab clinical trial. We believe that our approach is applicable not only in QSP IO, but also in other applications where virtual populations need to be inferred for computationally expensive mechanistic models.
Assuntos
Neoplasias , Farmacologia em Rede , Humanos , Neoplasias/tratamento farmacológico , Neoplasias/patologia , Oncologia , Simulação por Computador , CalibragemRESUMO
BACKGROUND: Bioimaging techniques offer a robust tool for studying molecular pathways and morphological phenotypes of cell populations subjected to various conditions. As modern high-resolution 3D microscopy provides access to an ever-increasing amount of high-quality images, there arises a need for their analysis in an automated, unbiased, and simple way. Segmentation of structures within the cell nucleus, which is the focus of this paper, presents a new layer of complexity in the form of dense packing and significant signal overlap. At the same time, the available segmentation tools provide a steep learning curve for new users with a limited technical background. This is especially apparent in the bulk processing of image sets, which requires the use of some form of programming notation. RESULTS: In this paper, we present PartSeg, a tool for segmentation and reconstruction of 3D microscopy images, optimised for the study of the cell nucleus. PartSeg integrates refined versions of several state-of-the-art algorithms, including a new multi-scale approach for segmentation and quantitative analysis of 3D microscopy images. The features and user-friendly interface of PartSeg were carefully planned with biologists in mind, based on analysis of multiple use cases and difficulties encountered with other tools, to offer an ergonomic interface with a minimal entry barrier. Bulk processing in an ad-hoc manner is possible without the need for programmer support. As the size of datasets of interest grows, such bulk processing solutions become essential for proper statistical analysis of results. Advanced users can use PartSeg components as a library within Python data processing and visualisation pipelines, for example within Jupyter notebooks. The tool is extensible so that new functionality and algorithms can be added by the use of plugins. For biologists, the utility of PartSeg is presented in several scenarios, showing the quantitative analysis of nuclear structures. CONCLUSIONS: In this paper, we have presented PartSeg which is a tool for precise and verifiable segmentation and reconstruction of 3D microscopy images. PartSeg is optimised for cell nucleus analysis and offers multi-scale segmentation algorithms best-suited for this task. PartSeg can also be used for the bulk processing of multiple images and its components can be reused in other systems or computational experiments.
Assuntos
Imageamento Tridimensional , Microscopia , Algoritmos , Núcleo Celular , Processamento de Imagem Assistida por ComputadorRESUMO
Production of morphologically and physiologically variable seeds is an important strategy that helps plants to survive in unpredictable natural conditions. However, the model plant Arabidopsis thaliana and most agronomically essential crops produce visually homogenous seeds. Using automated phenotype analysis, we observed that small seeds in Arabidopsis tend to have higher primary and secondary dormancy levels than large seeds. Transcriptomic analysis revealed distinct gene expression profiles between large and small seeds. Large seeds have higher expression of translation-related genes implicated in germination competence. By contrast, small seeds have elevated expression of many positive regulators of dormancy, including a key regulator of this process, the DOG1 gene. Differences in DOG1 expression are associated with differential production of its alternative cleavage and polyadenylation isoforms; in small seeds, the proximal poly(A) site is selected, resulting in a short mRNA isoform. Furthermore, single-seed RNA sequencing analysis demonstrated that large seeds resemble DOG1 knockout mutant seeds. Finally, on the single-seed level, expression of genes affected by seed size is correlated with expression of genes that position seeds on the path toward germination. Our results demonstrate an unexpected link between seed size and dormancy phenotypes in a species that produces highly homogenous seed pools, suggesting that the correlation between seed morphology and physiology is more widespread than initially assumed.
Assuntos
Proteínas de Arabidopsis , Arabidopsis , Arabidopsis/metabolismo , Proteínas de Arabidopsis/genética , Proteínas de Arabidopsis/metabolismo , Dormência de Plantas/genética , Germinação/genética , Sementes/genéticaRESUMO
Neutrophil extracellular traps (NETs), pathogen-ensnaring structures formed by neutrophils by expelling their DNA into the environment, are believed to play an important role in immunity and autoimmune diseases. In recent years, a growing attention has been put into developing software tools to quantify NETs in fluorescent microscopy images. However, current solutions require large, manually-prepared training data sets, are difficult to use for users without background in computer science, or have limited capabilities. To overcome these problems, we developed Trapalyzer, a computer program for automatic quantification of NETs. Trapalyzer analyzes fluorescent microscopy images of samples double-stained with a cell-permeable and a cell-impermeable dye, such as the popular combination of Hoechst 33342 and SYTOX™ Green. The program is designed with emphasis on software ergonomy and accompanied with step-by-step tutorials to make its use easy and intuitive. The installation and configuration of the software takes less than half an hour for an untrained user. In addition to NETs, Trapalyzer detects, classifies and counts neutrophils at different stages of NET formation, allowing for gaining a greater insight into this process. It is the first tool that makes this possible without large training data sets. At the same time, it attains a precision of classification on par with state-of-the-art machine learning algorithms. As an example application, we show how to use Trapalyzer to study NET release in a neutrophil-bacteria co-culture. Here, after configuration, Trapalyzer processed 121 images and detected and classified 16 000 ROIs in approximately three minutes on a personal computer. The software and usage tutorials are available at https://github.com/Czaki/Trapalyzer.
Assuntos
Armadilhas Extracelulares , Neutrófilos , Software , Algoritmos , Microscopia de Fluorescência/métodosRESUMO
Spatial chromatin organization is crucial for transcriptional regulation and might be particularly important in neurons since they dramatically change their transcriptome in response to external stimuli. We show that stimulation of neurons causes condensation of large chromatin domains. This phenomenon can be observed in vitro in cultured rat hippocampal neurons as well as in vivo in the amygdala and hippocampal neurons. Activity-induced chromatin condensation is an active, rapid, energy-dependent, and reversible process. It involves calcium-dependent pathways but is independent of active transcription. It is accompanied by the redistribution of posttranslational histone modifications and rearrangements in the spatial organization of chromosome territories. Moreover, it leads to the reorganization of nuclear speckles and active domains located in their proximity. Finally, we find that the histone deacetylase HDAC1 is the key regulator of this process. Our results suggest that HDAC1-dependent chromatin reorganization constitutes an important level of transcriptional regulation in neurons.
Assuntos
Cromatina/metabolismo , Histona Desacetilase 1/metabolismo , Neurônios/metabolismo , Animais , Sinalização do Cálcio , Cromatina/ultraestrutura , Cromossomos de Mamíferos/metabolismo , Metabolismo Energético , Hipocampo/citologia , Potenciação de Longa Duração , Camundongos Endogâmicos C57BL , Ratos Wistar , Transcrição GênicaRESUMO
In the Special Issue on Tools for Protein Science in 2018, we presented Molstack: a concept of a cloud-based platform for sharing electron density maps and their interpretations. Molstack is a web platform that allows the interactive visualization of density maps through the simultaneous presentation of multiple datasets and models in a way that allows for easy pairwise comparison. We anticipated that the users of this conceptually simple platform would find many different uses for their projects, and we were not mistaken. We have observed researchers use Molstack to present experimental evidence for their models in the form of electron density maps, omit maps, and anomalous difference density maps. Users also employed Molstack to present alternative interpretations of densities, including rerefinements and speculative interpretations. While we anticipated these types of projects to be the main use cases, we were pleased to see Molstack used to display superpositions of different models, as a tool for story-driven presentations, and for collaboration as well. Here, we present developments in the platform that were driven by user feedback, highlight several cases that used Molstack to enhance the publication, and discuss possible directions for the platform.
Assuntos
Biologia Computacional/métodos , Proteínas/química , Computação em Nuvem , Microscopia Crioeletrônica , Modelos Moleculares , Conformação Proteica , Software , Interface Usuário-ComputadorRESUMO
Genome-wide analysis of miRNA molecules can reveal important information for understanding the biology of cancer. Typically, miRNAs are used as features in statistical learning methods in order to train learning models to predict cancer. This motivates us to propose a method that integrates clustering and classification techniques for diverse cancer types with survival analysis via regression to identify miRNAs that can potentially play a crucial role in the prediction of different types of tumors. Our method has two parts. The first part is a feature selection procedure, called the stochastic covariance evolutionary strategy with forward selection (SCES-FS), which is developed by integrating stochastic neighbor embedding (SNE), the covariance matrix adaptation evolutionary strategy (CMA-ES), and classifiers, with the primary objective of selecting biomarkers. SNE is used to reorder the features by performing an implicit clustering with highly correlated neighboring features. A subset of features is selected heuristically to perform multi-class classification for diverse cancer types. In the second part of our method, the most important features identified in the first part are used to perform survival analysis via Cox regression, primarily to examine the effectiveness of the selected features. For this purpose, we have analyzed next generation sequencing data from The Cancer Genome Atlas in form of miRNA expression of 1,707 samples of 10 different cancer types and 333 normal samples. The SCES-FS method is compared with well-known feature selection methods and it is found to perform better in multi-class classification for the 17 selected miRNAs, achieving an accuracy of 96%. Moreover, the biological significance of the selected miRNAs is demonstrated with the help of network analysis, expression analysis using hierarchical clustering, KEGG pathway analysis, GO enrichment analysis, and protein-protein interaction analysis. Overall, the results indicate that the 17 selected miRNAs are associated with many key cancer regulators, such as MYC, VEGFA, AKT1, CDKN1A, RHOA, and PTEN, through their targets. Therefore the selected miRNAs can be regarded as putative biomarkers for 10 types of cancer.
RESUMO
The dendritic spines play a crucial role in learning and memory processes, epileptogenesis, drug addiction, and postinjury recovery. The shape of the dendritic spine is a morphological key to understand learning and memory process. The classification of the dendritic spines is based on their shapes but the major questions are how the shapes changes in time, how the synaptic strength changes, and is there a correlation between shapes and synaptic strength? Because the changes of the classes by dendritic spines during activation are time dependent, the forward-directed autoregressive hidden Markov model (ARHMM) can be used to model these changes. It is also more appropriate to use an ARHMM directed backward in time. Thus, the mixture of forward-directed ARHMM and backward-directed ARHMM (MARHMM) is used to model time-dependent data related to the dendritic spines. In this article, we discuss (1) how to choose the initial probability vector and transition and dependence matrices in ARHMM and MARHMM for modeling the dendritic spines changes and (2) how to estimate these matrices. Many descriptors to classify dendritic spines in two-dimensional or/and three-dimensional (3D) are available. Our results from sensitivity analysis show that the classification that comes from 3D descriptors is closer to the truth, and estimated transition and dependence probability matrices are connected with the molecular mechanism of the dendritic spines activation.
Assuntos
Células Dendríticas/fisiologia , Espinhas Dendríticas/fisiologia , Cadeias de Markov , Modelos Teóricos , Animais , Espinhas Dendríticas/patologia , Humanos , Aprendizagem/fisiologia , Memória/fisiologiaRESUMO
The human genome is extensively folded into 3-dimensional organization. However, the detailed 3D chromatin folding structures have not been fully visualized due to the lack of robust and ultra-resolution imaging capability. Here, we report the development of an electron microscopy method that combines serial block-face scanning electron microscopy with in situ hybridization (3D-EMISH) to visualize 3D chromatin folding at targeted genomic regions with ultra-resolution (5 × 5 × 30 nm in xyz dimensions) that is superior to the current super-resolution by fluorescence light microscopy. We apply 3D-EMISH to human lymphoblastoid cells at a 1.7 Mb segment of the genome and visualize a large number of distinctive 3D chromatin folding structures in ultra-resolution. We further quantitatively characterize the reconstituted chromatin folding structures by identifying sub-domains, and uncover a high level heterogeneity of chromatin folding ultrastructures in individual nuclei, suggestive of extensive dynamic fluidity in 3D chromatin states.
Assuntos
Cromatina/metabolismo , Cromatina/ultraestrutura , Algoritmos , Linhagem Celular , Núcleo Celular/metabolismo , Núcleo Celular/ultraestrutura , DNA/ultraestrutura , Humanos , Hibridização In Situ , Microscopia Confocal , Microscopia Eletrônica , Microscopia Eletrônica de VarreduraRESUMO
Categorizing spines into four subpopulations, stubby, mushroom, thin, or filopodia, is one of the common approaches in morphological analysis. Most cellular models describing synaptic plasticity, long-term potentiation (LTP), and long-term depression associate synaptic strength with either spine enlargement or spine shrinkage. Unfortunately, although we have a lot of available software with automatic spine segmentation and feature extraction methods, at present none of them allows for automatic and unbiased distinction between dendritic spine subpopulations, or for the detailed computational models of spine behavior. Therefore, we propose structural classification based on two different mathematical approaches: unsupervised construction of spine shape taxonomy based on arbitrary features (SpineTool) and supervised classification exploiting convolution kernels theory (2dSpAn). We compared two populations of spines in a form of static and dynamic data sets gathered at three time points. The dynamic data contain two sets of spines: the active set and the control set. The first population was stimulated with LTP, and the other population in its resting state was used as a control population. We propose one equation describing the distribution of variables that best fits all dendritic spine parameters.
Assuntos
Espinhas Dendríticas/fisiologia , Processamento de Imagem Assistida por Computador/métodos , Animais , Humanos , Potenciação de Longa Duração , Aprendizado de Máquina , Cadeias de Markov , Modelos EstatísticosRESUMO
The detailed architectural examination of the neuronal nuclei in any brain region, using confocal microscopy, requires quantification of fluorescent signals in three-dimensional stacks of confocal images. An essential prerequisite to any quantification is the segmentation of the nuclei which are typically tightly packed in the tissue, the extreme being the hippocampal dentate gyrus (DG), in which nuclei frequently appear to overlap due to limitations in microscope resolution. Segmentation in DG is a challenging task due to the presence of a significant amount of image artifacts and densely packed nuclei. Accordingly, we established an algorithm based on continuous boundary tracing criterion aiming to reconstruct the nucleus surface and to separate the adjacent nuclei. The presented algorithm neither uses a pre-built nucleus model, nor performs image thresholding, which makes it robust against variations in image intensity and poor contrast. Further, the reconstructed surface is used to study morphology and spatial arrangement of the nuclear interior. The presented method is generally dedicated to segmentation of crowded, overlapping objects in 3D space. In particular, it allows us to study quantitatively the architecture of the neuronal nucleus using confocal-microscopic approach.
RESUMO
The common approach in morphological analysis of dendritic spines of mammalian neuronal cells is to categorize spines into subpopulations based on whether they are stubby, mushroom, thin, or filopodia shaped. The corresponding cellular models of synaptic plasticity, long-term potentiation, and long-term depression associate the synaptic strength with either spine enlargement or spine shrinkage. Although a variety of automatic spine segmentation and feature extraction methods were developed recently, no approaches allowing for an automatic and unbiased distinction between dendritic spine subpopulations and detailed computational models of spine behavior exist. We propose an automatic and statistically based method for the unsupervised construction of spine shape taxonomy based on arbitrary features. The taxonomy is then utilized in the newly introduced computational model of behavior, which relies on transitions between shapes. Models of different populations are compared using supplied bootstrap-based statistical tests. We compared two populations of spines at two time points. The first population was stimulated with long-term potentiation, and the other in the resting state was used as a control. The comparison of shape transition characteristics allowed us to identify the differences between population behaviors. Although some extreme changes were observed in the stimulated population, statistically significant differences were found only when whole models were compared. The source code of our software is freely available for non-commercial use. CONTACT: d.plewczynski@cent.uw.edu.pl.