RESUMO
BACKGROUND: The majority of high-throughput single-cell molecular profiling methods quantify RNA expression; however, recent multimodal profiling methods add simultaneous measurement of genomic, proteomic, epigenetic, and/or spatial information on the same cells. The development of new statistical and computational methods in Bioconductor for such data will be facilitated by easy availability of landmark datasets using standard data classes. RESULTS: We collected, processed, and packaged publicly available landmark datasets from important single-cell multimodal protocols, including CITE-Seq, ECCITE-Seq, SCoPE2, scNMT, 10X Multiome, seqFISH, and G&T. We integrate data modalities via the MultiAssayExperiment Bioconductor class, document and re-distribute datasets as the SingleCellMultiModal package in Bioconductor's Cloud-based ExperimentHub. The result is single-command actualization of landmark datasets from seven single-cell multimodal data generation technologies, without need for further data processing or wrangling in order to analyze and develop methods within Bioconductor's ecosystem of hundreds of packages for single-cell and multimodal data. CONCLUSIONS: We provide two examples of integrative analyses that are greatly simplified by SingleCellMultiModal. The package will facilitate development of bioinformatic and statistical methods in Bioconductor to meet the challenges of integrating molecular layers and analyzing phenotypic outputs including cell differentiation, activity, and disease.
Assuntos
Ecossistema , Proteômica , Diferenciação Celular , Biologia Computacional , EpigenômicaRESUMO
Differences in the level, timing, or location of gene expression can contribute to alternative phenotypes at the molecular and organismal level. Understanding the origins of expression differences is complicated by the fact that organismal morphology and gene regulatory networks could potentially vary even between closely related species. To assess the scope of such changes, we used high-resolution imaging methods to measure mRNA expression in blastoderm embryos of Drosophila yakuba and Drosophila pseudoobscura and assembled these data into cellular resolution atlases, where expression levels for 13 genes in the segmentation network are averaged into species-specific, cellular resolution morphological frameworks. We demonstrate that the blastoderm embryos of these species differ in their morphology in terms of size, shape, and number of nuclei. We present an approach to compare cellular gene expression patterns between species, while accounting for varying embryo morphology, and apply it to our data and an equivalent dataset for Drosophila melanogaster. Our analysis reveals that all individual genes differ quantitatively in their spatio-temporal expression patterns between these species, primarily in terms of their relative position and dynamics. Despite many small quantitative differences, cellular gene expression profiles for the whole set of genes examined are largely similar. This suggests that cell types at this stage of development are conserved, though they can differ in their relative position by up to 3-4 cell widths and in their relative proportion between species by as much as 5-fold. Quantitative differences in the dynamics and relative level of a subset of genes between corresponding cell types may reflect altered regulatory functions between species. Our results emphasize that transcriptional networks can diverge over short evolutionary timescales and that even small changes can lead to distinct output in terms of the placement and number of equivalent cells.
Assuntos
Padronização Corporal/genética , Proteínas de Drosophila/metabolismo , Drosophila/embriologia , Drosophila/genética , Animais , Evolução Biológica , Blastoderma/crescimento & desenvolvimento , Proteínas de Drosophila/genética , Perfilação da Expressão Gênica , Regulação da Expressão Gênica no Desenvolvimento , Redes Reguladoras de Genes/genética , Hibridização in Situ Fluorescente , Especificidade da EspécieRESUMO
Gene expression patterns can diverge between species due to changes in a gene's regulatory DNA or changes in the proteins, e.g., transcription factors (TFs), that regulate the gene. We developed a modeling framework to uncover the sources of expression differences in blastoderm embryos of three Drosophila species, focusing on the regulatory circuit controlling expression of the hunchback (hb) posterior stripe. Using this framework and cellular-resolution expression measurements of hb and its regulating TFs, we found that changes in the expression patterns of hb's TFs account for much of the expression divergence. We confirmed our predictions using transgenic D. melanogaster lines, which demonstrate that this set of orthologous cis-regulatory elements (CREs) direct similar, but not identical, expression patterns. We related expression pattern differences to sequence changes in the CRE using a calculation of the CRE's TF binding site content. By applying this calculation in both the transgenic and endogenous contexts, we found that changes in binding site content affect sensitivity to regulating TFs and that compensatory evolution may occur in circuit components other than the CRE.
Assuntos
Proteínas de Drosophila/genética , Drosophila/genética , Fatores de Transcrição/metabolismo , Transcriptoma , Animais , Animais Geneticamente Modificados/genética , Animais Geneticamente Modificados/metabolismo , Sequência de Bases , Sítios de Ligação/genética , Blastoderma/metabolismo , Proteínas de Ligação a DNA/genética , Bases de Dados Genéticas , Drosophila/metabolismo , Evolução Molecular , Expressão Gênica , Regulação da Expressão Gênica no Desenvolvimento , Genes Reguladores/genética , Redes e Vias Metabólicas , Modelos Biológicos , Especificidade da Espécie , Fatores de Transcrição/genéticaRESUMO
Complex spatiotemporal gene expression patterns direct the development of the fertilized egg into an adult animal. Comparisons across species show that, in spite of changes in the underlying regulatory DNA sequence, developmental programs can be maintained across millions of years of evolution. Reciprocally, changes in gene expression can be used to generate morphological novelty. Distinguishing between changes in regulatory DNA that lead to changes in gene expression and those that do not is therefore a central goal of evolutionary developmental biology. Quantitative, spatially-resolved measurements of developmental gene expression patterns play a crucial role in this goal, enabling the detection of subtle phenotypic differences between species and the development of computations models that link the sequence of regulatory DNA to expression patterns. Here we report the generation of two atlases of cellular resolution gene expression measurements for the primary anterior-posterior patterning genes in Drosophila simulans and Drosophila virilis By combining these data sets with existing atlases for three other Drosophila species, we detect subtle differences in the gene expression patterns and dynamics driving the highly conserved axis patterning system and delineate inter-species differences in the embryonic morphology. These data sets will be a resource for future modeling studies of the evolution of developmental gene regulatory networks.
Assuntos
Padronização Corporal , Drosophila/embriologia , Desenvolvimento Embrionário , Animais , Biomarcadores , Padronização Corporal/genética , Embrião não Mamífero , Perfilação da Expressão Gênica , Regulação da Expressão Gênica no Desenvolvimento , Redes Reguladoras de Genes , Especificidade de Órgãos , Especificidade da Espécie , TranscriptomaRESUMO
Microcolonies are aggregates of a few dozen to a few thousand cells exhibited by many bacteria. The formation of microcolonies is a crucial step towards the formation of more mature bacterial communities known as biofilms, but also marks a significant change in bacterial physiology. Within a microcolony, bacteria forgo a single cell lifestyle for a communal lifestyle hallmarked by high cell density and physical interactions between cells potentially altering their behaviour. It is thus crucial to understand how initially identical single cells start to behave differently while assembling in these tight communities. Here we show that cells in the microcolonies formed by the human pathogen Neisseria gonorrhoeae (Ng) present differential motility behaviors within an hour upon colony formation. Observation of merging microcolonies and tracking of single cells within microcolonies reveal a heterogeneous motility behavior: cells close to the surface of the microcolony exhibit a much higher motility compared to cells towards the center. Numerical simulations of a biophysical model for the microcolonies at the single cell level suggest that the emergence of differential behavior within a multicellular microcolony of otherwise identical cells is of mechanical origin. It could suggest a route toward further bacterial differentiation and ultimately mature biofilms.
Assuntos
Fímbrias Bacterianas/metabolismo , Neisseria gonorrhoeae/fisiologia , Rastreamento de Células/métodos , Microscopia Eletrônica de Varredura , Neisseria gonorrhoeae/metabolismo , Fenômenos Físicos , Análise de Célula ÚnicaRESUMO
Computational models of enhancer function generally assume that transcription factors (TFs) exert their regulatory effects independently, modeling an enhancer as a "bag of sites." These models fail on endogenous loci that harbor multiple enhancers, and a "two-tier" model appears better suited: in each enhancer TFs work independently, and the total expression is a weighted sum of their expression readouts. Here, we test these two opposing views on how cis-regulatory information is integrated. We fused two Drosophila blastoderm enhancers, measured their readouts, and applied the above two models to these data. The two-tier mechanism better fits these readouts, suggesting that these fused enhancers comprise multiple independent modules, despite having sequence characteristics typical of single enhancers. We show that short-range TF-TF interactions are not sufficient to designate such modules, suggesting unknown underlying mechanisms. Our results underscore that mechanisms of how modules are defined and how their outputs are combined remain to be elucidated.