Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 31
Filter
Add more filters










Publication year range
1.
Bioinformatics ; 40(Supplement_1): i446-i452, 2024 Jun 28.
Article in English | MEDLINE | ID: mdl-38940162

ABSTRACT

BACKGROUND: Charting cellular trajectories over gene expression is key to understanding dynamic cellular processes and their underlying mechanisms. While advances in single-cell RNA-sequencing technologies and computational methods have pushed forward the recovery of such trajectories, trajectory inference remains a challenge due to the noisy, sparse, and high-dimensional nature of single-cell data. This challenge can be alleviated by increasing either the number of cells sampled along the trajectory (breadth) or the sequencing depth, i.e. the number of reads captured per cell (depth). Generally, these two factors are coupled due to an inherent breadth-depth tradeoff that arises when the sequencing budget is constrained due to financial or technical limitations. RESULTS: Here we study the optimal allocation of a fixed sequencing budget to optimize the recovery of trajectory attributes. Empirical results reveal that reconstruction accuracy of internal cell structure in expression space scales with the logarithm of either the breadth or depth of sequencing. We additionally observe a power law relationship between the optimal number of sampled cells and the corresponding sequencing budget. For linear trajectories, non-monotonicity in trajectory reconstruction across the breadth-depth tradeoff can impact downstream inference, such as expression pattern analysis along the trajectory. We demonstrate these results for five single-cell RNA-sequencing datasets encompassing differentiation of embryonic stem cells, pancreatic beta cells, hepatoblast and multipotent hematopoietic cells, as well as induced reprogramming of embryonic fibroblasts into neurons. By addressing the challenges of single-cell data, our study offers insights into maximizing the efficiency of cellular trajectory analysis through strategic allocation of sequencing resources.


Subject(s)
Single-Cell Analysis , Single-Cell Analysis/methods , Sequence Analysis, RNA/methods , Humans , Animals , High-Throughput Nucleotide Sequencing/methods
2.
Nat Commun ; 15(1): 760, 2024 Jan 26.
Article in English | MEDLINE | ID: mdl-38278815

ABSTRACT

Cellular populations simultaneously encode multiple biological attributes, including spatial configuration, temporal trajectories, and cell-cell interactions. Some of these signals may be overshadowed by others and harder to recover, despite the great progress made to computationally reconstruct biological processes from single-cell data. To address this, we present SiFT, a kernel-based projection method for filtering biological signals in single-cell data, thus uncovering underlying biological processes. SiFT applies to a wide range of tasks, from the removal of unwanted variation in the data to revealing hidden biological structures. We demonstrate how SiFT enhances the liver circadian signal by filtering spatial zonation, recovers regenerative cell subpopulations in spatially-resolved liver data, and exposes COVID-19 disease-related cells, pathways, and dynamics by filtering healthy reference signals. SiFT performs the correction at the gene expression level, can scale to large datasets, and compares favorably to state-of-the-art methods.


Subject(s)
Algorithms , Single-Cell Analysis
3.
Nat Biotechnol ; 2024 Jan 15.
Article in English | MEDLINE | ID: mdl-38225466

ABSTRACT

Biolord is a deep generative method for disentangling single-cell multi-omic data to known and unknown attributes, including spatial, temporal and disease states, used to reveal the decoupled biological signatures over diverse single-cell modalities and biological systems. By virtually shifting cells across states, biolord generates experimentally inaccessible samples, outperforming state-of-the-art methods in predictions of cellular response to unseen drugs and genetic perturbations. Biolord is available at https://github.com/nitzanlab/biolord .

4.
Bioinformatics ; 39(39 Suppl 1): i423-i430, 2023 06 30.
Article in English | MEDLINE | ID: mdl-37387155

ABSTRACT

MOTIVATION: Single-cell RNA-sequencing technologies have greatly enhanced our understanding of heterogeneous cell populations and underlying regulatory processes. However, structural (spatial or temporal) relations between cells are lost during cell dissociation. These relations are crucial for identifying associated biological processes. Many existing tissue-reconstruction algorithms use prior information about subsets of genes that are informative with respect to the structure or process to be reconstructed. When such information is not available, and in the general case when the input genes code for multiple processes, including being susceptible to noise, biological reconstruction is often computationally challenging. RESULTS: We propose an algorithm that iteratively identifies manifold-informative genes using existing reconstruction algorithms for single-cell RNA-seq data as subroutine. We show that our algorithm improves the quality of tissue reconstruction for diverse synthetic and real scRNA-seq data, including data from the mammalian intestinal epithelium and liver lobules. AVAILABILITY AND IMPLEMENTATION: The code and data for benchmarking are available at github.com/syq2012/iterative_weight_update_for_reconstruction.


Subject(s)
Algorithms , Single-Cell Gene Expression Analysis , Animals , Benchmarking , Mammals
5.
Cell Rep ; 42(5): 112412, 2023 05 30.
Article in English | MEDLINE | ID: mdl-37086403

ABSTRACT

Most cell types in multicellular organisms can perform multiple functions. However, not all functions can be optimally performed simultaneously by the same cells. Functions incompatible at the level of individual cells can be performed at the cell population level, where cells divide labor and specialize in different functions. Division of labor can arise due to instruction by tissue environment or through self-organization. Here, we develop a computational framework to investigate the contribution of these mechanisms to division of labor within a cell-type population. By optimizing collective cellular task performance under trade-offs, we find that distinguishable expression patterns can emerge from cell-cell interactions versus instructive signals. We propose a method to construct ligand-receptor networks between specialist cells and use it to infer division-of-labor mechanisms from single-cell RNA sequencing (RNA-seq) and spatial transcriptomics data of stromal, epithelial, and immune cells. Our framework can be used to characterize the complexity of cell interactions within tissues.


Subject(s)
Cell Communication , Cues , Gene Expression Profiling
6.
Nat Biotechnol ; 41(10): 1465-1473, 2023 10.
Article in English | MEDLINE | ID: mdl-36797494

ABSTRACT

Transferring annotations of single-cell-, spatial- and multi-omics data is often challenging owing both to technical limitations, such as low spatial resolution or high dropout fraction, and to biological variations, such as continuous spectra of cell states. Based on the concept that these data are often best described as continuous mixtures of cells or molecules, we present a computational framework for the transfer of annotations to cells and their combinations (TACCO), which consists of an optimal transport model extended with different wrappers to annotate a wide variety of data. We apply TACCO to identify cell types and states, decipher spatiomolecular tissue structure at the cell and molecular level and resolve differentiation trajectories using synthetic and biological datasets. While matching or exceeding the accuracy of specialized tools for the individual tasks, TACCO reduces the computational requirements by up to an order of magnitude and scales to larger datasets (for example, considering the runtime of annotation transfer for 1 M simulated dropout observations).


Subject(s)
Multiomics , Single-Cell Analysis , Data Curation
7.
Nat Biotechnol ; 41(11): 1645-1654, 2023 Nov.
Article in English | MEDLINE | ID: mdl-36849830

ABSTRACT

Single-cell RNA sequencing has been instrumental in uncovering cellular spatiotemporal context. This task is challenging as cells simultaneously encode multiple, potentially cross-interfering, biological signals. Here we propose scPrisma, a spectral computational method that uses topological priors to decouple, enhance and filter different classes of biological processes in single-cell data, such as periodic and linear signals. We apply scPrisma to the analysis of the cell cycle in HeLa cells, circadian rhythm and spatial zonation in liver lobules, diurnal cycle in Chlamydomonas and circadian rhythm in the suprachiasmatic nucleus in the brain. scPrisma can be used to distinguish mixed cellular populations by specific characteristics such as cell type and uncover regulatory networks and cell-cell interactions specific to predefined biological signals, such as the circadian rhythm. We show scPrisma's flexibility in incorporating prior knowledge, inference of topologically informative genes and generalization to additional diverse templates and systems. scPrisma can be used as a stand-alone workflow for signal analysis and as a prior step for downstream single-cell analysis.


Subject(s)
Circadian Rhythm , Suprachiasmatic Nucleus , Humans , HeLa Cells , Circadian Rhythm/genetics , Suprachiasmatic Nucleus/metabolism , Cell Cycle , Cell Cycle Proteins/metabolism
8.
PLoS Comput Biol ; 17(11): e1009576, 2021 11.
Article in English | MEDLINE | ID: mdl-34748539

ABSTRACT

Advances in genetic engineering technologies have allowed the construction of artificial genetic circuits, which have been used to generate spatial patterns of differential gene expression. However, the question of how cells can be programmed, and how complex the rules need to be, to achieve a desired tissue morphology has received less attention. Here, we address these questions by developing a mathematical model to study how cells can collectively grow into clusters with different structural morphologies by secreting diffusible signals that can influence cellular growth rates. We formulate how growth regulators can be used to control the formation of cellular protrusions and how the range of achievable structures scales with the number of distinct signals. We show that a single growth inhibitor is insufficient for the formation of multiple protrusions but may be achieved with multiple growth inhibitors, and that other types of signals can regulate the shape of protrusion tips. These examples illustrate how our approach could potentially be used to guide the design of regulatory circuits for achieving a desired target structure.


Subject(s)
Cell Proliferation/physiology , Cell Shape/physiology , Cellular Reprogramming Techniques/methods , Models, Biological , Animals , Cell Aggregation/physiology , Cell Communication/physiology , Cell Surface Extensions/physiology , Cellular Reprogramming Techniques/statistics & numerical data , Computational Biology , Computer Simulation , Gene Regulatory Networks , Genetic Engineering/methods , Genetic Engineering/statistics & numerical data , Growth Inhibitors/physiology , Humans , Morphogenesis/physiology , Synthetic Biology
9.
Nat Methods ; 18(11): 1352-1362, 2021 11.
Article in English | MEDLINE | ID: mdl-34711971

ABSTRACT

Charting an organs' biological atlas requires us to spatially resolve the entire single-cell transcriptome, and to relate such cellular features to the anatomical scale. Single-cell and single-nucleus RNA-seq (sc/snRNA-seq) can profile cells comprehensively, but lose spatial information. Spatial transcriptomics allows for spatial measurements, but at lower resolution and with limited sensitivity. Targeted in situ technologies solve both issues, but are limited in gene throughput. To overcome these limitations we present Tangram, a method that aligns sc/snRNA-seq data to various forms of spatial data collected from the same region, including MERFISH, STARmap, smFISH, Spatial Transcriptomics (Visium) and histological images. Tangram can map any type of sc/snRNA-seq data, including multimodal data such as those from SHARE-seq, which we used to reveal spatial patterns of chromatin accessibility. We demonstrate Tangram on healthy mouse brain tissue, by reconstructing a genome-wide anatomically integrated spatial map at single-cell resolution of the visual and somatomotor areas.


Subject(s)
Brain/metabolism , Chromatin/genetics , Deep Learning , Gene Expression Regulation , Single-Cell Analysis/methods , Software , Transcriptome , Animals , Chromatin/chemistry , Chromatin/metabolism , Female , Gene Expression Profiling , Male , Mice , Mice, Inbred C57BL , RNA-Seq , Regulatory Sequences, Nucleic Acid
10.
Nat Protoc ; 16(9): 4177-4200, 2021 09.
Article in English | MEDLINE | ID: mdl-34349282

ABSTRACT

Single-cell RNA-sequencing (scRNA-seq) technologies have revolutionized modern biomedical sciences. A fundamental challenge is to incorporate spatial information to study tissue organization and spatial gene expression patterns. Here, we describe a detailed protocol for using novoSpaRc, a computational framework that probabilistically assigns cells to tissue locations. At the core of this framework lies a structural correspondence hypothesis, that cells in physical proximity share similar gene expression profiles. Given scRNA-seq data, novoSpaRc spatially reconstructs tissues based on this hypothesis, and optionally, by including a reference atlas of marker genes to improve reconstruction. We describe the novoSpaRc algorithm, and its implementation in an open-source Python package ( https://pypi.org/project/novosparc ). NovoSpaRc maps a scRNA-seq dataset of 10,000 cells onto 1,000 locations in <5 min. We describe results obtained using novoSpaRc to reconstruct the mouse organ of Corti de novo based on the structural correspondence assumption and human osteosarcoma cultured cells based on marker gene information, and provide a step-by-step guide to Drosophila embryo reconstruction in the Procedure to demonstrate how these two strategies can be combined.


Subject(s)
Gene Expression , Sequence Analysis, RNA , Single-Cell Analysis , Software , Spatial Analysis , Algorithms , Animals , Embryo, Nonmammalian/cytology , Embryo, Nonmammalian/metabolism , Humans , Organ of Corti/cytology , Organ of Corti/metabolism , Osteosarcoma/metabolism , Osteosarcoma/pathology
11.
Proc Natl Acad Sci U S A ; 118(11)2021 03 16.
Article in English | MEDLINE | ID: mdl-33836557

ABSTRACT

Gene expression profiles of a cellular population, generated by single-cell RNA sequencing, contains rich information about biological state, including cell type, cell cycle phase, gene regulatory patterns, and location within the tissue of origin. A major challenge is to disentangle information about these different biological states from each other, including distinguishing from cell lineage, since the correlation of cellular expression patterns is necessarily contaminated by ancestry. Here, we use a recent advance in random matrix theory, discovered in the context of protein phylogeny, to identify differentiation or ancestry-related processes in single-cell data. Qin and Colwell [C. Qin, L. J. Colwell, Proc. Natl. Acad. Sci. U.S.A. 115, 690-695 (2018)] showed that ancestral relationships in protein sequences create a power-law signature in the covariance eigenvalue distribution. We demonstrate the existence of such signatures in scRNA-seq data and that the genes driving them are indeed related to differentiation and developmental pathways. We predict the existence of similar power-law signatures for cells along linear trajectories and demonstrate this for linearly differentiating systems. Furthermore, we generalize to show that the same signatures can arise for cells along tissue-specific spatial trajectories. We illustrate these principles in diverse tissues and organisms, including the mammalian epidermis and lung, Drosophila whole-embryo, adult Hydra, dendritic cells, the intestinal epithelium, and cells undergoing induced pluripotent stem cells (iPSC) reprogramming. We show how these results can be used to interpret the gradual dynamics of lineage structure along iPSC reprogramming. Together, we provide a framework that can be used to identify signatures of specific biological processes in single-cell data without prior knowledge and identify candidate genes associated with these processes.


Subject(s)
Cell Lineage , Gene Expression , Single-Cell Analysis/methods , Animals , Humans , Sequence Analysis, RNA/methods
13.
Nat Biotechnol ; 39(5): 586-598, 2021 05.
Article in English | MEDLINE | ID: mdl-33432199

ABSTRACT

Cell-free DNA (cfDNA) in human plasma provides access to molecular information about the pathological processes in the organs or tumors from which it originates. These DNA fragments are derived from fragmented chromatin in dying cells and retain some of the cell-of-origin histone modifications. In this study, we applied chromatin immunoprecipitation of cell-free nucleosomes carrying active chromatin modifications followed by sequencing (cfChIP-seq) to 268 human samples. In healthy donors, we identified bone marrow megakaryocytes, but not erythroblasts, as major contributors to the cfDNA pool. In patients with a range of liver diseases, we showed that we can identify pathology-related changes in hepatocyte transcriptional programs. In patients with metastatic colorectal carcinoma, we detected clinically relevant and patient-specific information, including transcriptionally active human epidermal growth factor receptor 2 (HER2) amplifications. Altogether, cfChIP-seq, using low sequencing depth, provides systemic and genome-wide information and can inform diagnosis and facilitate interrogation of physiological and pathological processes using blood samples.


Subject(s)
Chromatin Immunoprecipitation , Colorectal Neoplasms/genetics , Enhancer Elements, Genetic/genetics , Promoter Regions, Genetic/genetics , Cell-Free System , Colorectal Neoplasms/pathology , Gene Expression Regulation, Neoplastic , Hepatocytes/metabolism , Hepatocytes/pathology , Humans , Neoplasm Metastasis , Nucleosomes/genetics , Sequence Analysis, DNA/methods
15.
Nat Commun ; 11(1): 4355, 2020 08 28.
Article in English | MEDLINE | ID: mdl-32859915

ABSTRACT

The genome of influenza A viruses (IAV) is encoded in eight distinct viral ribonucleoproteins (vRNPs) that consist of negative sense viral RNA (vRNA) covered by the IAV nucleoprotein. Previous studies strongly support a selective packaging model by which vRNP segments are bundling to an octameric complex, which is integrated into budding virions. However, the pathway(s) generating a complete genome bundle is not known. We here use a multiplexed FISH assay to monitor all eight vRNAs in parallel in human lung epithelial cells. Analysis of 3.9 × 105 spots of colocalizing vRNAs provides quantitative insights into segment composition of vRNP complexes and, thus, implications for bundling routes. The complexes rarely contain multiple copies of a specific segment. The data suggest a selective packaging mechanism with limited flexibility by which vRNPs assemble into a complete IAV genome. We surmise that this flexibility forms an essential basis for the development of reassortant viruses with pandemic potential.


Subject(s)
Influenza A virus/genetics , Influenza A virus/physiology , RNA, Viral/genetics , Virus Assembly/genetics , Virus Assembly/physiology , A549 Cells , Epithelial Cells/virology , Evolution, Molecular , Humans , In Situ Hybridization , Influenza A Virus, H3N2 Subtype , Influenza, Human/virology , Lung , Models, Theoretical , Ribonucleoproteins/metabolism
16.
J Exp Med ; 217(10)2020 10 05.
Article in English | MEDLINE | ID: mdl-32603407

ABSTRACT

In response to T cell-dependent antigens, mature B cells are stimulated to form germinal centers (GCs), the sites of B cell affinity maturation and the cell of origin (COO) of most B cell lymphomas. To explore the dynamics of GC B cell development beyond the known dark zone and light zone compartments, we performed single-cell (sc) transcriptomic analysis on human GC B cells and identified multiple functionally linked subpopulations, including the distinct precursors of memory B cells and plasma cells. The gene expression signatures associated with these GC subpopulations were effective in providing a sc-COO for ∼80% of diffuse large B cell lymphomas (DLBCLs) and identified novel prognostic subgroups of DLBCL.


Subject(s)
B-Lymphocytes/pathology , Germinal Center/pathology , Lymphoma/pathology , B-Lymphocytes/metabolism , Cell Lineage , Fluorescent Antibody Technique , Gene Expression Profiling , Germinal Center/metabolism , Humans , Lymphoma/metabolism , Single-Cell Analysis
17.
Nature ; 576(7785): 132-137, 2019 12.
Article in English | MEDLINE | ID: mdl-31748748

ABSTRACT

Multiplexed RNA sequencing in individual cells is transforming basic and clinical life sciences1-4. Often, however, tissues must first be dissociated, and crucial information about spatial relationships and communication between cells is thus lost. Existing approaches to reconstruct tissues assign spatial positions to each cell, independently of other cells, by using spatial patterns of expression of marker genes5,6-which often do not exist. Here we reconstruct spatial positions with little or no prior knowledge, by searching for spatial arrangements of sequenced cells in which nearby cells have transcriptional profiles that are often (but not always) more similar than cells that are farther apart. We formulate this task as a generalized optimal-transport problem for probabilistic embedding and derive an efficient iterative algorithm to solve it. We reconstruct the spatial expression of genes in mammalian liver and intestinal epithelium, fly and zebrafish embryos, sections from the mammalian cerebellum and whole kidney, and use the reconstructed tissues to identify genes that are spatially informative. Thus, we identify an organization principle for the spatial expression of genes in animal tissues, which can be exploited to infer meaningful probabilities of spatial position for individual cells. Our framework ('novoSpaRc') can incorporate prior spatial information and is compatible with any single-cell technology. Additional principles that underlie the cartography of gene expression can be tested using our approach.


Subject(s)
Gene Expression , Animals , Drosophila melanogaster , Gene Expression Profiling , Gene Expression Regulation, Developmental , Sequence Analysis, RNA , Single-Cell Analysis , Software
18.
Front Comput Neurosci ; 12: 11, 2018.
Article in English | MEDLINE | ID: mdl-29541024

ABSTRACT

In mental time travel (MTT) one is "traveling" back-and-forth in time, remembering, and imagining events. Despite intensive research regarding memory processes in the hippocampus, it was only recently shown that the hippocampus plays an essential role in encoding the temporal order of events remembered, and therefore plays an important role in MTT. Does it also encode the temporal relations of these events to the remembering self? We asked patients undergoing pre-surgical evaluation with depth electrodes penetrating the temporal lobes bilaterally toward the hippocampus to project themselves in time to a past, future, or present time-point, and then make judgments regarding various events. Classification analysis of intracranial evoked potentials revealed clear temporal dissociation in the left hemisphere between lateral-temporal electrodes, activated at ~100-300 ms, and hippocampal electrodes, activated at ~400-600 ms. This dissociation may suggest a division of labor in the temporal lobe during self-projection in time, hinting toward the different roles of the lateral-temporal cortex and the hippocampus in MTT and the temporal organization of the related events with respect to the experiencing self.

19.
Nat Commun ; 8(1): 2192, 2017 12 19.
Article in English | MEDLINE | ID: mdl-29259167

ABSTRACT

The topology of interactions in network dynamical systems fundamentally underlies their function. Accelerating technological progress creates massively available data about collective nonlinear dynamics in physical, biological, and technological systems. Detecting direct interaction patterns from those dynamics still constitutes a major open problem. In particular, current nonlinear dynamics approaches mostly require to know a priori a model of the (often high dimensional) system dynamics. Here we develop a model-independent framework for inferring direct interactions solely from recording the nonlinear collective dynamics generated. Introducing an explicit dependency matrix in combination with a block-orthogonal regression algorithm, the approach works reliably across many dynamical regimes, including transient dynamics toward steady states, periodic and non-periodic dynamics, and chaos. Together with its capabilities to reveal network (two point) as well as hypernetwork (e.g., three point) interactions, this framework may thus open up nonlinear dynamics options of inferring direct interaction patterns across systems where no model is known.

20.
J Neurosci ; 37(27): 6394-6407, 2017 07 05.
Article in English | MEDLINE | ID: mdl-28546311

ABSTRACT

Investigation of the functional macro-scale organization of the human cortex is fundamental in modern neuroscience. Although numerous studies have identified networks of interacting functional modules in the gray-matter, limited research was directed to the functional organization of the white-matter. Recent studies have demonstrated that the white-matter exhibits blood oxygen level-dependent signal fluctuations similar to those of the gray-matter. Here we used these signal fluctuations to investigate whether the white-matter is organized as functional networks by applying a clustering analysis on resting-state functional MRI (RSfMRI) data from white-matter voxels, in 176 subjects (of both sexes). This analysis indicated the existence of 12 symmetrical white-matter functional networks, corresponding to combinations of white-matter tracts identified by diffusion tensor imaging. Six of the networks included interhemispheric commissural bridges traversing the corpus callosum. Signals in white-matter networks correlated with signals from functional gray-matter networks, providing missing knowledge on how these distributed networks communicate across large distances. These findings were replicated in an independent subject group and were corroborated by seed-based analysis in small groups and individual subjects. The identified white-matter functional atlases and analysis codes are available at http://mind.huji.ac.il/white-matter.aspx Our results demonstrate that the white-matter manifests an intrinsic functional organization as interacting networks of functional modules, similarly to the gray-matter, which can be investigated using RSfMRI. The discovery of functional networks within the white-matter may open new avenues of research in cognitive neuroscience and clinical neuropsychiatry.SIGNIFICANCE STATEMENT In recent years, functional MRI (fMRI) has revolutionized all fields of neuroscience, enabling identifications of functional modules and networks in the human brain. However, most fMRI studies ignored a major part of the brain, the white-matter, discarding signals from it as arising from noise. Here we use resting-state fMRI data from 176 subjects to show that signals from the human white-matter contain meaningful information. We identify 12 functional networks composed of interacting long-distance white-matter tracts. Moreover, we show that these networks are highly correlated to resting-state gray-matter networks, highlighting their functional role. Our findings enable reinterpretation of many existing fMRI datasets, and suggest a new way to explore the white-matter role in cognition and its disturbances in neuropsychiatric disorders.


Subject(s)
Brain Mapping/methods , Brain/physiology , Evoked Potentials/physiology , Nerve Net/physiology , White Matter/physiology , Adult , Female , Humans , Male , Neural Pathways/physiology
SELECTION OF CITATIONS
SEARCH DETAIL