Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 6 de 6
Filter
Add more filters










Database
Language
Publication year range
1.
bioRxiv ; 2024 Feb 17.
Article in English | MEDLINE | ID: mdl-38405704

ABSTRACT

Neural networks have emerged as immensely powerful tools in predicting functional genomic regions, notably evidenced by recent successes in deciphering gene regulatory logic. However, a systematic evaluation of how model architectures and training strategies impact genomics model performance is lacking. To address this gap, we held a DREAM Challenge where competitors trained models on a dataset of millions of random promoter DNA sequences and corresponding expression levels, experimentally determined in yeast, to best capture the relationship between regulatory DNA and gene expression. For a robust evaluation of the models, we designed a comprehensive suite of benchmarks encompassing various sequence types. While some benchmarks produced similar results across the top-performing models, others differed substantially. All top-performing models used neural networks, but diverged in architectures and novel training strategies, tailored to genomics sequence data. To dissect how architectural and training choices impact performance, we developed the Prix Fixe framework to divide any given model into logically equivalent building blocks. We tested all possible combinations for the top three models and observed performance improvements for each. The DREAM Challenge models not only achieved state-of-the-art results on our comprehensive yeast dataset but also consistently surpassed existing benchmarks on Drosophila and human genomic datasets. Overall, we demonstrate that high-quality gold-standard genomics datasets can drive significant progress in model development.

2.
Nat Genet ; 54(8): 1178-1191, 2022 08.
Article in English | MEDLINE | ID: mdl-35902743

ABSTRACT

Pancreatic ductal adenocarcinoma (PDAC) is a highly lethal and treatment-refractory cancer. Molecular stratification in pancreatic cancer remains rudimentary and does not yet inform clinical management or therapeutic development. Here, we construct a high-resolution molecular landscape of the cellular subtypes and spatial communities that compose PDAC using single-nucleus RNA sequencing and whole-transcriptome digital spatial profiling (DSP) of 43 primary PDAC tumor specimens that either received neoadjuvant therapy or were treatment naive. We uncovered recurrent expression programs across malignant cells and fibroblasts, including a newly identified neural-like progenitor malignant cell program that was enriched after chemotherapy and radiotherapy and associated with poor prognosis in independent cohorts. Integrating spatial and cellular profiles revealed three multicellular communities with distinct contributions from malignant, fibroblast and immune subtypes: classical, squamoid-basaloid and treatment enriched. Our refined molecular and cellular taxonomy can provide a framework for stratification in clinical trials and serve as a roadmap for therapeutic targeting of specific cellular phenotypes and multicellular interactions.


Subject(s)
Carcinoma, Pancreatic Ductal , Pancreatic Neoplasms , Biomarkers, Tumor/genetics , Carcinoma, Pancreatic Ductal/genetics , Carcinoma, Pancreatic Ductal/pathology , Carcinoma, Pancreatic Ductal/therapy , Gene Expression Profiling , Humans , Neoadjuvant Therapy , Pancreatic Neoplasms/drug therapy , Pancreatic Neoplasms/genetics , Prognosis , Transcriptome/genetics , Pancreatic Neoplasms
4.
Nature ; 595(7868): 554-559, 2021 07.
Article in English | MEDLINE | ID: mdl-34163074

ABSTRACT

The mammalian cerebral cortex has an unparalleled diversity of cell types, which are generated during development through a series of temporally orchestrated events that are under tight evolutionary constraint and are critical for proper cortical assembly and function1,2. However, the molecular logic that governs the establishment and organization of cortical cell types remains unknown, largely due to the large number of cell classes that undergo dynamic cell-state transitions over extended developmental timelines. Here we generate a comprehensive atlas of the developing mouse neocortex, using single-cell RNA sequencing and single-cell assay for transposase-accessible chromatin using sequencing. We sampled the neocortex every day throughout embryonic corticogenesis and at early postnatal ages, and complemented the sequencing data with a spatial transcriptomics time course. We computationally reconstruct developmental trajectories across the diversity of cortical cell classes, and infer their spatial organization and the gene regulatory programs that accompany their lineage bifurcation decisions and differentiation trajectories. Finally, we demonstrate how this developmental map pinpoints the origin of lineage-specific developmental abnormalities that are linked to aberrant corticogenesis in mutant mice. The data provide a global picture of the regulatory mechanisms that govern cellular diversification in the neocortex.


Subject(s)
Neocortex/cytology , Neurogenesis , Animals , Cell Differentiation , DNA-Binding Proteins/genetics , Embryo, Mammalian , Gene Expression Regulation, Developmental , Mice , Mice, Inbred C57BL , Mice, Knockout , Neocortex/embryology , Nerve Tissue Proteins/genetics , Sequence Analysis, RNA , Single-Cell Analysis , Transcriptome
5.
Proc Mach Learn Res ; 149: 478-505, 2021.
Article in English | MEDLINE | ID: mdl-35098143

ABSTRACT

Probabilistic topic models, have been widely deployed for various applications such as learning disease or tissue subtypes. Yet, learning the parameters of such models is usually an ill-posed problem and may result in losing valuable information about disease severity. A common approach is to add a discriminative loss term to the generative model's loss in order to learn a representation that is also predictive of disease severity. However, finding a balance between these two losses is not straightforward. We propose an alternative way in this paper. We develop a framework which allows for incorporating external covariates into the generative model's approximate posterior. These covariates can have more discriminative power for disease severity compared to the representation that we extract from the posterior distribution. For instance, they can be features extracted from a neural network which predicts disease severity from CT images. Effectively, we enforce the generative model's approximate posterior to reside in the subspace of these discriminative covariates. We illustrate our method's application on a large-scale lung CT study of Chronic Obstructive Pulmonary Disease (COPD), a highly heterogeneous disease. We aim at identifying tissue subtypes by using a variant of topic model as a generative model. We quantitatively evaluate the predictive performance of the inferred subtypes and demonstrate that our method outperforms or performs on par with some reasonable baselines. We also show that some of the discovered subtypes are correlated with genetic measurements, suggesting that the identified subtypes may characterize the disease's underlying etiology.

6.
J Neurosci ; 30(29): 9659-69, 2010 Jul 21.
Article in English | MEDLINE | ID: mdl-20660249

ABSTRACT

How the activity of populations of cortical neurons generates coordinated multijoint actions of the arm, wrist, and hand is poorly understood. This study combined multielectrode recording techniques with full arm motion capture to relate neural activity in primary motor cortex (M1) of macaques (Macaca mulatta) to arm, wrist, and hand postures during movement. We find that the firing rate of individual M1 neurons is typically modulated by the kinematics of multiple joints and that small, local ensembles of M1 neurons contain sufficient information to reconstruct 25 measured joint angles (representing an estimated 10 functionally independent degrees of freedom). Beyond showing that the spiking patterns of local M1 ensembles represent a rich set of naturalistic movements involving the entire upper limb, the results also suggest that achieving high-dimensional reach and grasp actions with neuroprosthetic devices may be possible using small intracortical arrays like those already being tested in human pilot clinical trials.


Subject(s)
Arm/physiology , Hand Strength/physiology , Motor Cortex/physiology , Movement/physiology , Neurons/physiology , Psychomotor Performance/physiology , Animals , Biomechanical Phenomena/physiology , Hand/physiology , Macaca mulatta , Male , Models, Neurological , Motor Skills
SELECTION OF CITATIONS
SEARCH DETAIL
...