Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 33
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
2.
BMC Bioinformatics ; 25(1): 25, 2024 Jan 15.
Artigo em Inglês | MEDLINE | ID: mdl-38221640

RESUMO

With the growing number of single-cell datasets collected under more complex experimental conditions, there is an opportunity to leverage single-cell variability to reveal deeper insights into how cells respond to perturbations. Many existing approaches rely on discretizing the data into clusters for differential gene expression (DGE), effectively ironing out any information unveiled by the single-cell variability across cell-types. In addition, DGE often assumes a statistical distribution that, if erroneous, can lead to false positive differentially expressed genes. Here, we present Cellograph: a semi-supervised framework that uses graph neural networks to quantify the effects of perturbations at single-cell granularity. Cellograph not only measures how prototypical cells are of each condition but also learns a latent space that is amenable to interpretable data visualization and clustering. The learned gene weight matrix from training reveals pertinent genes driving the differences between conditions. We demonstrate the utility of our approach on publicly-available datasets including cancer drug therapy, stem cell reprogramming, and organoid differentiation. Cellograph outperforms existing methods for quantifying the effects of experimental perturbations and offers a novel framework to analyze single-cell data using deep learning.


Assuntos
Visualização de Dados , Redes Neurais de Computação , Diferenciação Celular , Análise por Conglomerados , RNA
3.
Ann Surg ; 277(3): e503-e512, 2023 03 01.
Artigo em Inglês | MEDLINE | ID: mdl-35129529

RESUMO

OBJECTIVE: The longitudinal assessment of physical function with high temporal resolution at a scalable and objective level in patients recovering from surgery is highly desirable to understand the biological and clinical factors that drive the clinical outcome. However, physical recovery from surgery itself remains poorly defined and the utility of wearable technologies to study recovery after surgery has not been established. BACKGROUND: Prolonged postoperative recovery is often associated with long-lasting impairment of physical, mental, and social functions. Although phenotypical and clinical patient characteristics account for some variation of individual recovery trajectories, biological differences likely play a major role. Specifically, patient-specific immune states have been linked to prolonged physical impairment after surgery. However, current methods of quantifying physical recovery lack patient specificity and objectivity. METHODS: Here, a combined high-fidelity accelerometry and state-of-the-art deep immune profiling approach was studied in patients undergoing major joint replacement surgery. The aim was to determine whether objective physical parameters derived from accelerometry data can accurately track patient-specific physical recovery profiles (suggestive of a 'clock of postoperative recovery'), compare the performance of derived parameters with benchmark metrics including step count, and link individual recovery profiles with patients' preoperative immune state. RESULTS: The results of our models indicate that patient-specific temporal patterns of physical function can be derived with a precision superior to benchmark metrics. Notably, 6 distinct domains of physical function and sleep are identified to represent the objective temporal patterns: ''activity capacity'' and ''moderate and overall activity (declined immediately after surgery); ''sleep disruption and sedentary activity (increased after surgery); ''overall sleep'', ''sleep onset'', and ''light activity'' (no clear changes were observed after surgery). These patterns can be linked to individual patients preopera-tive immune state using cross-validated canonical-correlation analysis. Importantly, the pSTAT3 signal activity in monocytic myeloid-derived suppressor cells predicted a slower recovery. CONCLUSIONS: Accelerometry-based recovery trajectories are scalable and objective outcomes to study patient-specific factors that drive physical recovery.


Assuntos
Benchmarking , Exercício Físico , Humanos , Monócitos , Exame Físico , Período Pós-Operatório
4.
EMBO J ; 37(16)2018 08 15.
Artigo em Inglês | MEDLINE | ID: mdl-29973362

RESUMO

The anaphase-promoting complex/cyclosome (APC/C) is an E3 ubiquitin ligase and key regulator of cell cycle progression. Since APC/C promotes the degradation of mitotic cyclins, it controls cell cycle-dependent oscillations in cyclin-dependent kinase (CDK) activity. Both CDKs and APC/C control a large number of substrates and are regulated by analogous mechanisms, including cofactor-dependent activation. However, whereas substrate dephosphorylation is known to counteract CDK, it remains largely unknown whether deubiquitinating enzymes (DUBs) antagonize APC/C substrate ubiquitination during mitosis. Here, we demonstrate that Cezanne/OTUD7B is a cell cycle-regulated DUB that opposes the ubiquitination of APC/C targets. Cezanne is remarkably specific for K11-linked ubiquitin chains, which are formed by APC/C in mitosis. Accordingly, Cezanne binds established APC/C substrates and reverses their APC/C-mediated ubiquitination. Cezanne depletion accelerates APC/C substrate degradation and causes errors in mitotic progression and formation of micronuclei. These data highlight the importance of tempered APC/C substrate destruction in maintaining chromosome stability. Furthermore, Cezanne is recurrently amplified and overexpressed in numerous malignancies, suggesting a potential role in genome maintenance and cancer cell proliferation.


Assuntos
Ciclossomo-Complexo Promotor de Anáfase/metabolismo , Instabilidade Cromossômica , Enzimas Desubiquitinantes/metabolismo , Endopeptidases/metabolismo , Mitose , Proteínas de Neoplasias/metabolismo , Neoplasias/metabolismo , Proteólise , Ciclossomo-Complexo Promotor de Anáfase/genética , Enzimas Desubiquitinantes/genética , Endopeptidases/genética , Células HCT116 , Células HEK293 , Células HeLa , Humanos , Micronúcleos com Defeito Cromossômico , Proteínas de Neoplasias/genética , Neoplasias/genética , Neoplasias/patologia , Ubiquitinação
5.
Ann Surg ; 275(3): 582-590, 2022 03 01.
Artigo em Inglês | MEDLINE | ID: mdl-34954754

RESUMO

OBJECTIVE: The aim of this study was to determine whether single-cell and plasma proteomic elements of the host's immune response to surgery accurately identify patients who develop a surgical site complication (SSC) after major abdominal surgery. SUMMARY BACKGROUND DATA: SSCs may occur in up to 25% of patients undergoing bowel resection, resulting in significant morbidity and economic burden. However, the accurate prediction of SSCs remains clinically challenging. Leveraging high-content proteomic technologies to comprehensively profile patients' immune response to surgery is a promising approach to identify predictive biological factors of SSCs. METHODS: Forty-one patients undergoing non-cancer bowel resection were prospectively enrolled. Blood samples collected before surgery and on postoperative day one (POD1) were analyzed using a combination of single-cell mass cytometry and plasma proteomics. The primary outcome was the occurrence of an SSC, including surgical site infection, anastomotic leak, or wound dehiscence within 30 days of surgery. RESULTS: A multiomic model integrating the single-cell and plasma proteomic data collected on POD1 accurately differentiated patients with (n = 11) and without (n = 30) an SSC [area under the curve (AUC) = 0.86]. Model features included coregulated proinflammatory (eg, IL-6- and MyD88- signaling responses in myeloid cells) and immunosuppressive (eg, JAK/STAT signaling responses in M-MDSCs and Tregs) events preceding an SSC. Importantly, analysis of the immunological data obtained before surgery also yielded a model accurately predicting SSCs (AUC = 0.82). CONCLUSIONS: The multiomic analysis of patients' immune response after surgery and immune state before surgery revealed systemic immune signatures preceding the development of SSCs. Our results suggest that integrating immunological data in perioperative risk assessment paradigms is a plausible strategy to guide individualized clinical care.


Assuntos
Fístula Anastomótica/epidemiologia , Proteínas Sanguíneas/análise , Proteínas Alimentares/sangue , Deiscência da Ferida Operatória/epidemiologia , Infecção da Ferida Cirúrgica/epidemiologia , Adulto , Estudos de Coortes , Procedimentos Cirúrgicos do Sistema Digestório , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Modelos Teóricos , Prognóstico , Estudos Prospectivos , Proteoma , Análise de Célula Única
6.
Bioinformatics ; 35(1): 95-103, 2019 01 01.
Artigo em Inglês | MEDLINE | ID: mdl-30561547

RESUMO

Motivation: Multiple biological clocks govern a healthy pregnancy. These biological mechanisms produce immunologic, metabolomic, proteomic, genomic and microbiomic adaptations during the course of pregnancy. Modeling the chronology of these adaptations during full-term pregnancy provides the frameworks for future studies examining deviations implicated in pregnancy-related pathologies including preterm birth and preeclampsia. Results: We performed a multiomics analysis of 51 samples from 17 pregnant women, delivering at term. The datasets included measurements from the immunome, transcriptome, microbiome, proteome and metabolome of samples obtained simultaneously from the same patients. Multivariate predictive modeling using the Elastic Net (EN) algorithm was used to measure the ability of each dataset to predict gestational age. Using stacked generalization, these datasets were combined into a single model. This model not only significantly increased predictive power by combining all datasets, but also revealed novel interactions between different biological modalities. Future work includes expansion of the cohort to preterm-enriched populations and in vivo analysis of immune-modulating interventions based on the mechanisms identified. Availability and implementation: Datasets and scripts for reproduction of results are available through: https://nalab.stanford.edu/multiomics-pregnancy/. Supplementary information: Supplementary data are available at Bioinformatics online.


Assuntos
Metaboloma , Microbiota , Gravidez , Proteoma , Transcriptoma , Biologia Computacional , Feminino , Humanos
7.
Brain ; 142(4): 978-991, 2019 04 01.
Artigo em Inglês | MEDLINE | ID: mdl-30860258

RESUMO

Stroke is a leading cause of cognitive impairment and dementia, but the mechanisms that underlie post-stroke cognitive decline are not well understood. Stroke produces profound local and systemic immune responses that engage all major innate and adaptive immune compartments. However, whether the systemic immune response to stroke contributes to long-term disability remains ill-defined. We used a single-cell mass cytometry approach to comprehensively and functionally characterize the systemic immune response to stroke in longitudinal blood samples from 24 patients over the course of 1 year and correlated the immune response with changes in cognitive functioning between 90 and 365 days post-stroke. Using elastic net regularized regression modelling, we identified key elements of a robust and prolonged systemic immune response to ischaemic stroke that occurs in three phases: an acute phase (Day 2) characterized by increased signal transducer and activator of transcription 3 (STAT3) signalling responses in innate immune cell types, an intermediate phase (Day 5) characterized by increased cAMP response element-binding protein (CREB) signalling responses in adaptive immune cell types, and a late phase (Day 90) by persistent elevation of neutrophils, and immunoglobulin M+ (IgM+) B cells. By Day 365 there was no detectable difference between these samples and those from an age- and gender-matched patient cohort without stroke. When regressed against the change in the Montreal Cognitive Assessment scores between Days 90 and 365 after stroke, the acute inflammatory phase Elastic Net model correlated with post-stroke cognitive trajectories (r = -0.692, Bonferroni-corrected P = 0.039). The results demonstrate the utility of a deep immune profiling approach with mass cytometry for the identification of clinically relevant immune correlates of long-term cognitive trajectories.


Assuntos
Cognição/fisiologia , Acidente Vascular Cerebral/imunologia , Acidente Vascular Cerebral/fisiopatologia , Idoso , Idoso de 80 Anos ou mais , Isquemia Encefálica/complicações , Proteína de Ligação a CREB/metabolismo , Transtornos Cognitivos/etiologia , Transtornos Cognitivos/imunologia , Disfunção Cognitiva/complicações , Disfunção Cognitiva/imunologia , Estudos de Coortes , Feminino , Humanos , Imunoglobulina M , Estudos Longitudinais , Masculino , Pessoa de Meia-Idade , Neutrófilos , Fator de Transcrição STAT3/metabolismo , Transdução de Sinais , Acidente Vascular Cerebral/complicações , Sobreviventes
8.
Phys Rev Lett ; 116(22): 228301, 2016 Jun 03.
Artigo em Inglês | MEDLINE | ID: mdl-27314740

RESUMO

Many systems are naturally represented by a multilayer network in which edges exist in multiple layers that encode different, but potentially related, types of interactions, and it is important to understand limitations on the detectability of community structure in these networks. Using random matrix theory, we analyze detectability limitations for multilayer (specifically, multiplex) stochastic block models (SBMs) in which L layers are derived from a common SBM. We study the effect of layer aggregation on detectability for several aggregation methods, including summation of the layers' adjacency matrices for which we show the detectability limit vanishes as O(L^{-1/2}) with increasing number of layers, L. Importantly, we find a similar scaling behavior when the summation is thresholded at an optimal value, providing insight into the common-but not well understood-practice of thresholding pairwise-interaction data to obtain sparse network representations.

9.
Genome Biol ; 25(1): 9, 2024 01 03.
Artigo em Inglês | MEDLINE | ID: mdl-38172966

RESUMO

BACKGROUND: To analyze the large volume of data generated by single-cell technologies and to identify cellular correlates of particular clinical or experimental outcomes, differential abundance analyses are often applied. These algorithms identify subgroups of cells whose abundances change significantly in response to disease progression, or to an experimental perturbation. Despite the effectiveness of differential abundance analyses in identifying critical cell-states, there is currently no systematic benchmarking study to compare their applicability, usefulness, and accuracy in practice across single-cell modalities. RESULTS: Here, we perform a comprehensive benchmarking study to objectively evaluate and compare the benefits and potential downsides of current state-of-the-art differential abundance testing methods. We benchmarked six single-cell testing methods on several practical tasks, using both synthetic and real single-cell datasets. The tasks evaluated include effectiveness in identifying true differentially abundant subpopulations, accuracy in the adequate handling of batch effects, runtime efficiency, and hyperparameter usability and robustness. Based on various evaluation results, this paper gives dataset-specific suggestions for the practical use of differential abundance testing approaches. CONCLUSIONS: Based on our benchmarking study, we provide a set of recommendations for the optimal usage of single-cell DA testing methods in practice, particularly with respect to factors such as the presence of technical noise (for example batch effects), dataset size, and hyperparameter sensitivity.


Assuntos
Algoritmos , Benchmarking , Projetos de Pesquisa , Análise de Célula Única/métodos
10.
Res Sq ; 2024 Sep 12.
Artigo em Inglês | MEDLINE | ID: mdl-39315265

RESUMO

Single-cell technologies enable comprehensive profiling of diverse immune cell-types through the measurement of multiple genes or proteins per cell. In order to translate data from immune profiling assays into powerful diagnostics, machine learning approaches are used to compute per-sample immunological summaries, or featurizations that can be used as inputs to models for outcomes of interest. Current supervised learning approaches for computing per-sample representations are optimized based only on the outcome variable to be predicted and do not take into account clinically-relevant covariates that are likely to also be measured. Here we expand the optimization problem to also take into account such additional patient covariates to directly inform the learned per-sample representations. To do this, we introduce CytoCoSet, a set-based encoding method, which formulates a loss function with an additional triplet term penalizing samples with similar covariates from having disparate embedding results in per-sample representations. Overall, incorporating clinical covariates leads to improved prediction of clinical phenotypes.

11.
Nat Commun ; 15(1): 2765, 2024 Mar 29.
Artigo em Inglês | MEDLINE | ID: mdl-38553455

RESUMO

Single-cell technologies can measure the expression of thousands of molecular features in individual cells undergoing dynamic biological processes. While examining cells along a computationally-ordered pseudotime trajectory can reveal how changes in gene or protein expression impact cell fate, identifying such dynamic features is challenging due to the inherent noise in single-cell data. Here, we present DELVE, an unsupervised feature selection method for identifying a representative subset of molecular features which robustly recapitulate cellular trajectories. In contrast to previous work, DELVE uses a bottom-up approach to mitigate the effects of confounding sources of variation, and instead models cell states from dynamic gene or protein modules based on core regulatory complexes. Using simulations, single-cell RNA sequencing, and iterative immunofluorescence imaging data in the context of cell cycle and cellular differentiation, we demonstrate how DELVE selects features that better define cell-types and cell-type transitions. DELVE is available as an open-source python package: https://github.com/jranek/delve .


Assuntos
Perfilação da Expressão Gênica , Software , Perfilação da Expressão Gênica/métodos , Análise de Célula Única/métodos , Diferenciação Celular , Ciclo Celular/genética , Análise de Sequência de RNA/métodos
12.
bioRxiv ; 2024 Jan 10.
Artigo em Inglês | MEDLINE | ID: mdl-38260395

RESUMO

Amyotrophic lateral sclerosis is the most common fatal motor neuron disease. Approximately 90% of ALS patients exhibit pathology of the master RNA regulator, Transactive Response DNA Binding protein (TDP-43). Despite the prevalence TDP-43 pathology in ALS motor neurons, recent findings suggest immune dysfunction is a determinant of disease progression in patients. Whether TDP-43 pathology elicits disease-modifying immune responses in ALS remains underexplored. In this study, we demonstrate that TDP-43 pathology is internalized by antigen presenting cells, causes vesicle rupture, and leads to innate and adaptive immune cell activation. Using a multiplex imaging platform, we observed interactions between innate and adaptive immune cells near TDP-43 pathological lesions in ALS brain. We used a mass cytometry-based whole-blood stimulation assay to provide evidence that ALS patient peripheral immune cells exhibit responses to TDP-43 aggregates. Taken together, this study provides a novel link between TDP-43 pathology and ALS immune dysfunction, and further highlights the translational and diagnostic implications of monitoring and manipulating the ALS immune response.

13.
bioRxiv ; 2023 Nov 15.
Artigo em Inglês | MEDLINE | ID: mdl-38014189

RESUMO

Single-cell technologies enable high-dimensional profiling of individual cells, therefore offering profound insights into subtle variation between specialized cell-types. However, translating the multitude of nuanced cellular profiles into meaningful per-sample representations is challenging due to heterogeneous cellular composition across individual profiled samples. To compute informative per-sample representations, we developed scLKME, a novel approach that uses a landmark-based kernel mean embedding method to convert multi-sample single-cell data into compact per-sample embeddings. Treating each sample as a distribution over cells, scLKME identifies landmarks across samples and maps these distributions into a reproducing kernel Hilbert space. Overall, scLKME outperforms state-of-the-art techniques in robustness, efficiency, accuracy, and practical usefulness of sample embeddings. Its application on a CyTOF dataset profiling immune responses in preterm birth highlighted its capacity to accurately identify patient-specific variations correlating with gestational age, suggesting broad applicability to multi-sample single-cell datasets with complex experimental designs. scLKME is available as an open-sourced python package at https://github.com/CompCy-lab/scLKME.

14.
bioRxiv ; 2023 Feb 27.
Artigo em Inglês | MEDLINE | ID: mdl-36909641

RESUMO

Modern single-cell data analysis relies on statistical testing (e.g. differential expression testing) to identify genes or proteins that are up-or down-regulated in relation to cell-types or clinical outcomes. However, existing algorithms for such statistical testing are often limited by technical noise and cellular heterogeneity, which lead to false-positive results. To constrain the analysis to a compact and phenotype-related cell population, differential abundance (DA) testing methods were employed to identify subgroups of cells whose abundance changed significantly in response to disease progression, or experimental perturbation. Despite the effectiveness of DA testing algorithms of identifying critical cell-states, there are no systematic benchmarking or comparative studies to compare their usages in practice. Herein, we performed the first comprehensive benchmarking study to objectively evaluate and compare the benefits and potential downsides of current state-of-the-art DA testing methods. We benchmarked six DA testing methods on several practical tasks, using both synthetic and real single-cell datasets. The task evaluated include, recognizing true DA subpopulations, appropriate handing of batch effects, runtime efficiency, and hyperparameter usability and robustness. Based on various evaluation results, this paper gives dataset-specific suggestions for the usage of DA testing methods.

15.
Pac Symp Biocomput ; 28: 85-96, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-36540967

RESUMO

Graph-based algorithms have become essential in the analysis of single-cell data for numerous tasks, such as automated cell-phenotyping and identifying cellular correlates of experimental perturbations or disease states. In large multi-patient, multi-sample single-cell datasets, the analysis of cell-cell similarity graphs representations of these data becomes computationally prohibitive. Here, we introduce cytocoarsening, a novel graph-coarsening algorithm that significantly reduces the size of single-cell graph representations, which can then be used as input to downstream bioinformatics algorithms for improved computational efficiency. Uniquely, cytocoarsening considers both phenotypical similarity of cells and similarity of cells' associated clinical or experimental attributes in order to more readily identify condition-specific cell populations. The resulting coarse graph representations were evaluated based on both their structural correctness and the capacity of downstream algorithms to uncover the same biological conclusions as if the full graph had been used. Cytocoarsening is provided as open source code at https://github.com/ChenCookie/cytocoarsening.


Assuntos
Algoritmos , Biologia Computacional , Humanos , Biologia Computacional/métodos , Software
16.
bioRxiv ; 2023 May 12.
Artigo em Inglês | MEDLINE | ID: mdl-37214963

RESUMO

Single-cell technologies can readily measure the expression of thousands of molecular features from individual cells undergoing dynamic biological processes, such as cellular differentiation, immune response, and disease progression. While examining cells along a computationally ordered pseudotime offers the potential to study how subtle changes in gene or protein expression impact cell fate decision-making, identifying characteristic features that drive continuous biological processes remains difficult to detect from unenriched and noisy single-cell data. Given that all profiled sources of feature variation contribute to the cell-to-cell distances that define an inferred cellular trajectory, including confounding sources of biological variation (e.g. cell cycle or metabolic state) or noisy and irrelevant features (e.g. measurements with low signal-to-noise ratio) can mask the underlying trajectory of study and hinder inference. Here, we present DELVE (dynamic selection of locally covarying features), an unsupervised feature selection method for identifying a representative subset of dynamically-expressed molecular features that recapitulates cellular trajectories. In contrast to previous work, DELVE uses a bottom-up approach to mitigate the effect of unwanted sources of variation confounding inference, and instead models cell states from dynamic feature modules that constitute core regulatory complexes. Using simulations, single-cell RNA sequencing data, and iterative immunofluorescence imaging data in the context of the cell cycle and cellular differentiation, we demonstrate that DELVE selects features that more accurately characterize cell populations and improve the recovery of cell type transitions. This feature selection framework provides an alternative approach for improving trajectory inference and uncovering co-variation amongst features along a biological trajectory. DELVE is implemented as an open-source python package and is publicly available at: https://github.com/jranek/delve.

17.
Cell Rep Med ; 4(11): 101268, 2023 11 21.
Artigo em Inglês | MEDLINE | ID: mdl-37949070

RESUMO

In people with HIV (PWH), the post-antiretroviral therapy (ART) window is critical for immune restoration and HIV reservoir stabilization. We employ deep immune profiling and T cell receptor (TCR) sequencing and examine proliferation to assess how ART impacts T cell homeostasis. In PWH on long-term ART, lymphocyte frequencies and phenotypes are mostly stable. By contrast, broad phenotypic changes in natural killer (NK) cells, γδ T cells, B cells, and CD4+ and CD8+ T cells are observed in the post-ART window. Whereas CD8+ T cells mostly restore, memory CD4+ T subsets and cytolytic NK cells show incomplete restoration 1.4 years post ART. Surprisingly, the hierarchies and frequencies of dominant CD4 TCR clonotypes (0.1%-11% of all CD4+ T cells) remain stable post ART, suggesting that clonal homeostasis can be independent of homeostatic processes regulating CD4+ T cell absolute number, phenotypes, and function. The slow restoration of host immunity post ART also has implications for the design of ART interruption studies.


Assuntos
Infecções por HIV , Reconstituição Imune , Humanos , Linfócitos T CD8-Positivos , Infecções por HIV/tratamento farmacológico , Linfócitos T CD4-Positivos , Antirretrovirais/uso terapêutico , Receptores de Antígenos de Linfócitos T
18.
Nat Comput Sci ; 3(4): 346-359, 2023 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38116462

RESUMO

Advanced measurement and data storage technologies have enabled high-dimensional profiling of complex biological systems. For this, modern multiomics studies regularly produce datasets with hundreds of thousands of measurements per sample, enabling a new era of precision medicine. Correlation analysis is an important first step to gain deeper insights into the coordination and underlying processes of such complex systems. However, the construction of large correlation networks in modern high-dimensional datasets remains a major computational challenge owing to rapidly growing runtime and memory requirements. Here we address this challenge by introducing CorALS (Correlation Analysis of Large-scale (biological) Systems), an open-source framework for the construction and analysis of large-scale parametric as well as non-parametric correlation networks for high-dimensional biological data. It features off-the-shelf algorithms suitable for both personal and high-performance computers, enabling workflows and downstream analysis approaches. We illustrate the broad scope and potential of CorALS by exploring perspectives on complex biological processes in large-scale multiomics and single-cell studies.

19.
Genome Biol ; 23(1): 186, 2022 09 05.
Artigo em Inglês | MEDLINE | ID: mdl-36064614

RESUMO

BACKGROUND: Current methods for analyzing single-cell datasets have relied primarily on static gene expression measurements to characterize the molecular state of individual cells. However, capturing temporal changes in cell state is crucial for the interpretation of dynamic phenotypes such as the cell cycle, development, or disease progression. RNA velocity infers the direction and speed of transcriptional changes in individual cells, yet it is unclear how these temporal gene expression modalities may be leveraged for predictive modeling of cellular dynamics. RESULTS: Here, we present the first task-oriented benchmarking study that investigates integration of temporal sequencing modalities for dynamic cell state prediction. We benchmark ten integration approaches on ten datasets spanning different biological contexts, sequencing technologies, and species. We find that integrated data more accurately infers biological trajectories and achieves increased performance on classifying cells according to perturbation and disease states. Furthermore, we show that simple concatenation of spliced and unspliced molecules performs consistently well on classification tasks and can be used over more memory intensive and computationally expensive methods. CONCLUSIONS: This work illustrates how integrated temporal gene expression modalities may be leveraged for predicting cellular trajectories and sample-associated perturbation and disease phenotypes. Additionally, this study provides users with practical recommendations for task-specific integration of single-cell gene expression modalities.


Assuntos
Benchmarking , Análise de Célula Única , Expressão Gênica
20.
J Matern Fetal Neonatal Med ; 35(25): 5621-5628, 2022 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-33653202

RESUMO

BACKGROUND: Early identification of pregnant women at risk for preeclampsia (PE) is important, as it will enable targeted interventions ahead of clinical manifestations. The quantitative analyses of plasma proteins feature prominently among molecular approaches used for risk prediction. However, derivation of protein signatures of sufficient predictive power has been challenging. The recent availability of platforms simultaneously assessing over 1000 plasma proteins offers broad examinations of the plasma proteome, which may enable the extraction of proteomic signatures with improved prognostic performance in prenatal care. OBJECTIVE: The primary aim of this study was to examine the generalizability of proteomic signatures predictive of PE in two cohorts of pregnant women whose plasma proteome was interrogated with the same highly multiplexed platform. Establishing generalizability, or lack thereof, is critical to devise strategies facilitating the development of clinically useful predictive tests. A second aim was to examine the generalizability of protein signatures predictive of gestational age (GA) in uncomplicated pregnancies in the same cohorts to contrast physiological and pathological pregnancy outcomes. STUDY DESIGN: Serial blood samples were collected during the first, second, and third trimesters in 18 women who developed PE and 18 women with uncomplicated pregnancies (Stanford cohort). The second cohort (Detroit), used for comparative analysis, consisted of 76 women with PE and 90 women with uncomplicated pregnancies. Multivariate analyses were applied to infer predictive and cohort-specific proteomic models, which were then tested in the alternate cohort. Gene ontology (GO) analysis was performed to identify biological processes that were over-represented among top-ranked proteins associated with PE. RESULTS: The model derived in the Stanford cohort was highly significant (p = 3.9E-15) and predictive (AUC = 0.96), but failed validation in the Detroit cohort (p = 9.7E-01, AUC = 0.50). Similarly, the model derived in the Detroit cohort was highly significant (p = 1.0E-21, AUC = 0.73), but failed validation in the Stanford cohort (p = 7.3E-02, AUC = 0.60). By contrast, proteomic models predicting GA were readily validated across the Stanford (p = 1.1E-454, R = 0.92) and Detroit cohorts (p = 1.1.E-92, R = 0.92) indicating that the proteomic assay performed well enough to infer a generalizable model across studied cohorts, which makes it less likely that technical aspects of the assay, including batch effects, accounted for observed differences. CONCLUSIONS: Results point to a broader issue relevant for proteomic and other omic discovery studies in patient cohorts suffering from a clinical syndrome, such as PE, driven by heterogeneous pathophysiologies. While novel technologies including highly multiplex proteomic arrays and adapted computational algorithms allow for novel discoveries for a particular study cohort, they may not readily generalize across cohorts. A likely reason is that the prevalence of pathophysiologic processes leading up to the "same" clinical syndrome can be distributed differently in different and smaller-sized cohorts. Signatures derived in individual cohorts may simply capture different facets of the spectrum of pathophysiologic processes driving a syndrome. Our findings have important implications for the design of omic studies of a syndrome like PE. They highlight the need for performing such studies in diverse and well-phenotyped patient populations that are large enough to characterize subsets of patients with shared pathophysiologies to then derive subset-specific signatures of sufficient predictive power.


Assuntos
Pré-Eclâmpsia , Proteômica , Feminino , Humanos , Gravidez , Proteômica/métodos , Pré-Eclâmpsia/diagnóstico , Proteoma/metabolismo , Biomarcadores , Proteínas Sanguíneas
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA