Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 394
Filtrar
1.
Nature ; 629(8010): 193-200, 2024 May.
Artigo em Inglês | MEDLINE | ID: mdl-38600383

RESUMO

Sex differences in mammalian complex traits are prevalent and are intimately associated with androgens1-7. However, a molecular and cellular profile of sex differences and their modulation by androgens is still lacking. Here we constructed a high-dimensional single-cell transcriptomic atlas comprising over 2.3 million cells from 17 tissues in Mus musculus and explored the effects of sex and androgens on the molecular programs and cellular populations. In particular, we found that sex-biased immune gene expression and immune cell populations, such as group 2 innate lymphoid cells, were modulated by androgens. Integration with the UK Biobank dataset revealed potential cellular targets and risk gene enrichment in antigen presentation for sex-biased diseases. This study lays the groundwork for understanding the sex differences orchestrated by androgens and provides important evidence for targeting the androgen pathway as a broad therapeutic strategy for sex-biased diseases.


Assuntos
Androgênios , Células , Caracteres Sexuais , Análise de Célula Única , Transcriptoma , Animais , Feminino , Humanos , Masculino , Camundongos , Androgênios/metabolismo , Androgênios/farmacologia , Apresentação de Antígeno/efeitos dos fármacos , Apresentação de Antígeno/genética , Imunidade Inata , Linfócitos/metabolismo , Linfócitos/citologia , Linfócitos/imunologia , Linfócitos/efeitos dos fármacos , Camundongos Endogâmicos C57BL , Transcriptoma/efeitos dos fármacos , Transcriptoma/genética , Biobanco do Reino Unido , Células/efeitos dos fármacos , Células/imunologia , Células/metabolismo
2.
Brief Bioinform ; 25(3)2024 Mar 27.
Artigo em Inglês | MEDLINE | ID: mdl-38706319

RESUMO

Inference of cell-cell communication (CCC) provides valuable information in understanding the mechanisms of many important life processes. With the rise of spatial transcriptomics in recent years, many methods have emerged to predict CCCs using spatial information of cells. However, most existing methods only describe CCCs based on ligand-receptor interactions, but lack the exploration of their upstream/downstream pathways. In this paper, we proposed a new method to infer CCCs, called Intercellular Gene Association Network (IGAN). Specifically, it is for the first time that we can estimate the gene associations/network between two specific single spatially adjacent cells. By using the IGAN method, we can not only infer CCCs in an accurate manner, but also explore the upstream/downstream pathways of ligands/receptors from the network perspective, which are actually exhibited as a new panoramic cell-interaction-pathway graph, and thus provide extensive information for the regulatory mechanisms behind CCCs. In addition, IGAN can measure the CCC activity at single cell/spot resolution, and help to discover the CCC spatial heterogeneity. Interestingly, we found that CCC patterns from IGAN are highly consistent with the spatial microenvironment patterns for each cell type, which further indicated the accuracy of our method. Analyses on several public datasets validated the advantages of IGAN.


Assuntos
Comunicação Celular , Redes Reguladoras de Genes , Comunicação Celular/genética , Humanos , Biologia Computacional/métodos , Algoritmos , Análise de Célula Única/métodos , Transdução de Sinais
3.
Brief Bioinform ; 25(4)2024 May 23.
Artigo em Inglês | MEDLINE | ID: mdl-38819253

RESUMO

Spatially resolved transcriptomics (SRT) has emerged as a powerful tool for investigating gene expression in spatial contexts, providing insights into the molecular mechanisms underlying organ development and disease pathology. However, the expression sparsity poses a computational challenge to integrate other modalities (e.g. histological images and spatial locations) that are simultaneously captured in SRT datasets for spatial clustering and variation analyses. In this study, to meet such a challenge, we propose multi-modal domain adaption for spatial transcriptomics (stMDA), a novel multi-modal unsupervised domain adaptation method, which integrates gene expression and other modalities to reveal the spatial functional landscape. Specifically, stMDA first learns the modality-specific representations from spatial multi-modal data using multiple neural network architectures and then aligns the spatial distributions across modal representations to integrate these multi-modal representations, thus facilitating the integration of global and spatially local information and improving the consistency of clustering assignments. Our results demonstrate that stMDA outperforms existing methods in identifying spatial domains across diverse platforms and species. Furthermore, stMDA excels in identifying spatially variable genes with high prognostic potential in cancer tissues. In conclusion, stMDA as a new tool of multi-modal data integration provides a powerful and flexible framework for analyzing SRT datasets, thereby advancing our understanding of intricate biological systems.


Assuntos
Perfilação da Expressão Gênica , Transcriptoma , Humanos , Perfilação da Expressão Gênica/métodos , Análise por Conglomerados , Biologia Computacional/métodos , Redes Neurais de Computação , Neoplasias/genética , Algoritmos
4.
Proc Natl Acad Sci U S A ; 120(37): e2302275120, 2023 Sep 12.
Artigo em Inglês | MEDLINE | ID: mdl-37669376

RESUMO

Alerting for imminent earthquakes is particularly challenging due to the high nonlinearity and nonstationarity of geodynamical phenomena. In this study, based on spatiotemporal information (STI) transformation for high-dimensional real-time data, we developed a model-free framework, i.e., real-time spatiotemporal information transformation learning (RSIT), for extending the nonlinear and nonstationary time series. Specifically, by transforming high-dimensional information of the global navigation satellite system into one-dimensional dynamics via the STI strategy, RSIT efficiently utilizes two criteria of the transformed one-dimensional dynamics, i.e., unpredictability and instability. Such two criteria contemporaneously signal a potential critical transition of the geodynamical system, thereby providing early-warning signals of possible upcoming earthquakes. RSIT explores both the spatial and temporal dynamics of real-world data on the basis of a solid theoretical background in nonlinear dynamics and delay-embedding theory. The effectiveness of RSIT was demonstrated on geodynamical data of recent earthquakes from a number of regions across at least 4 y and through further comparison with existing methods.

5.
Brief Bioinform ; 24(5)2023 09 20.
Artigo em Inglês | MEDLINE | ID: mdl-37544659

RESUMO

Gene regulatory networks (GRNs) reveal the complex molecular interactions that govern cell state. However, it is challenging for identifying causal relations among genes due to noisy data and molecular nonlinearity. Here, we propose a novel causal criterion, neighbor cross-mapping entropy (NME), for inferring GRNs from both steady data and time-series data. NME is designed to quantify 'continuous causality' or functional dependency from one variable to another based on their function continuity with varying neighbor sizes. NME shows superior performance on benchmark datasets, comparing with existing methods. By applying to scRNA-seq datasets, NME not only reliably inferred GRNs for cell types but also identified cell states. Based on the inferred GRNs and further their activity matrices, NME showed better performance in single-cell clustering and downstream analyses. In summary, based on continuous causality, NME provides a powerful tool in inferring causal regulations of GRNs between genes from scRNA-seq data, which is further exploited to identify novel cell types/states and predict cell type-specific network modules.


Assuntos
Algoritmos , Redes Reguladoras de Genes , Entropia , Fatores de Tempo , Análise por Conglomerados
6.
Brief Bioinform ; 24(5)2023 09 20.
Artigo em Inglês | MEDLINE | ID: mdl-37507114

RESUMO

Advances in single-cell multi-omics technology provide an unprecedented opportunity to fully understand cellular heterogeneity. However, integrating omics data from multiple modalities is challenging due to the individual characteristics of each measurement. Here, to solve such a problem, we propose a contrastive and generative deep self-expression model, called single-cell multimodal self-expressive integration (scMSI), which integrates the heterogeneous multimodal data into a unified manifold space. Specifically, scMSI first learns each omics-specific latent representation and self-expression relationship to consider the characteristics of different omics data by deep self-expressive generative model. Then, scMSI combines these omics-specific self-expression relations through contrastive learning. In such a way, scMSI provides a paradigm to integrate multiple omics data even with weak relation, which effectively achieves the representation learning and data integration into a unified framework. We demonstrate that scMSI provides a cohesive solution for a variety of analysis tasks, such as integration analysis, data denoising, batch correction and spatial domain detection. We have applied scMSI on various single-cell and spatial multimodal datasets to validate its high effectiveness and robustness in diverse data types and application scenarios.


Assuntos
Aprendizagem , Multiômica
8.
Nature ; 569(7757): 581-585, 2019 05.
Artigo em Inglês | MEDLINE | ID: mdl-31043749

RESUMO

Methylation of cytosine to 5-methylcytosine (5mC) is a prevalent DNA modification found in many organisms. Sequential oxidation of 5mC by ten-eleven translocation (TET) dioxygenases results in a cascade of additional epigenetic marks and promotes demethylation of DNA in mammals1,2. However, the enzymatic activity and function of TET homologues in other eukaryotes remains largely unexplored. Here we show that the green alga Chlamydomonas reinhardtii contains a 5mC-modifying enzyme (CMD1) that is a TET homologue and catalyses the conjugation of a glyceryl moiety to the methyl group of 5mC through a carbon-carbon bond, resulting in two stereoisomeric nucleobase products. The catalytic activity of CMD1 requires Fe(II) and the integrity of its binding motif His-X-Asp, which is conserved in Fe-dependent dioxygenases3. However, unlike previously described TET enzymes, which use 2-oxoglutarate as a co-substrate4, CMD1 uses L-ascorbic acid (vitamin C) as an essential co-substrate. Vitamin C donates the glyceryl moiety to 5mC with concurrent formation of glyoxylic acid and CO2. The vitamin-C-derived DNA modification is present in the genome of wild-type C. reinhardtii but at a substantially lower level in a CMD1 mutant strain. The fitness of CMD1 mutant cells during exposure to high light levels is reduced. LHCSR3, a gene that is critical for the protection of C. reinhardtii from photo-oxidative damage under high light conditions, is hypermethylated and downregulated in CMD1 mutant cells compared to wild-type cells, causing a reduced capacity for photoprotective non-photochemical quenching. Our study thus identifies a eukaryotic DNA base modification that is catalysed by a divergent TET homologue and unexpectedly derived from vitamin C, and describes its role as a potential epigenetic mark that may counteract DNA methylation in the regulation of photosynthesis.


Assuntos
5-Metilcitosina/metabolismo , Proteínas de Algas/metabolismo , Ácido Ascórbico/metabolismo , Biocatálise , Chlamydomonas reinhardtii/enzimologia , DNA/química , DNA/metabolismo , 5-Metilcitosina/química , Dióxido de Carbono/metabolismo , Metilação de DNA , Glioxilatos/metabolismo , Nucleosídeos/química , Nucleosídeos/metabolismo , Fotossíntese
9.
Nucleic Acids Res ; 51(20): e103, 2023 11 10.
Artigo em Inglês | MEDLINE | ID: mdl-37811885

RESUMO

Spatial transcriptomics characterizes gene expression profiles while retaining the information of the spatial context, providing an unprecedented opportunity to understand cellular systems. One of the essential tasks in such data analysis is to determine spatially variable genes (SVGs), which demonstrate spatial expression patterns. Existing methods only consider genes individually and fail to model the inter-dependence of genes. To this end, we present an analytic tool STAMarker for robustly determining spatial domain-specific SVGs with saliency maps in deep learning. STAMarker is a three-stage ensemble framework consisting of graph-attention autoencoders, multilayer perceptron (MLP) classifiers, and saliency map computation by the backpropagated gradient. We illustrate the effectiveness of STAMarker and compare it with serveral commonly used competing methods on various spatial transcriptomic data generated by different platforms. STAMarker considers all genes at once and is more robust when the dataset is very sparse. STAMarker could identify spatial domain-specific SVGs for characterizing spatial domains and enable in-depth analysis of the region of interest in the tissue section.


Assuntos
Aprendizado Profundo , Perfilação da Expressão Gênica , Análise de Dados , Redes Neurais de Computação , Transcriptoma
10.
Brief Bioinform ; 23(5)2022 09 20.
Artigo em Inglês | MEDLINE | ID: mdl-35580862

RESUMO

Complex diseases progression can be generally divided into three states, which are normal state, predisease/critical state and disease state. The sudden deterioration of diseases can be viewed as a bifurcation or a critical transition. Therefore, hunting for the tipping point or critical state is of great importance to prevent the disease deterioration. However, it is still a challenging task to detect the critical states of complex diseases with high-dimensional data, especially based on an individual. In this study, we develop a new method based on network fluctuation of molecules, temporal network flow entropy (TNFE) or temporal differential network flow entropy, to detect the critical states of complex diseases on the basis of each individual. By applying this method to a simulated dataset and six real diseases, including respiratory viral infections and tumors with four time-course and two stage-course high-dimensional omics datasets, the critical states before deterioration were detected and their dynamic network biomarkers were identified successfully. The results on the simulated dataset indicate that the TNFE method is robust under different noise strengths, and is also superior to the existing methods on detecting the critical states. Moreover, the analysis on the real datasets demonstrated the effectiveness of TNFE for providing early-warning signals on various diseases. In addition, we also predicted disease deterioration risk and identified drug targets for cancers based on stage-wise data.


Assuntos
Neoplasias , Biomarcadores , Progressão da Doença , Suscetibilidade a Doenças , Entropia , Humanos , Neoplasias/genética
11.
Brief Bioinform ; 23(2)2022 03 10.
Artigo em Inglês | MEDLINE | ID: mdl-35151228

RESUMO

Identifying differential genes over conditions provides insights into the mechanisms of biological processes and disease progression. Here we present an approach, the Kullback-Leibler divergence-based differential distribution (klDD), which provides a flexible framework for quantifying changes in higher-order statistical information of genes including mean and variance/covariation. The method can well detect subtle differences in gene expression distributions in contrast to mean or variance shifts of the existing methods. In addition to effectively identifying informational genes in terms of differential distribution, klDD can be directly applied to cancer subtyping, single-cell clustering and disease early-warning detection, which were all validated by various benchmark datasets.


Assuntos
Perfilação da Expressão Gênica , Transcriptoma , Análise por Conglomerados , Progressão da Doença , Perfilação da Expressão Gênica/métodos , Humanos
12.
Brief Bioinform ; 23(2)2022 03 10.
Artigo em Inglês | MEDLINE | ID: mdl-35079777

RESUMO

The integration of multi-omics data makes it possible to understand complex biological organisms at the system level. Numerous integration approaches have been developed by assuming a common underlying data space. Due to the noise and heterogeneity of biological data, the performance of these approaches is greatly affected. In this work, we propose a novel deep neural network architecture, named Deep Latent Space Fusion (DLSF), which integrates the multi-omics data by learning consistent manifold in the sample latent space for disease subtypes identification. DLSF is built upon a cycle autoencoder with a shared self-expressive layer, which can naturally and adaptively merge nonlinear features at each omics level into one unified sample manifold and produce adaptive representation of heterogeneous samples at the multi-omics level. We have assessed DLSF on various biological and biomedical datasets to validate its effectiveness. DLSF can efficiently and accurately capture the intrinsic manifold of the sample structures or sample clusters compared with other state-of-the-art methods, and DLSF yielded more significant outcomes for biological significance, survival prognosis and clinical relevance in application of cancer study in The Cancer Genome Atlas. Notably, as a deep case study, we determined a new molecular subtype of kidney renal clear cell carcinoma that may benefit immunotherapy in the viewpoint of multi-omics, and we further found potential subtype-specific biomarkers from multiple omics data, which were validated by independent datasets. In addition, we applied DLSF to identify potential therapeutic agents of different molecular subtypes of chronic lymphocytic leukemia, demonstrating the scalability of DLSF in diverse omics data types and application scenarios.


Assuntos
Neoplasias , Humanos , Neoplasias/genética
13.
Bioinformatics ; 39(7)2023 07 01.
Artigo em Inglês | MEDLINE | ID: mdl-37382572

RESUMO

MOTIVATION: Simultaneous profiling of multi-omics single-cell data represents exciting technological advancements for understanding cellular states and heterogeneity. Cellular indexing of transcriptomes and epitopes by sequencing allowed for parallel quantification of cell-surface protein expression and transcriptome profiling in the same cells; methylome and transcriptome sequencing from single cells allows for analysis of transcriptomic and epigenomic profiling in the same individual cells. However, effective integration method for mining the heterogeneity of cells over the noisy, sparse, and complex multi-modal data is in growing need. RESULTS: In this article, we propose a multi-modal high-order neighborhood Laplacian matrix optimization framework for integrating the multi-omics single-cell data: scHoML. Hierarchical clustering method was presented for analyzing the optimal embedding representation and identifying cell clusters in a robust manner. This novel method by integrating high-order and multi-modal Laplacian matrices would robustly represent the complex data structures and allow for systematic analysis at the multi-omics single-cell level, thus promoting further biological discoveries. AVAILABILITY AND IMPLEMENTATION: Matlab code is available at https://github.com/jianghruc/scHoML.


Assuntos
Algoritmos , Multiômica , Perfilação da Expressão Gênica , Transcriptoma , Análise por Conglomerados , Análise de Célula Única
14.
PLoS Comput Biol ; 19(6): e1011261, 2023 06.
Artigo em Inglês | MEDLINE | ID: mdl-37379341

RESUMO

The recent advances in single-cell RNA sequencing (scRNA-seq) techniques have stimulated efforts to identify and characterize the cellular composition of complex tissues. With the advent of various sequencing techniques, automated cell-type annotation using a well-annotated scRNA-seq reference becomes popular. But it relies on the diversity of cell types in the reference, which may not capture all the cell types present in the query data of interest. There are generally unseen cell types in the query data of interest because most data atlases are obtained for different purposes and techniques. Identifying previously unseen cell types is essential for improving annotation accuracy and uncovering novel biological discoveries. To address this challenge, we propose mtANN (multiple-reference-based scRNA-seq data annotation), a new method to automatically annotate query data while accurately identifying unseen cell types with the aid of multiple references. Key innovations of mtANN include the integration of deep learning and ensemble learning to improve prediction accuracy, and the introduction of a new metric that considers three complementary aspects to distinguish between unseen cell types and shared cell types. Additionally, we provide a data-driven method to adaptively select a threshold for identifying previously unseen cell types. We demonstrate the advantages of mtANN over state-of-the-art methods for unseen cell-type identification and cell-type annotation on two benchmark dataset collections, as well as its predictive power on a collection of COVID-19 datasets. The source code and tutorial are available at https://github.com/Zhangxf-ccnu/mtANN.


Assuntos
Análise de Sequência de RNA , Análise de Célula Única , Análise de Célula Única/métodos , Análise de Sequência de RNA/métodos , Humanos , COVID-19/diagnóstico , Software
15.
Proc Natl Acad Sci U S A ; 118(5)2021 02 02.
Artigo em Inglês | MEDLINE | ID: mdl-33500348

RESUMO

ZFP57 is a master regulator of genomic imprinting. It has both maternal and zygotic functions that are partially redundant in maintaining DNA methylation at some imprinting control regions (ICRs). In this study, we found that DNA methylation was lost at most known ICRs in Zfp57 mutant embryos. Furthermore, loss of ZFP57 caused loss of parent-of-origin-dependent monoallelic expression of the target imprinted genes. The allelic expression switch occurred in the ZFP57 target imprinted genes upon loss of differential DNA methylation at the ICRs in Zfp57 mutant embryos. Specifically, upon loss of ZFP57, the alleles of the imprinted genes located on the same chromosome with the originally methylated ICR switched their expression to mimic their counterparts on the other chromosome with unmethylated ICR. Consistent with our previous study, ZFP57 could regulate the NOTCH signaling pathway in mouse embryos by impacting allelic expression of a few regulators in the NOTCH pathway. In addition, the imprinted Dlk1 gene that has been implicated in the NOTCH pathway was significantly down-regulated in Zfp57 mutant embryos. Our allelic expression switch models apply to the examined target imprinted genes controlled by either maternally or paternally methylated ICRs. Our results support the view that ZFP57 controls imprinted expression of its target imprinted genes primarily through maintaining differential DNA methylation at the ICRs.


Assuntos
Alelos , Impressão Genômica , Proteínas Repressoras/genética , Animais , Metilação de DNA/genética , Embrião de Mamíferos/metabolismo , Feminino , Camundongos , RNA-Seq , Receptores Notch/metabolismo , Proteínas Repressoras/metabolismo , Transdução de Sinais/genética
16.
BMC Genomics ; 24(1): 619, 2023 Oct 18.
Artigo em Inglês | MEDLINE | ID: mdl-37853311

RESUMO

To explore the potential network markers and related signaling pathways of human B cells infected by COVID-19, we performed standardized integration and analysis of single-cell sequencing data to construct conditional cell-specific networks (CCSN) for each cell. Then the peripheral blood cells were clustered and annotated based on the conditional network degree matrix (CNDM) and gene expression matrix (GEM), respectively, and B cells were selected for further analysis. Besides, based on the CNDM of B cells, the hub genes and 'dark' genes (a gene has a significant difference between case and control samples not in a gene expression level but in a conditional network degree level) closely related to COVID-19 were revealed. Interestingly, some of the 'dark' genes and differential degree genes (DDGs) encoded key proteins in the JAK-STAT pathway, which had antiviral effects. The protein p21 encoded by the 'dark' gene CDKN1A was a key regulator for the COVID-19 infection-related signaling pathway. Elevated levels of proteins encoded by some DDGs were directly related to disease severity of patients with COVID-19. In short, the proteins encoded by 'dark' genes complement some missing links in COVID-19 and these signaling pathways played an important role in the growth and activation of B cells.


Assuntos
COVID-19 , Transdução de Sinais , Humanos , Transdução de Sinais/genética , Janus Quinases/genética , Fatores de Transcrição STAT/genética , COVID-19/genética , Redes Reguladoras de Genes , Perfilação da Expressão Gênica
17.
Brief Bioinform ; 22(4)2021 07 20.
Artigo em Inglês | MEDLINE | ID: mdl-33200787

RESUMO

Simultaneous profiling transcriptomic and chromatin accessibility information in the same individual cells offers an unprecedented resolution to understand cell states. However, computationally effective methods for the integration of these inherent sparse and heterogeneous data are lacking. Here, we present a single-cell multimodal variational autoencoder model, which combines three types of joint-learning strategies with a probabilistic Gaussian Mixture Model to learn the joint latent features that accurately represent these multilayer profiles. Studies on both simulated datasets and real datasets demonstrate that it has more preferable capability (i) dissecting cellular heterogeneity in the joint-learning space, (ii) denoising and imputing data and (iii) constructing the association between multilayer omics data, which can be used for understanding transcriptional regulatory mechanisms.


Assuntos
Cromatina/metabolismo , Bases de Dados Factuais , Aprendizado Profundo , Modelos Biológicos , Análise de Célula Única , Transcriptoma , Cromatina/genética , Humanos , Células K562
18.
Brief Bioinform ; 22(3)2021 05 20.
Artigo em Inglês | MEDLINE | ID: mdl-32422654

RESUMO

A single-sample network (SSN) is a biological molecular network constructed from single-sample data given a reference dataset and can provide insights into the mechanisms of individual diseases and aid in the development of personalized medicine. In this study, we proposed a computational method, a partial correlation-based single-sample network (P-SSN), which not only infers a network from each single-sample data given a reference dataset but also retains the direct interactions by excluding indirect interactions (https://github.com/hyhRise/P-SSN). By applying P-SSN to analyze tumor data from the Cancer Genome Atlas and single cell data, we validated the effectiveness of P-SSN in predicting driver mutation genes (DMGs), producing network distance, identifying subtypes and further classifying single cells. In particular, P-SSN is highly effective in predicting DMGs based on single-sample data. P-SSN is also efficient for subtyping complex diseases and for clustering single cells by introducing network distance between any two samples.


Assuntos
Biologia Computacional/métodos , Algoritmos , Conjuntos de Dados como Assunto , Perfilação da Expressão Gênica/métodos , Humanos , Medicina de Precisão , Análise de Célula Única/métodos
19.
Bioinformatics ; 38(9): 2459-2465, 2022 04 28.
Artigo em Inglês | MEDLINE | ID: mdl-35188181

RESUMO

MOTIVATION: Single-cell RNA sequencing (scRNA-seq) technology provides the possibility to study cell heterogeneity and cell development on the resolution of individual cells. Arguably, three of the most important computational targets on scRNA-seq data analysis are data visualization, cell clustering and trajectory inference. Although a substantial number of algorithms have been developed, most of them do not treat the three targets in a systematic or consistent manner. RESULTS: In this article, we propose an efficient scRNA-seq analysis framework, which accomplishes the three targets consistently by non-uniform ε-neighborhood (NEN) network. First, a network is generated by our NEN method, which combines the advantages of both k-nearest neighbors (KNN) and ε-neighborhood (EN) to represent the manifold that data points reside in gene space. Then from such a network, we use its layout, its community and further its shortest path to achieve the purpose of scRNA-seq data visualization, clustering and trajectory inference. The results on both synthetic and real datasets indicate that our NEN method not only can visually provide the global topological structure of a dataset accurately compared with t-SNE (t-Distributed Stochastic Neighbor Embedding) and UMAP (Uniform Manifold Approximation and Projection), but also has superior performances on clustering and pseudotime ordering of cells over the existing approaches. AVAILABILITY AND IMPLEMENTATION: This analysis method has been made into a python package called ccnet and is freely available at https://github.com/Just-Jia/ccNet. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Perfilação da Expressão Gênica , Análise de Célula Única , Análise de Sequência de RNA/métodos , Análise de Célula Única/métodos , Perfilação da Expressão Gênica/métodos , Análise de Dados , Análise por Conglomerados , Algoritmos
20.
Nucleic Acids Res ; 49(7): e37, 2021 04 19.
Artigo em Inglês | MEDLINE | ID: mdl-33434272

RESUMO

Multiple driver genes in individual patient samples may cause resistance to individual drugs in precision medicine. However, current computational methods have not studied how to fill the gap between personalized driver gene identification and combinatorial drug discovery for individual patients. Here, we developed a novel structural network controllability-based personalized driver genes and combinatorial drug identification algorithm (CPGD), aiming to identify combinatorial drugs for an individual patient by targeting personalized driver genes from network controllability perspective. On two benchmark disease datasets (i.e. breast cancer and lung cancer datasets), performance of CPGD is superior to that of other state-of-the-art driver gene-focus methods in terms of discovery rate among prior-known clinical efficacious combinatorial drugs. Especially on breast cancer dataset, CPGD evaluated synergistic effect of pairwise drug combinations by measuring synergistic effect of their corresponding personalized driver gene modules, which are affected by a given targeting personalized driver gene set of drugs. The results showed that CPGD performs better than existing synergistic combinatorial strategies in identifying clinical efficacious paired combinatorial drugs. Furthermore, CPGD enhanced cancer subtyping by computationally providing personalized side effect signatures for individual patients. In addition, CPGD identified 90 drug combinations candidates from SARS-COV2 dataset as potential drug repurposing candidates for recently spreading COVID-19.


Assuntos
Algoritmos , Neoplasias da Mama/tratamento farmacológico , Neoplasias da Mama/genética , Quimioterapia Combinada , Neoplasias Pulmonares/tratamento farmacológico , Neoplasias Pulmonares/genética , Medicina de Precisão/métodos , Neoplasias da Mama/classificação , COVID-19/genética , Conjuntos de Dados como Assunto , Reposicionamento de Medicamentos , Sinergismo Farmacológico , Efeitos Colaterais e Reações Adversas Relacionados a Medicamentos , Regulação Neoplásica da Expressão Gênica/genética , Genes Neoplásicos/genética , Humanos , Medição de Risco , Fluxo de Trabalho , Tratamento Farmacológico da COVID-19
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA