Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 5 de 5
Filter
Add more filters










Database
Language
Publication year range
1.
Bioinformatics ; 40(2)2024 02 01.
Article in English | MEDLINE | ID: mdl-38273708

ABSTRACT

MOTIVATION: Direct reprogramming (DR) is a process that directly converts somatic cells to target cells. Although DR via small molecules is safer than using transcription factors (TFs) in terms of avoidance of tumorigenic risk, the determination of DR-inducing small molecules is challenging. RESULTS: Here we present a novel in silico method, DIRECTEUR, to predict small molecules that replace TFs for DR. We extracted DR-characteristic genes using transcriptome profiles of cells in which DR was induced by TFs, and performed a variant of simulated annealing to explore small molecule combinations with similar gene expression patterns with DR-inducing TFs. We applied DIRECTEUR to predicting combinations of small molecules that convert fibroblasts into neurons or cardiomyocytes, and were able to reproduce experimentally verified and functionally related molecules inducing the corresponding conversions. The proposed method is expected to be useful for practical applications in regenerative medicine. AVAILABILITY AND IMPLEMENTATION: The code and data are available at the following link: https://github.com/HamanoLaboratory/DIRECTEUR.git.


Subject(s)
Transcription Factors , Transcriptome , Transcription Factors/metabolism , Cellular Reprogramming , Neurons/metabolism , Fibroblasts/metabolism
2.
Bioinformatics ; 38(Suppl_2): ii99-ii105, 2022 09 16.
Article in English | MEDLINE | ID: mdl-36124791

ABSTRACT

MOTIVATION: Direct cell conversion, direct reprogramming (DR), is an innovative technology that directly converts source cells to target cells without bypassing induced pluripotent stem cells. The use of small compounds (e.g. drugs) for DR can help avoid carcinogenic risk induced by gene transfection; however, experimentally identifying small compounds remains challenging because of combinatorial explosion. RESULTS: In this article, we present a new computational method, COMPRENDRE (combinatorial optimization of pathway regulations for direct reprograming), to elucidate the mechanism of small compound-based DR and predict new combinations of small compounds for DR. We estimated the potential target proteins of DR-inducing small compounds and identified a set of target pathways involving DR. We identified multiple DR-related pathways that have not previously been reported to induce neurons or cardiomyocytes from fibroblasts. To overcome the problem of combinatorial explosion, we developed a variant of a simulated annealing algorithm to identify the best set of compounds that can regulate DR-related pathways. Consequently, the proposed method enabled to predict new DR-inducing candidate combinations with fewer compounds and to successfully reproduce experimentally verified compounds inducing the direct conversion from fibroblasts to neurons or cardiomyocytes. The proposed method is expected to be useful for practical applications in regenerative medicine. AVAILABILITY AND IMPLEMENTATION: The code supporting the current study is available at the http://labo.bio.kyutech.ac.jp/~yamani/comprendre. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Induced Pluripotent Stem Cells , Algorithms , Fibroblasts , Neurons , Proteins
3.
Bioinformatics ; 38(10): 2839-2846, 2022 05 13.
Article in English | MEDLINE | ID: mdl-35561200

ABSTRACT

MOTIVATION: Direct reprogramming involves the direct conversion of fully differentiated mature cell types into various other cell types while bypassing an intermediate pluripotent state (e.g. induced pluripotent stem cells). Cell differentiation by direct reprogramming is determined by two types of transcription factors (TFs): pioneer factors (PFs) and cooperative TFs. PFs have the distinct ability to open chromatin aggregations, assemble a collective of cooperative TFs and activate gene expression. The experimental determination of two types of TFs is extremely difficult and costly. RESULTS: In this study, we developed a novel computational method, TRANSDIRE (TRANS-omics-based approach for DIrect REprogramming), to predict the TFs that induce direct reprogramming in various human cell types using multiple omics data. In the algorithm, potential PFs were predicted based on low signal chromatin regions, and the cooperative TFs were predicted through a trans-omics analysis of genomic data (e.g. enhancers), transcriptome data (e.g. gene expression profiles in human cells), epigenome data (e.g. chromatin immunoprecipitation sequencing data) and interactome data. We applied the proposed methods to the reconstruction of TFs that induce direct reprogramming from fibroblasts to six other cell types: hepatocytes, cartilaginous cells, neurons, cardiomyocytes, pancreatic cells and Paneth cells. We demonstrated that the methods successfully predicted TFs for most cell conversions with high accuracy. Thus, the proposed methods are expected to be useful for various practical applications in regenerative medicine. AVAILABILITY AND IMPLEMENTATION: The source code and data are available at the following website: http://figshare.com/s/b653781a5b9e6639972b. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Cellular Reprogramming , Induced Pluripotent Stem Cells , Cell Differentiation/genetics , Chromatin , Humans , Induced Pluripotent Stem Cells/metabolism , Transcription Factors/metabolism
4.
BMC Bioinformatics ; 20(1): 380, 2019 Jul 09.
Article in English | MEDLINE | ID: mdl-31288752

ABSTRACT

BACKGROUND: Alkaloids, a class of organic compounds that contain nitrogen bases, are mainly synthesized as secondary metabolites in plants and fungi, and they have a wide range of bioactivities. Although there are thousands of compounds in this class, few of their biosynthesis pathways are fully identified. In this study, we constructed a model to predict their precursors based on a novel kind of neural network called the molecular graph convolutional neural network. Molecular similarity is a crucial metric in the analysis of qualitative structure-activity relationships. However, it is sometimes difficult for current fingerprint representations to emphasize specific features for the target problems efficiently. It is advantageous to allow the model to select the appropriate features according to data-driven decisions for extracting more useful information, which influences a classification or regression problem substantially. RESULTS: In this study, we applied a neural network architecture for undirected graph representation of molecules. By encoding a molecule as an abstract graph and applying "convolution" on the graph and training the weight of the neural network framework, the neural network can optimize feature selection for the training problem. By incorporating the effects from adjacent atoms recursively, graph convolutional neural networks can extract the features of latent atoms that represent chemical features of a molecule efficiently. In order to investigate alkaloid biosynthesis, we trained the network to distinguish the precursors of 566 alkaloids, which are almost all of the alkaloids whose biosynthesis pathways are known, and showed that the model could predict starting substances with an averaged accuracy of 97.5%. CONCLUSION: We have showed that our model can predict more accurately compared to the random forest and general neural network when the variables and fingerprints are not selected, while the performance is comparable when we carefully select 507 variables from 18000 dimensions of descriptors. The prediction of pathways contributes to understanding of alkaloid synthesis mechanisms and the application of graph based neural network models to similar problems in bioinformatics would therefore be beneficial. We applied our model to evaluate the precursors of biosynthesis of 12000 alkaloids found in various organisms and found power-low-like distribution.


Subject(s)
Alkaloids/classification , Biosynthetic Pathways , Neural Networks, Computer , Algorithms , Alkaloids/chemistry , Metabolome , Models, Theoretical
5.
BMC Bioinformatics ; 19(1): 264, 2018 07 13.
Article in English | MEDLINE | ID: mdl-30005591

ABSTRACT

BACKGROUND: There are different and complicated associations between genes and diseases. Finding the causal associations between genes and specific diseases is still challenging. In this work we present a method to predict novel associations of genes and pathways with inflammatory bowel disease (IBD) by integrating information of differential gene expression, protein-protein interaction and known disease genes related to IBD. RESULTS: We downloaded IBD gene expression data from NCBI's Gene Expression Omnibus, performed statistical analysis to determine differentially expressed genes, collected known IBD genes from DisGeNet database, which were used to construct a IBD related PPI network with HIPPIE database. We adapted our graph-based clustering algorithm DPClusO to cluster the disease PPI network. We evaluated the statistical significance of the identified clusters in the context of determining the richness of IBD genes using Fisher's exact test and predicted novel genes related to IBD. We showed 93.8% of our predictions are correct in the context of other databases and published literatures related to IBD. CONCLUSIONS: Finding disease-causing genes is necessary for developing drugs with synergistic effect targeting many genes simultaneously. Here we present an approach to identify novel disease genes and pathways and discuss our approach in the context of IBD. The approach can be generalized to find disease-associated genes for other diseases.


Subject(s)
Gene Regulatory Networks , Inflammatory Bowel Diseases/genetics , Algorithms , Area Under Curve , Databases, Genetic , Gene Ontology , Humans , Protein Interaction Maps/genetics , ROC Curve , Reproducibility of Results
SELECTION OF CITATIONS
SEARCH DETAIL
...