Pesquisa | Biblioteca Virtual em Saúde

1.

GOAT: Gene-level biomarker discovery from multi-Omics data using graph ATtention neural network for eosinophilic asthma subtype.

Jeong, Dabin; Koo, Bonil; Oh, Minsik; Kim, Tae-Bum; Kim, Sun.

Bioinformatics ; 39(10)2023 10 03.

Artigo em Inglês | MEDLINE | ID: mdl-37740295

RESUMO

MOTIVATION: Asthma is a heterogeneous disease where various subtypes are established and molecular biomarkers of the subtypes are yet to be discovered. Recent availability of multi-omics data paved a way to discover molecular biomarkers for the subtypes. However, multi-omics biomarker discovery is challenging because of the complex interplay between different omics layers. RESULTS: We propose a deep attention model named Gene-level biomarker discovery from multi-Omics data using graph ATtention neural network (GOAT) for identifying molecular biomarkers for eosinophilic asthma subtypes with multi-omics data. GOAT identifies genes that discriminate subtypes using a graph neural network by modeling complex interactions among genes as the attention mechanism in the deep learning model. In experiments with multi-omics profiles of the COREA (Cohort for Reality and Evolution of Adult Asthma in Korea) asthma cohort of 300 patients, GOAT outperforms existing models and suggests interpretable biological mechanisms underlying asthma subtypes. Importantly, GOAT identified genes that are distinct only in terms of relationship with other genes through attention. To better understand the role of biomarkers, we further investigated two transcription factors, CTNNB1 and JUN, captured by GOAT. We were successful in showing the role of the transcription factors in eosinophilic asthma pathophysiology in a network propagation and transcriptional network analysis, which were not distinct in terms of gene expression level differences. AVAILABILITY AND IMPLEMENTATION: Source code is available https://github.com/DabinJeong/Multi-omics_biomarker. The preprocessed data underlying this article is accessible in data folder of the github repository. Raw data are available in Multi-Omics Platform at http://203.252.206.90:5566/, and it can be accessible when requested.

Assuntos

Asma , Multiômica , Adulto , Humanos , Animais , Asma/genética , Biomarcadores , Redes Neurais de Computação , Fatores de Transcrição , Cabras

2.

Adipocyte HIF2α functions as a thermostat via PKA Cα regulation in beige adipocytes.

Han, Ji Seul; Jeon, Yong Geun; Oh, Minsik; Lee, Gung; Nahmgoong, Hahn; Han, Sang Mun; Choi, Jeehye; Kim, Ye Young; Shin, Kyung Cheul; Kim, Jiwon; Jo, Kyuri; Choe, Sung Sik; Park, Eun Jung; Kim, Sun; Kim, Jae Bum.

Nat Commun ; 13(1): 3268, 2022 06 07.

Artigo em Inglês | MEDLINE | ID: mdl-35672324

RESUMO

Thermogenic adipocytes generate heat to maintain body temperature against hypothermia in response to cold. Although tight regulation of thermogenesis is required to prevent energy sources depletion, the molecular details that tune thermogenesis are not thoroughly understood. Here, we demonstrate that adipocyte hypoxia-inducible factor α (HIFα) plays a key role in calibrating thermogenic function upon cold and re-warming. In beige adipocytes, HIFα attenuates protein kinase A (PKA) activity, leading to suppression of thermogenic activity. Mechanistically, HIF2α suppresses PKA activity by inducing miR-3085-3p expression to downregulate PKA catalytic subunit α (PKA Cα). Ablation of adipocyte HIF2α stimulates retention of beige adipocytes, accompanied by increased PKA Cα during re-warming after cold stimuli. Moreover, administration of miR-3085-3p promotes beige-to-white transition via downregulation of PKA Cα and mitochondrial abundance in adipocyte HIF2α deficient mice. Collectively, these findings suggest that HIF2α-dependent PKA regulation plays an important role as a thermostat through dynamic remodeling of beige adipocytes.

Assuntos

Adipócitos Bege , Fatores de Transcrição Hélice-Alça-Hélice Básicos/metabolismo , Subunidades Catalíticas da Proteína Quinase Dependente de AMP Cíclico/metabolismo , MicroRNAs , Adipócitos , Adipócitos Bege/metabolismo , Tecido Adiposo Branco/metabolismo , Animais , Temperatura Baixa , Camundongos , MicroRNAs/metabolismo , Termogênese/genética

3.

Construction of Condition-Specific Gene Regulatory Network Using Kernel Canonical Correlation Analysis.

Jeong, Dabin; Lim, Sangsoo; Lee, Sangseon; Oh, Minsik; Cho, Changyun; Seong, Hyeju; Jung, Woosuk; Kim, Sun.

Front Genet ; 12: 652623, 2021.

Artigo em Inglês | MEDLINE | ID: mdl-34093651

RESUMO

Gene expression profile or transcriptome can represent cellular states, thus understanding gene regulation mechanisms can help understand how cells respond to external stress. Interaction between transcription factor (TF) and target gene (TG) is one of the representative regulatory mechanisms in cells. In this paper, we present a novel computational method to construct condition-specific transcriptional networks from transcriptome data. Regulatory interaction between TFs and TGs is very complex, specifically multiple-to-multiple relations. Experimental data from TF Chromatin Immunoprecipitation sequencing is useful but produces one-to-multiple relations between TF and TGs. On the other hand, co-expression networks of genes can be useful for constructing condition transcriptional networks, but there are many false positive relations in co-expression networks. In this paper, we propose a novel method to construct a condition-specific and combinatorial transcriptional network, applying kernel canonical correlation analysis (kernel CCA) to identify multiple-to-multiple TF-TG relations in certain biological condition. Kernel CCA is a well-established statistical method for computing the correlation of a group of features vs. another group of features. We, therefore, employed kernel CCA to embed TFs and TGs into a new space where the correlation of TFs and TGs are reflected. To demonstrate the usefulness of our network construction method, we used the blood transcriptome data for the investigation on the response to high fat diet in a human and an arabidopsis data set for the investigation on the response to cold/heat stress. Our method detected not only important regulatory interactions reported in previous studies but also novel TF-TG relations where a module of TF is regulating a module of TGs upon specific stress.

4.

Polarizing and depolarizing charge injection through a thin dielectric layer in a ferroelectric-dielectric bilayer.

Park, Hyeon Woo; Hyun, Seung Dam; Lee, In Soo; Lee, Suk Hyun; Lee, Yong Bin; Oh, Minsik; Kim, Beom Yong; Ryoo, Seung Gyu; Hwang, Cheol Seong.

Nanoscale ; 13(4): 2556-2572, 2021 Feb 04.

Artigo em Inglês | MEDLINE | ID: mdl-33476352

RESUMO

Charge injection from the near-by-electrode can occur during ferroelectric switching in the ferroelectric-dielectric bilayer due to the high field applied to the adjacent dielectric layers. The aim of this study is to investigate the effect of the charge injection by separating the amount of switched polarization and the injected charge density. A dynamic model of the injection-involved switching is developed and exploited to elucidate the mechanism. The model demonstrates that the amount of injected charges, which compensates for the bound charge of the polarization, can be larger, smaller, or identical to that of the polarization. This model further describes the analytical conditions of this compensation state. The model predictions are validated by the newly introduced ramping pulse measurements involving the serially connected TiN/Hf0.5Zr0.5O2/TiN and TiN/amorphous Al2O3/TiN, which are capable of separating the injected charge from the switched polarization. The dynamic model, along with the electrical measurements, enables the quantitative prediction and estimation of the internal potential and the effective charge, which is the sum of the bound and injected charges in the bilayer. This work provides fundamental insights into field-effect devices such as the next-generation ferroelectric-field-effect-transistors with NAND architecture based on uncompensated ferroelectric charges.

5.

Erratum to: Machine learning-based analysis of multi-omics data on the cloud for investigating gene regulations.

Oh, Minsik; Park, Sungjoon; Kim, Sun; Chae, Heejoon.

Brief Bioinform ; 22(1): 607, 2021 01 18.

Artigo em Inglês | MEDLINE | ID: mdl-32337577

6.

Machine learning-based analysis of multi-omics data on the cloud for investigating gene regulations.

Oh, Minsik; Park, Sungjoon; Kim, Sun; Chae, Heejoon.

Brief Bioinform ; 22(1): 66-76, 2021 01 18.

Artigo em Inglês | MEDLINE | ID: mdl-32227074

RESUMO

Gene expressions are subtly regulated by quantifiable measures of genetic molecules such as interaction with other genes, methylation, mutations, transcription factor and histone modifications. Integrative analysis of multi-omics data can help scientists understand the condition or patient-specific gene regulation mechanisms. However, analysis of multi-omics data is challenging since it requires not only the analysis of multiple omics data sets but also mining complex relations among different genetic molecules by using state-of-the-art machine learning methods. In addition, analysis of multi-omics data needs quite large computing infrastructure. Moreover, interpretation of the analysis results requires collaboration among many scientists, often requiring reperforming analysis from different perspectives. Many of the aforementioned technical issues can be nicely handled when machine learning tools are deployed on the cloud. In this survey article, we first survey machine learning methods that can be used for gene regulation study, and we categorize them according to five different goals: gene regulatory subnetwork discovery, disease subtype analysis, survival analysis, clinical prediction and visualization. We also summarize the methods in terms of multi-omics input types. Then, we explain why the cloud is potentially a good solution for the analysis of multi-omics data, followed by a survey of two state-of-the-art cloud systems, Galaxy and BioVLAB. Finally, we discuss important issues when the cloud is used for the analysis of multi-omics data for the gene regulation study.

Assuntos

Computação em Nuvem , Biologia Computacional/métodos , Animais , Regulação da Expressão Gênica , Humanos , Aprendizado de Máquina

7.

mirTime: identifying condition-specific targets of microRNA in time-series transcript data using Gaussian process model and spherical vector clustering.

Kang, Hyejin; Ahn, Hongryul; Jo, Kyuri; Oh, Minsik; Kim, Sun.

Bioinformatics ; 37(11): 1544-1553, 2021 Jul 12.

Artigo em Inglês | MEDLINE | ID: mdl-31070735

RESUMO

BACKGROUND: MicroRNAs, small noncoding RNAs, are conserved in many species, and they are key regulators that mediate post-transcriptional gene silencing. Since biologists cannot perform experiments for each of target genes of thousands of microRNAs in numerous specific conditions, prediction on microRNA target genes has been extensively investigated. A general framework is a two-step process of selecting target candidates based on sequence and binding energy features and then predicting targets based on negative correlation of microRNAs and their targets. However, there are few methods that are designed for target predictions using time-series gene expression data. RESULTS: In this article, we propose a new pipeline, mirTime, that predicts microRNA targets by integrating sequence features and time-series expression profiles in a specific experimental condition. The most important feature of mirTime is that it uses the Gaussian process regression model to measure data at unobserved or unpaired time points. In experiments with two datasets in different experimental conditions and cell types, condition-specific target modules reported in the original papers were successfully predicted with our pipeline. The context specificity of target modules was assessed with three (correlation-based, target gene-based and network-based) evaluation criteria. mirTime showed better performance than existing expression-based microRNA target prediction methods in all three criteria. AVAILABILITY AND IMPLEMENTATION: mirTime is available at https://github.com/mirTime/mirtime. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

8.

DRIM: A Web-Based System for Investigating Drug Response at the Molecular Level by Condition-Specific Multi-Omics Data Integration.

Oh, Minsik; Park, Sungjoon; Lee, Sangseon; Lee, Dohoon; Lim, Sangsoo; Jeong, Dabin; Jo, Kyuri; Jung, Inuk; Kim, Sun.

Front Genet ; 11: 564792, 2020.

Artigo em Inglês | MEDLINE | ID: mdl-33281870

RESUMO

Pharmacogenomics is the study of how genes affect a person's response to drugs. Thus, understanding the effect of drug at the molecular level can be helpful in both drug discovery and personalized medicine. Over the years, transcriptome data upon drug treatment has been collected and several databases compiled before drug treatment cancer cell multi-omics data with drug sensitivity (IC 50, AUC) or time-series transcriptomic data after drug treatment. However, analyzing transcriptome data upon drug treatment is challenging since more than 20,000 genes interact in complex ways. In addition, due to the difficulty of both time-series analysis and multi-omics integration, current methods can hardly perform analysis of databases with different data characteristics. One effective way is to interpret transcriptome data in terms of well-characterized biological pathways. Another way is to leverage state-of-the-art methods for multi-omics data integration. In this paper, we developed Drug Response analysis Integrating Multi-omics and time-series data (DRIM), an integrative multi-omics and time-series data analysis framework that identifies perturbed sub-pathways and regulation mechanisms upon drug treatment. The system takes drug name and cell line identification numbers or user's drug control/treat time-series gene expression data as input. Then, analysis of multi-omics data upon drug treatment is performed in two perspectives. For the multi-omics perspective analysis, IC 50-related multi-omics potential mediator genes are determined by embedding multi-omics data to gene-centric vector space using a tensor decomposition method and an autoencoder deep learning model. Then, perturbed pathway analysis of potential mediator genes is performed. For the time-series perspective analysis, time-varying perturbed sub-pathways upon drug treatment are constructed. Additionally, a network involving transcription factors (TFs), multi-omics potential mediator genes, and perturbed sub-pathways is constructed, and paths to perturbed pathways from TFs are determined by an influence maximization method. To demonstrate the utility of our system, we provide analysis results of sub-pathway regulatory mechanisms in breast cancer cell lines of different drug sensitivity. DRIM is available at: http://biohealth.snu.ac.kr/software/DRIM/.

9.

DeepFam: deep learning based alignment-free method for protein family modeling and prediction.

Seo, Seokjun; Oh, Minsik; Park, Youngjune; Kim, Sun.

Bioinformatics ; 34(13): i254-i262, 2018 07 01.

Artigo em Inglês | MEDLINE | ID: mdl-29949966

RESUMO

Motivation: A large number of newly sequenced proteins are generated by the next-generation sequencing technologies and the biochemical function assignment of the proteins is an important task. However, biological experiments are too expensive to characterize such a large number of protein sequences, thus protein function prediction is primarily done by computational modeling methods, such as profile Hidden Markov Model (pHMM) and k-mer based methods. Nevertheless, existing methods have some limitations; k-mer based methods are not accurate enough to assign protein functions and pHMM is not fast enough to handle large number of protein sequences from numerous genome projects. Therefore, a more accurate and faster protein function prediction method is needed. Results: In this paper, we introduce DeepFam, an alignment-free method that can extract functional information directly from sequences without the need of multiple sequence alignments. In extensive experiments using the Clusters of Orthologous Groups (COGs) and G protein-coupled receptor (GPCR) dataset, DeepFam achieved better performance in terms of accuracy and runtime for predicting functions of proteins compared to the state-of-the-art methods, both alignment-free and alignment-based methods. Additionally, we showed that DeepFam has a power of capturing conserved regions to model protein families. In fact, DeepFam was able to detect conserved regions documented in the Prosite database while predicting functions of proteins. Our deep learning method will be useful in characterizing functions of the ever increasing protein sequences. Availability and implementation: Codes are available at https://bhi-kimlab.github.io/DeepFam.

Assuntos

Aprendizado Profundo , Proteínas/metabolismo , Análise de Sequência de Proteína/métodos , Software , Proteínas/química , Receptores Acoplados a Proteínas G/química , Receptores Acoplados a Proteínas G/metabolismo

10.

miRTarVis+: Web-based interactive visual analytics tool for microRNA target predictions.

L'Yi, Sehi; Jung, Daekyoung; Oh, Minsik; Kim, Bohyoung; Freishtat, Robert J; Giri, Mamta; Hoffman, Eric; Seo, Jinwook.

Methods ; 124: 78-88, 2017 07 15.

Artigo em Inglês | MEDLINE | ID: mdl-28600227

RESUMO

In this paper, we present miRTarVis+, a Web-based interactive visual analytics tool for miRNA target predictions and integrative analyses of multiple prediction results. Various microRNA (miRNA) target prediction algorithms have been developed to improve sequence-based miRNA target prediction by exploiting miRNA-mRNA expression profile data. There are also a few analytics tools to help researchers predict targets of miRNAs. However, there still is a need for improving the performance for miRNA prediction algorithms and more importantly for interactive visualization tools for an integrative analysis of multiple prediction results. miRTarVis+ has an intuitive interface to support the analysis pipeline of load, filter, predict, and visualize. It can predict targets of miRNA by adopting Bayesian inference and maximal information-based nonparametric exploration (MINE) analyses as well as conventional correlation and mutual information analyses. miRTarVis+ supports an integrative analysis of multiple prediction results by providing an overview of multiple prediction results and then allowing users to examine a selected miRNA-mRNA network in an interactive treemap and node-link diagram. To evaluate the effectiveness of miRTarVis+, we conducted two case studies using miRNA-mRNA expression profile data of asthma and breast cancer patients and demonstrated that miRTarVis+ helps users more comprehensively analyze targets of miRNA from miRNA-mRNA expression profile data. miRTarVis+ is available at http://hcil.snu.ac.kr/research/mirtarvisplus.

Assuntos

Asma/genética , Neoplasias da Mama/genética , MicroRNAs/genética , RNA Mensageiro/genética , Análise de Sequência de RNA/métodos , Interface Usuário-Computador , Algoritmos , Asma/diagnóstico , Asma/metabolismo , Asma/patologia , Sequência de Bases , Teorema de Bayes , Sítios de Ligação , Neoplasias da Mama/diagnóstico , Neoplasias da Mama/metabolismo , Neoplasias da Mama/patologia , Feminino , Perfilação da Expressão Gênica , Regulação da Expressão Gênica , Humanos , Internet , MicroRNAs/metabolismo , RNA Mensageiro/metabolismo

11.

Literature-based condition-specific miRNA-mRNA target prediction.

Oh, Minsik; Rhee, Sungmin; Moon, Ji Hwan; Chae, Heejoon; Lee, Sunwon; Kang, Jaewoo; Kim, Sun.

PLoS One ; 12(3): e0174999, 2017.

Artigo em Inglês | MEDLINE | ID: mdl-28362846

RESUMO

miRNAs are small non-coding RNAs that regulate gene expression by binding to the 3'-UTR of genes. Many recent studies have reported that miRNAs play important biological roles by regulating specific mRNAs or genes. Many sequence-based target prediction algorithms have been developed to predict miRNA targets. However, these methods are not designed for condition-specific target predictions and produce many false positives; thus, expression-based target prediction algorithms have been developed for condition-specific target predictions. A typical strategy to utilize expression data is to leverage the negative control roles of miRNAs on genes. To control false positives, a stringent cutoff value is typically set, but in this case, these methods tend to reject many true target relationships, i.e., false negatives. To overcome these limitations, additional information should be utilized. The literature is probably the best resource that we can utilize. Recent literature mining systems compile millions of articles with experiments designed for specific biological questions, and the systems provide a function to search for specific information. To utilize the literature information, we used a literature mining system, BEST, that automatically extracts information from the literature in PubMed and that allows the user to perform searches of the literature with any English words. By integrating omics data analysis methods and BEST, we developed Context-MMIA, a miRNA-mRNA target prediction method that combines expression data analysis results and the literature information extracted based on the user-specified context. In the pathway enrichment analysis using genes included in the top 200 miRNA-targets, Context-MMIA outperformed the four existing target prediction methods that we tested. In another test on whether prediction methods can re-produce experimentally validated target relationships, Context-MMIA outperformed the four existing target prediction methods. In summary, Context-MMIA allows the user to specify a context of the experimental data to predict miRNA targets, and we believe that Context-MMIA is very useful for predicting condition-specific miRNA targets.

Assuntos

Literatura , MicroRNAs/metabolismo , RNA Mensageiro/metabolismo , Regiões 3' não Traduzidas/genética , Regiões 3' não Traduzidas/fisiologia , Algoritmos , Biologia Computacional , Humanos , PubMed , Software

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA