Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 96
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
Genome Res ; 33(3): 386-400, 2023 03.
Artigo em Inglês | MEDLINE | ID: mdl-36894325

RESUMO

Topologically associating domains (TADs) have emerged as basic structural and functional units of genome organization and have been determined by many computational methods from Hi-C contact maps. However, the TADs obtained by different methods vary greatly, which makes the accurate determination of TADs a challenging issue and hinders subsequent biological analyses about their organization and functions. Obvious inconsistencies among the TADs identified by different methods indeed make the statistical and biological properties of TADs overly depend on the chosen method rather than on the data. To this end, we use the consensus structural information captured by these methods to define the TAD separation landscape for decoding the consensus domain organization of the 3D genome. We show that the TAD separation landscape could be used to compare domain boundaries across multiple cell types for discovering conserved and divergent topological structures, decipher three types of boundary regions with diverse biological features, and identify consensus TADs (ConsTADs). We illustrate that these analyses could deepen our understanding of the relationships between the topological domains and chromatin states, gene expression, and DNA replication timing.


Assuntos
Montagem e Desmontagem da Cromatina , Cromatina , Consenso , Cromatina/genética , Genoma , Cromossomos
2.
Brief Bioinform ; 24(3)2023 05 19.
Artigo em Inglês | MEDLINE | ID: mdl-37055234

RESUMO

Identifying cancer driver genes plays a curial role in the development of precision oncology and cancer therapeutics. Although a plethora of methods have been developed to tackle this problem, the complex cancer mechanisms and intricate interactions between genes still make the identification of cancer driver genes challenging. In this work, we propose a novel machine learning method of heterophilic graph diffusion convolutional networks (called HGDCs) to boost cancer-driver gene identification. Specifically, HGDC first introduces graph diffusion to generate an auxiliary network for capturing the structurally similar nodes in a biomolecular network. Then, HGDC designs an improved message aggregation and propagation scheme to adapt to the heterophilic setting of biomolecular networks, alleviating the problem of driver gene features being smoothed by its neighboring dissimilar genes. Finally, HGDC uses a layer-wise attention classifier to predict the probability of one gene being a cancer driver gene. In the comparison experiments with other existing state-of-the-art methods, our HGDC achieves outstanding performance in identifying cancer driver genes. The experimental results demonstrate that HGDC not only effectively identifies well-known driver genes on different networks but also novel candidate cancer genes. Moreover, HGDC can effectively prioritize cancer driver genes for individual patients. Particularly, HGDC can identify patient-specific additional driver genes, which work together with the well-known driver genes to cooperatively promote tumorigenesis.


Assuntos
Neoplasias , Humanos , Neoplasias/genética , Redes Reguladoras de Genes , Medicina de Precisão , Oncogenes , Transformação Celular Neoplásica/genética
3.
Brief Bioinform ; 24(5)2023 09 20.
Artigo em Inglês | MEDLINE | ID: mdl-37738403

RESUMO

Identifying personalized cancer driver genes and further revealing their oncogenic mechanisms is critical for understanding the mechanisms of cell transformation and aiding clinical diagnosis. Almost all existing methods primarily focus on identifying driver genes at the cohort or individual level but fail to further uncover their underlying oncogenic mechanisms. To fill this gap, we present an interpretable framework, PhenoDriver, to identify personalized cancer driver genes, elucidate their roles in cancer development and uncover the association between driver genes and clinical phenotypic alterations. By analyzing 988 breast cancer patients, we demonstrate the outstanding performance of PhenoDriver in identifying breast cancer driver genes at the cohort level compared to other state-of-the-art methods. Otherwise, our PhenoDriver can also effectively identify driver genes with both recurrent and rare mutations in individual patients. We further explore and reveal the oncogenic mechanisms of some known and unknown breast cancer driver genes (e.g. TP53, MAP3K1, HTT, etc.) identified by PhenoDriver, and construct their subnetworks for regulating clinical abnormal phenotypes. Notably, most of our findings are consistent with existing biological knowledge. Based on the personalized driver profiles, we discover two existing and one unreported breast cancer subtypes and uncover their molecular mechanisms. These results intensify our understanding for breast cancer mechanisms, guide therapeutic decisions and assist in the development of targeted anticancer therapies.


Assuntos
Neoplasias da Mama , Humanos , Feminino , Neoplasias da Mama/genética , Oncogenes , Mutação , Fenótipo , Pesquisa
4.
Brief Bioinform ; 24(1)2023 01 19.
Artigo em Inglês | MEDLINE | ID: mdl-36642408

RESUMO

Current machine learning-based methods have achieved inspiring predictions in the scenarios of mono-type and multi-type drug-drug interactions (DDIs), but they all ignore enhancive and depressive pharmacological changes triggered by DDIs. In addition, these pharmacological changes are asymmetric since the roles of two drugs in an interaction are different. More importantly, these pharmacological changes imply significant topological patterns among DDIs. To address the above issues, we first leverage Balance theory and Status theory in social networks to reveal the topological patterns among directed pharmacological DDIs, which are modeled as a signed and directed network. Then, we design a novel graph representation learning model named SGRL-DDI (social theory-enhanced graph representation learning for DDI) to realize the multitask prediction of DDIs. SGRL-DDI model can capture the task-joint information by integrating relation graph convolutional networks with Balance and Status patterns. Moreover, we utilize task-specific deep neural networks to perform two tasks, including the prediction of enhancive/depressive DDIs and the prediction of directed DDIs. Based on DDI entries collected from DrugBank, the superiority of our model is demonstrated by the comparison with other state-of-the-art methods. Furthermore, the ablation study verifies that Balance and Status patterns help characterize directed pharmacological DDIs, and that the joint of two tasks provides better DDI representations than individual tasks. Last, we demonstrate the practical effectiveness of our model by a version-dependent test, where 88.47 and 81.38% DDI out of newly added entries provided by the latest release of DrugBank are validated in two predicting tasks respectively.


Assuntos
Aprendizado de Máquina , Redes Neurais de Computação , Interações Medicamentosas
5.
Methods ; 226: 61-70, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38631404

RESUMO

As the most abundant mRNA modification, m6A controls and influences many aspects of mRNA metabolism including the mRNA stability and degradation. However, the role of specific m6A sites in regulating gene expression still remains unclear. In additional, the multicollinearity problem caused by the correlation of methylation level of multiple m6A sites in each gene could influence the prediction performance. To address the above challenges, we propose an elastic-net regularized negative binomial regression model (called m6Aexpress-enet) to predict which m6A site could potentially regulate its gene expression. Comprehensive evaluations on simulated datasets demonstrate that m6Aexpress-enet could achieve the top prediction performance. Applying m6Aexpress-enet on real MeRIP-seq data from human lymphoblastoid cell lines, we have uncovered the complex regulatory pattern of predicted m6A sites and their unique enrichment pathway of the constructed co-methylation modules. m6Aexpress-enet proves itself as a powerful tool to enable biologists to discover the mechanism of m6A regulatory gene expression. Furthermore, the source code and the step-by-step implementation of m6Aexpress-enet is freely accessed at https://github.com/tengzhangs/m6Aexpress-enet.


Assuntos
Regulação da Expressão Gênica , RNA Mensageiro , Humanos , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Regulação da Expressão Gênica/genética , Biologia Computacional/métodos , Metilação , Software , Adenosina/metabolismo , Adenosina/genética , Adenosina/análogos & derivados , Análise de Regressão
6.
Brief Bioinform ; 23(4)2022 07 18.
Artigo em Inglês | MEDLINE | ID: mdl-35848879

RESUMO

As the most abundant RNA modification, N6-methyladenosine (m6A) plays an important role in various RNA activities including gene expression and translation. With the rapid application of MeRIP-seq technology, samples of multiple groups, such as the involved multiple viral/ bacterial infection or distinct cell differentiation stages, are extracted from same experimental unit. However, our current knowledge about how the dynamic m6A regulating gene expression and the role in certain biological processes (e.g. immune response in this complex context) is largely elusive due to lack of effective tools. To address this issue, we proposed a Bayesian hierarchical mixture model (called m6Aexpress-BHM) to predict m6A regulation of gene expression (m6A-reg-exp) in multiple groups of MeRIP-seq experiment with limited samples. Comprehensive evaluations of m6Aexpress-BHM on the simulated data demonstrate its high predicting precision and robustness. Applying m6Aexpress-BHM on three real-world datasets (i.e. Flaviviridae infection, infected time-points of bacteria and differentiation stages of dendritic cells), we predicted more m6A-reg-exp genes with positive regulatory mode that significantly participate in innate immune or adaptive immune pathways, revealing the underlying mechanism of the regulatory function of m6A during immune response. In addition, we also found that m6A may influence the expression of PD-1/PD-L1 via regulating its interacted genes. These results demonstrate the power of m6Aexpress-BHM, helping us understand the m6A regulatory function in immune system.


Assuntos
Adenosina , RNA , Adenosina/genética , Adenosina/metabolismo , Teorema de Bayes , Regulação da Expressão Gênica , Metilação , RNA/genética
7.
Brief Bioinform ; 22(6)2021 11 05.
Artigo em Inglês | MEDLINE | ID: mdl-33954583

RESUMO

Considerable evidence suggests that during the progression of cancer initiation, the state transition from wellness to disease is not necessarily smooth but manifests switch-like nonlinear behaviors, preventing the cancer prediction and early interventional therapy for patients. Understanding the mechanism of such wellness-to-disease transitions is a fundamental and challenging task. Despite the advances in flux theory of nonequilibrium dynamics and 'critical slowing down'-based system resilience theory, a system-level approach still lacks to fully describe this state transition. Here, we present a novel framework (called bioRFR) to quantify such wellness-to-disease transition during cancer initiation through uncovering the biological system's resilience function from gene expression data. We used bioRFR to reconstruct the biologically and dynamically significant resilience functions for cancer initiation processes (e.g. BRCA, LUSC and LUAD). The resilience functions display the similar resilience pattern with hysteresis feature but different numbers of tipping points, which implies that once the cell become cancerous, it is very difficult or even impossible to reverse to the normal state. More importantly, bioRFR can measure the severe degree of cancer patients and identify the personalized key genes that are associated with the individual system's state transition from normal to tumor in resilience perspective, indicating that bioRFR can contribute to personalized medicine and targeted cancer therapy.


Assuntos
Algoritmos , Transformação Celular Neoplásica , Suscetibilidade a Doenças , Modelos Biológicos , Neoplasias/etiologia , Neoplasias/metabolismo , Biomarcadores Tumorais , Transformação Celular Neoplásica/genética , Transformação Celular Neoplásica/metabolismo , Biologia Computacional/métodos , Bases de Dados Genéticas , Progressão da Doença , Regulação Neoplásica da Expressão Gênica , Humanos , Neoplasias/patologia , Transcriptoma
8.
Methods ; 203: 207-213, 2022 07.
Artigo em Inglês | MEDLINE | ID: mdl-35462009

RESUMO

With the accumulation of ChIP-seq data, convolution neural network (CNN)-based methods have been proposed for predicting transcription factor binding sites (TFBSs). However, biological experimental data are noisy, and are often treated as ground truth for both training and testing. Particularly, existing classification methods ignore the false positive and false negative which are caused by the error in the peak calling stage, and therefore, they can easily overfit to biased training data. It leads to inaccurate identification and inability to reveal the rules of governing protein-DNA binding. To address this issue, we proposed a meta learning-based CNN method (namely TFBS_MLCNN or MLCNN for short) for suppressing the influence of noisy labels data and accurately recognizing TFBSs from ChIP-seq data. Guided by a small amount of unbiased meta-data, MLCNN can adaptively learn an explicit weighting function from ChIP-seq data and update the parameter of classifier simultaneously. The weighting function overcomes the influence of biased training data on classifier by assigning a weight to each sample according to its training loss. The experimental results on 424 ChIP-seq datasets show that MLCNN not only outperforms other existing state-of-the-art CNN methods, but can also detect noisy samples which are given the small weights to suppress them. The suppression ability to the noisy samples can be revealed through the visualization of samples' weights. Several case studies demonstrate that MLCNN has superior performance to others.


Assuntos
Sequenciamento de Cromatina por Imunoprecipitação , Redes Neurais de Computação , Sítios de Ligação , Ligação Proteica , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo
9.
Methods ; 203: 125-138, 2022 07.
Artigo em Inglês | MEDLINE | ID: mdl-35436514

RESUMO

N6-methyladenosine (m6A) is the most abundant eukaryotic modification internal mRNA, which plays the crucial roles in the occurrence and development of cancer. However, current knowledge about m6A-mediated functional circuit and key genes targeted by m6A methylation in cancer is mostly elusive. Thus, here we proposed a novel network-based approach (called m6Acancer-Net) to identify m6A-mediated driver genes and their associated network in specific type of cancer, such as acute myeloid leukemia. m6A-mediated cancer driver genes are defined as genes mediated by m6A methylation, significantly mutated, and functionally interacted in cancer. m6Acancer-Net identified the m6A-mediated cancer driver genes by combining gene functional interaction network with RNA methylation, gene expression and mutation information. A cancer-specific gene-site heterogeneous network was firstly constructed by connecting the m6A site co-methylation network with the functional interaction pruned gene co-expression network generated from large scale gene expression profile of specific cancer. Then, the functional m6A-mediated genes were identified by selecting the m6A regulators as seed genes to perform the random walk with restart algorithm on the gene-site heterogeneous network. Finally, m6A-mediated cancer driver gene subnetworks were constructed by performing the heat diffusion of mutation frequency for functional m6A-mediated genes in protein-protein interaction networks. The experimental results of m6Acancer-Net on the acute myeloid leukemia (AML) and glioblastoma multiforme (GBM) data from TCGA project show that the m6A-mediated caner driver genes identified by m6Acancer-Net are targeted by m6A regulators, and mediate significant cancer-related pathways. They play crucial roles in development and prognostic stratification of cancer. Moreover, 15 m6A-mediated cancer driver genes identified in AML are validated by literatures to mediate AML progress, and 14 m6A-mediated cancer driver genes identified in GBM are validated by literatures to participate in development of GBM. m6Acancer-Net is reliable to identify the functionally significant m6A-mediated driver genes in specific cancer, and it can effectively facilitate the understanding of regulatory and therapeutic mechanism of cancer driver genes in epitranscriptome layer.


Assuntos
Redes Reguladoras de Genes , Glioblastoma , Algoritmos , Glioblastoma/genética , Humanos , Mutação , Mapas de Interação de Proteínas/genética
10.
Methods ; 203: 167-178, 2022 07.
Artigo em Inglês | MEDLINE | ID: mdl-35314342

RESUMO

N6-methyladenosine (m6A) is the most abundant form of mRNA modification and plays an important role in regulating gene expression. However, the mechanisms of m6A regulated gene expression in cell or condition specific, are still poorly understood. Even though, some methods are able to predict m6A regulated expression (m6A-reg-exp) genes in specific context, they don't introduce the m6A reader binding information, while this information can help to predict m6A-reg-exp genes and more clearly to explain the mechanisms of m6A-mediated gene expression process. Thus, by integrating m6A sites and reader binding information, we proposed a novel method (called m6Aexpress-Reader) to predict m6A-reg-exp genes from limited MeRIP-seq data in specific context. m6Aexpress-Reader adopts the reader binding signal strength to weight the posterior distribution of the estimated regulatory coefficients for enhancing the prediction power. By using m6Aexpress-Reader, we found the complex characteristic of m6A on gene expression regulation and the distinct regulated pattern of m6A-reg-exp genes with different reader binding. m6A readers, YTHDF2 or IGF2BP1/3 all play an important role in various cancers and the key cancer pathways. In addition, m6Aexpress-Reader reveals the distinct m6A regulated mode of reader targeted genes in cancer. m6Aexpress-Reader could be a useful tool for studying the m6A regulation on reader target genes in specific context and it can be freely accessible at: https://github.com/NWPU-903PR/m6AexpressReader.


Assuntos
Neoplasias , Proteínas de Ligação a RNA , Adenosina/genética , Adenosina/metabolismo , Regulação da Expressão Gênica , Humanos , Neoplasias/genética , Proteínas de Ligação a RNA/genética , Proteínas de Ligação a RNA/metabolismo
11.
Nucleic Acids Res ; 49(7): e37, 2021 04 19.
Artigo em Inglês | MEDLINE | ID: mdl-33434272

RESUMO

Multiple driver genes in individual patient samples may cause resistance to individual drugs in precision medicine. However, current computational methods have not studied how to fill the gap between personalized driver gene identification and combinatorial drug discovery for individual patients. Here, we developed a novel structural network controllability-based personalized driver genes and combinatorial drug identification algorithm (CPGD), aiming to identify combinatorial drugs for an individual patient by targeting personalized driver genes from network controllability perspective. On two benchmark disease datasets (i.e. breast cancer and lung cancer datasets), performance of CPGD is superior to that of other state-of-the-art driver gene-focus methods in terms of discovery rate among prior-known clinical efficacious combinatorial drugs. Especially on breast cancer dataset, CPGD evaluated synergistic effect of pairwise drug combinations by measuring synergistic effect of their corresponding personalized driver gene modules, which are affected by a given targeting personalized driver gene set of drugs. The results showed that CPGD performs better than existing synergistic combinatorial strategies in identifying clinical efficacious paired combinatorial drugs. Furthermore, CPGD enhanced cancer subtyping by computationally providing personalized side effect signatures for individual patients. In addition, CPGD identified 90 drug combinations candidates from SARS-COV2 dataset as potential drug repurposing candidates for recently spreading COVID-19.


Assuntos
Algoritmos , Neoplasias da Mama/tratamento farmacológico , Neoplasias da Mama/genética , Quimioterapia Combinada , Neoplasias Pulmonares/tratamento farmacológico , Neoplasias Pulmonares/genética , Medicina de Precisão/métodos , Neoplasias da Mama/classificação , COVID-19/genética , Conjuntos de Dados como Assunto , Reposicionamento de Medicamentos , Sinergismo Farmacológico , Efeitos Colaterais e Reações Adversas Relacionados a Medicamentos , Regulação Neoplásica da Expressão Gênica/genética , Genes Neoplásicos/genética , Humanos , Medição de Risco , Fluxo de Trabalho , Tratamento Farmacológico da COVID-19
12.
Nucleic Acids Res ; 49(20): e116, 2021 11 18.
Artigo em Inglês | MEDLINE | ID: mdl-34417605

RESUMO

N6-methyladenosine (m6A) is the most abundant form of mRNA modification and controls many aspects of RNA metabolism including gene expression. However, the mechanisms by which m6A regulates cell- and condition-specific gene expression are still poorly understood, partly due to a lack of tools capable of identifying m6A sites that regulate gene expression under different conditions. Here we develop m6A-express, the first algorithm for predicting condition-specific m6A regulation of gene expression (m6A-reg-exp) from limited methylated RNA immunoprecipitation sequencing (MeRIP-seq) data. Comprehensive evaluations of m6A-express using simulated and real data demonstrated its high prediction specificity and sensitivity. When only a few MeRIP-seq samples may be available for the cellular or treatment conditions, m6A-express is particularly more robust than the log-linear model. Using m6A-express, we reported that m6A writers, METTL3 and METTL14, competitively regulate the transcriptional processes by mediating m6A-reg-exp of different genes in Hela cells. In contrast, METTL3 induces different m6A-reg-exp of a distinct group of genes in HepG2 cells to regulate protein functions and stress-related processes. We further uncovered unique m6A-reg-exp patterns in human brain and intestine tissues, which are enriched in organ-specific processes. This study demonstrates the effectiveness of m6A-express in predicting condition-specific m6A-reg-exp and highlights the complex, condition-specific nature of m6A-regulation of gene expression.


Assuntos
Adenosina/análogos & derivados , Processamento Pós-Transcricional do RNA , Análise de Sequência de RNA/métodos , Adenosina/metabolismo , Encéfalo/metabolismo , Células HeLa , Células Hep G2 , Humanos , Mucosa Intestinal/metabolismo
13.
BMC Bioinformatics ; 23(1): 341, 2022 Aug 16.
Artigo em Inglês | MEDLINE | ID: mdl-35974311

RESUMO

BACKGROUND: Cancer is a heterogeneous disease in which tumor genes cooperate as well as adapt and evolve to the changing conditions for individual patients. It is a meaningful task to discover the personalized cancer driver genes that can provide diagnosis and target drug for individual patients. However, most of existing methods mainly ranks potential personalized cancer driver genes by considering the patient-specific nodes information on the gene/protein interaction network. These methods ignore the personalized edge weight information in gene interaction network, leading to false positive results. RESULTS: In this work, we presented a novel algorithm (called PDGPCS) to predict the Personalized cancer Driver Genes based on the Prize-Collecting Steiner tree model by considering the personalized edge weight information. PDGPCS first constructs the personalized weighted gene interaction network by integrating the personalized gene expression data and prior known gene/protein interaction network knowledge. Then the gene mutation data and pathway data are integrated to quantify the impact of each mutant gene on every dysregulated pathway with the prize-collecting Steiner tree model. Finally, according to the mutant gene's aggregated impact score on all dysregulated pathways, the mutant genes are ranked for prioritizing the personalized cancer driver genes. Experimental results on four TCGA cancer datasets show that PDGPCS has better performance than other personalized driver gene prediction methods. In addition, we verified that the personalized edge weight of gene interaction network can improve the prediction performance. CONCLUSIONS: PDGPCS can more accurately identify the personalized driver genes and takes a step further toward personalized medicine and treatment. The source code of PDGPCS can be freely downloaded from https://github.com/NWPU-903PR/PDGPCS .


Assuntos
Redes Reguladoras de Genes , Neoplasias , Medicina de Precisão , Algoritmos , Humanos , Mutação , Neoplasias/diagnóstico , Neoplasias/genética , Oncogenes
14.
Brief Bioinform ; 21(5): 1641-1662, 2020 09 25.
Artigo em Inglês | MEDLINE | ID: mdl-31711128

RESUMO

To understand tumor heterogeneity in cancer, personalized driver genes (PDGs) need to be identified for unraveling the genotype-phenotype associations corresponding to particular patients. However, most of the existing driver-focus methods mainly pay attention on the cohort information rather than on individual information. Recent developing computational approaches based on network control principles are opening a new way to discover driver genes in cancer, particularly at an individual level. To provide comprehensive perspectives of network control methods on this timely topic, we first considered the cancer progression as a network control problem, in which the expected PDGs are altered genes by oncogene activation signals that can change the individual molecular network from one health state to the other disease state. Then, we reviewed the network reconstruction methods on single samples and introduced novel network control methods on single-sample networks to identify PDGs in cancer. Particularly, we gave a performance assessment of the network structure control-based PDGs identification methods on multiple cancer datasets from TCGA, for which the data and evaluation package also are publicly available. Finally, we discussed future directions for the application of network control methods to identify PDGs in cancer and diverse biological processes.


Assuntos
Neoplasias/genética , Algoritmos , Biologia Computacional/métodos , Heterogeneidade Genética , Humanos , Mutação
15.
Bioinformatics ; 37(23): 4477-4484, 2021 12 07.
Artigo em Inglês | MEDLINE | ID: mdl-34175939

RESUMO

MOTIVATION: The driver genes play a key role in the evolutionary process of cancer. Effectively identifying these driver genes is crucial to cancer diagnosis and treatment. However, due to the high heterogeneity of cancers, it remains challenging to identify the driver genes for individual patients. Although some computational methods have been proposed to tackle this problem, they seldom consider the fact that the genes functionally similar to the well-established driver genes may likely play similar roles in cancer process, which potentially promotes the driver gene identification. Thus, here we developed a novel approach of IMCDriver to promote the driver gene identification both for cohorts and individual patients. RESULTS: IMCDriver first considers the well-established driver genes as prior information, and adopts the using multi-omics data (e.g. somatic mutation, gene expression and protein-protein interaction) to compute the similarity between patients/genes. Then, IMCDriver prioritizes the personalized mutated genes according to their functional similarity to the well-established driver genes via Inductive Matrix Completion. Finally, IMCDriver identifies the highly rank-ordered genes as the personalized driver genes. The results on five cancer datasets from the Cancer Genome Consortium show that our IMCDriver outperforms other existing state-of-the-art methods both in the cohort and patient-specific driver gene identification. IMCDriver also reveals some novel driver genes that potentially drive cancer development. In addition, even for the driver genes rarely mutated among a population, IMCDriver can still identify them and prioritize them with high priorities. AVAILABILITY AND IMPLEMENTATION: Code available at https://github.com/NWPU-903PR/IMCDriver. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Algoritmos , Neoplasias , Humanos , Mutação , Neoplasias/genética , Medicina de Precisão , Multiômica
16.
Bioinformatics ; 37(22): 4277-4279, 2021 11 18.
Artigo em Inglês | MEDLINE | ID: mdl-33974000

RESUMO

MOTIVATION: N 6-methyladenosine (m6A) is the most abundant mammalian mRNA methylation with versatile functions. To date, although a number of bioinformatics tools have been developed for location discovery of m6A modification, functional understanding is still quite limited. As the focus of RNA epigenetics gradually shifts from site discovery to functional studies, there is an urgent need for user-friendly tools to identify and explore the functional relevance of context-specific m6A methylation to gain insights into the epitranscriptome layer of gene expression regulation. RESULTS: We introduced here Funm6AViewer, a novel platform to identify, prioritize and visualize the functional gene interaction networks mediated by dynamic m6A RNA methylation unveiled from a case control study. By taking the differential RNA methylation data and differential gene expression data, both of which can be inferred from the widely used MeRIP-seq data, as the inputs, Funm6AViewer enables a series of analysis, including: (i) examining the distribution of differential m6A sites, (ii) prioritizing the genes mediated by dynamic m6A methylation and (iii) characterizing functionally the gene regulatory networks mediated by condition-specific m6A RNA methylation. Funm6AViewer should effectively facilitate the understanding of the epitranscriptome circuitry mediated by this reversible RNA modification. AVAILABILITY AND IMPLEMENTATION: Funm6AViewer is available both as a convenient web server (https://www.xjtlu.edu.cn/biologicalsciences/funm6aviewer) with graphical interface and as an independent R package (https://github.com/NWPU-903PR/Funm6AViewer) for local usage. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Epigênese Genética , RNA , Animais , Metilação , Estudos de Casos e Controles , RNA/metabolismo , Redes Reguladoras de Genes , Adenosina/metabolismo , Mamíferos/genética
17.
Anal Biochem ; 646: 114631, 2022 06 01.
Artigo em Inglês | MEDLINE | ID: mdl-35227661

RESUMO

It is crucial to identify DDIs and explore their underlying mechanism (e.g., DDIs types) for polypharmacy safety. However, the detection of DDIs in assays is still time-consuming and costly, due to the need for experimental search over a large space of drug combinations. Thus, many computational methods have been developed to predict DDIs, most of them focusing on whether a drug interacts with another or not. And a few deep learning-based methods address a more realistic screening task for identifying various DDI types, but they assume a DDI only triggers one pharmacological effect, while a DDI can trigger more types of pharmacological effects. Thus, here we proposed a novel end-to-end deep learning-based method (called deepMDDI) for the Multi-label prediction of Drug-Drug Interactions. deepMDDI contains an encoder derived from relational graph convolutional networks and a tensor-like decoder to uniformly model interactions. deepMDDI is not only efficient for DDI transductive prediction, but also inductive prediction. The experimental results show that our model is superior to other state-of-the-art deep learning-based methods. We also validated the power of deepMDDI in the DDIs multi-label prediction and found several new valid DDIs in the case study. In conclusion, deepMDDI is beneficial to uncover the mechanism and regularity of DDIs.


Assuntos
Interações Medicamentosas
18.
PLoS Comput Biol ; 17(5): e1008962, 2021 05.
Artigo em Inglês | MEDLINE | ID: mdl-33956788

RESUMO

In the past few years, a wealth of sample-specific network construction methods and structural network control methods has been proposed to identify sample-specific driver nodes for supporting the Sample-Specific network Control (SSC) analysis of biological networked systems. However, there is no comprehensive evaluation for these state-of-the-art methods. Here, we conducted a performance assessment for 16 SSC analysis workflows by using the combination of 4 sample-specific network reconstruction methods and 4 representative structural control methods. This study includes simulation evaluation of representative biological networks, personalized driver genes prioritization on multiple cancer bulk expression datasets with matched patient samples from TCGA, and cell marker genes and key time point identification related to cell differentiation on single-cell RNA-seq datasets. By widely comparing analysis of existing SSC analysis workflows, we provided the following recommendations and banchmarking workflows. (i) The performance of a network control method is strongly dependent on the up-stream sample-specific network method, and Cell-Specific Network construction (CSN) method and Single-Sample Network (SSN) method are the preferred sample-specific network construction methods. (ii) After constructing the sample-specific networks, the undirected network-based control methods are more effective than the directed network-based control methods. In addition, these data and evaluation pipeline are freely available on https://github.com/WilfongGuo/Benchmark_control.


Assuntos
Análise de Célula Única/métodos , Algoritmos , Biologia Computacional/métodos , Redes Reguladoras de Genes , Humanos , RNA-Seq/métodos
19.
Molecules ; 27(9)2022 May 07.
Artigo em Inglês | MEDLINE | ID: mdl-35566354

RESUMO

The treatment of complex diseases by using multiple drugs has become popular. However, drug-drug interactions (DDI) may give rise to the risk of unanticipated adverse effects and even unknown toxicity. Therefore, for polypharmacy safety it is crucial to identify DDIs and explore their underlying mechanisms. The detection of DDI in the wet lab is expensive and time-consuming, due to the need for experimental research over a large volume of drug combinations. Although many computational methods have been developed to predict DDIs, most of these are incapable of predicting potential DDIs between drugs within the DDI network and new drugs from outside the DDI network. In addition, they are not designed to explore the underlying mechanisms of DDIs and lack interpretative capacity. Thus, here we propose a novel method of GNN-DDI to predict potential DDIs by constructing a five-layer graph attention network to identify k-hops low-dimensional feature representations for each drug from its chemical molecular graph, concatenating all identified features of each drug pair, and inputting them into a MLP predictor to obtain the final DDI prediction score. The experimental results demonstrate that our GNN-DDI is suitable for each of two DDI predicting scenarios, namely the potential DDIs among known drugs in the DDI network and those between drugs within the DDI network and new drugs from outside DDI network. The case study indicates that our method can explore the specific drug substructures that lead to the potential DDIs, which helps to improve interpretability and discover the underlying interaction mechanisms of drug pairs.


Assuntos
Produtos Biológicos , Efeitos Colaterais e Reações Adversas Relacionados a Medicamentos , Interações Medicamentosas , Humanos , Redes Neurais de Computação , Projetos de Pesquisa
20.
BMC Bioinformatics ; 21(1): 341, 2020 Aug 04.
Artigo em Inglês | MEDLINE | ID: mdl-32753028

RESUMO

BACKGROUND: Single Molecule Sequencing (SMS) technology can produce longer reads with higher sequencing error rate. Mapping these reads to a reference genome is often the most fundamental and computing-intensive step for downstream analysis. Most existing mapping tools generally adopt the traditional seed-and-extend strategy, and the candidate aligned regions for each query read are selected either by counting the number of matched seeds or chaining a group of seeds. However, for all the existing mapping tools, the coverage ratio of the alignment region to the query read is lower, and the read alignment quality and efficiency need to be improved. Here, we introduce smsMap, a novel mapping tool that is specifically designed to map the long reads of SMS to a reference genome. RESULTS: smsMap was evaluated with other existing seven SMS mapping tools (e.g., BLASR, minimap2, and BWA-MEM) on both simulated and real-life SMS datasets. The experimental results show that smsMap can efficiently achieve higher aligned read coverage ratio and has higher sensitivity that can align more sequences and bases to the reference genome. Additionally, smsMap is more robust to sequencing errors. CONCLUSIONS: smsMap is computationally efficient to align SMS reads, especially for the larger size of the reference genome (e.g., H. sapiens genome with over 3 billion base pairs). The source code of smsMap can be freely downloaded from https://github.com/NWPU-903PR/smsMap .


Assuntos
Alinhamento de Sequência , Análise de Sequência de DNA/métodos , Software , Algoritmos , Simulação por Computador , Bases de Dados Genéticas , Escherichia coli/genética , Humanos , Fatores de Tempo
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA