Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 5 de 5
Filtrar
1.
Bioinformatics ; 39(11)2023 11 01.
Artículo en Inglés | MEDLINE | ID: mdl-37982748

RESUMEN

MOTIVATION: Identifying target promoters of active enhancers is a crucial step for realizing gene regulation and deciphering phenotypes and diseases. Up to now, several computational methods were developed to predict enhancer gene interactions, but they require either many epigenomic and transcriptomic experimental assays to generate cell-type (CT)-specific predictions or a single experiment applied to a large cohort of CTs to extract correlations between activities of regulatory elements. Thus, inferring CT-specific enhancer gene interactions in unstudied or poorly annotated CTs becomes a laborious and costly task. RESULTS: Here, we aim to infer CT-specific enhancer target interactions, using minimal experimental input. We introduce Cell-specific ENhancer Target pREdiction (CENTRE), a machine learning framework that predicts enhancer target interactions in a CT-specific manner, using only gene expression and ChIP-seq data for three histone modifications for the CT of interest. CENTRE exploits the wealth of available datasets and extracts cell-type agnostic statistics to complement the CT-specific information. CENTRE is thoroughly tested across many datasets and CTs and achieves equivalent or superior performance than existing algorithms that require massive experimental data. AVAILABILITY AND IMPLEMENTATION: CENTRE's open-source code is available at GitHub via https://github.com/slrvv/CENTRE.


Asunto(s)
Algoritmos , Elementos de Facilitación Genéticos , Humanos , Regulación de la Expresión Génica , Regiones Promotoras Genéticas , Epigenómica
2.
Bioinformatics ; 38(10): 2683-2691, 2022 05 13.
Artículo en Inglés | MEDLINE | ID: mdl-35561158

RESUMEN

MOTIVATION: Recombination is one of the essential genetic processes for sexually reproducing organisms, which can happen more frequently in some regions, called recombination hotspots. Although several factors, such as PRDM9 binding motifs, are known to be related to the hotspots, their contributions to the recombination hotspots have not been quantified, and other determinants are yet to be elucidated. Here, we propose a computational method, RHSNet, based on deep learning and signal processing, to identify and quantify the hotspot determinants in a purely data-driven manner, utilizing datasets from various studies, populations, sexes and species. RESULTS: RHSNet can significantly outperform other sequence-based methods on multiple datasets across different species, sexes and studies. In addition to being able to identify hotspot regions and the well-known determinants accurately, more importantly, RHSNet can quantify the determinants that contribute significantly to the recombination hotspot formation in the relation between PRDM9 binding motif, histone modification and GC content. Further cross-sex, cross-population and cross-species studies suggest that the proposed method has the generalization power and potential to identify and quantify the evolutionary determinant motifs. AVAILABILITY AND IMPLEMENTATION: https://github.com/frankchen121212/RHSNet. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Aprendizaje Profundo , Recombinación Genética , Meiosis
3.
Bioinformatics ; 33(23): 3696-3700, 2017 Dec 01.
Artículo en Inglés | MEDLINE | ID: mdl-28961713

RESUMEN

MOTIVATION: Drug combination therapy for treatment of cancers and other multifactorial diseases has the potential of increasing the therapeutic effect, while reducing the likelihood of drug resistance. In order to reduce time and cost spent in comprehensive screens, methods are needed which can model additive effects of possible drug combinations. RESULTS: We here show that the transcriptional response to combinatorial drug treatment at promoters, as measured by single molecule CAGE technology, is accurately described by a linear combination of the responses of the individual drugs at a genome wide scale. We also find that the same linear relationship holds for transcription at enhancer elements. We conclude that the described approach is promising for eliciting the transcriptional response to multidrug treatment at promoters and enhancers in an unbiased genome wide way, which may minimize the need for exhaustive combinatorial screens. AVAILABILITY AND IMPLEMENTATION: The CAGE sequence data used in this study is available in the DDBJ Sequence Read Archive (http://trace.ddbj.nig.ac.jp/index_e.html), accession number DRP001113. CONTACT: xin.gao@kaust.edu.sa or erik.arner@riken.jp. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Protocolos de Quimioterapia Combinada Antineoplásica/farmacología , Elementos de Facilitación Genéticos/efectos de los fármacos , Regiones Promotoras Genéticas/efectos de los fármacos , Genoma Humano , Humanos , Análisis de Regresión , Transcripción Genética/efectos de los fármacos
4.
Bioinformatics ; 30(16): 2324-33, 2014 Aug 15.
Artículo en Inglés | MEDLINE | ID: mdl-24771561

RESUMEN

MOTIVATION: Single nucleotide polymorphisms (SNPs) are considered the most frequently occurring DNA sequence variations. Several computational methods have been proposed for the classification of missense SNPs to neutral and disease associated. However, existing computational approaches fail to select relevant features by choosing them arbitrarily without sufficient documentation. Moreover, they are limited to the problem of missing values, imbalance between the learning datasets and most of them do not support their predictions with confidence scores. RESULTS: To overcome these limitations, a novel ensemble computational methodology is proposed. EnsembleGASVR facilitates a two-step algorithm, which in its first step applies a novel evolutionary embedded algorithm to locate close to optimal Support Vector Regression models. In its second step, these models are combined to extract a universal predictor, which is less prone to overfitting issues, systematizes the rebalancing of the learning sets and uses an internal approach for solving the missing values problem without loss of information. Confidence scores support all the predictions and the model becomes tunable by modifying the classification thresholds. An extensive study was performed for collecting the most relevant features for the problem of classifying SNPs, and a superset of 88 features was constructed. Experimental results show that the proposed framework outperforms well-known algorithms in terms of classification performance in the examined datasets. Finally, the proposed algorithmic framework was able to uncover the significant role of certain features such as the solvent accessibility feature, and the top-scored predictions were further validated by linking them with disease phenotypes. AVAILABILITY AND IMPLEMENTATION: Datasets and codes are freely available on the Web at http://prlab.ceid.upatras.gr/EnsembleGASVR/dataset-codes.zip. All the required information about the article is available through http://prlab.ceid.upatras.gr/EnsembleGASVR/site.html.


Asunto(s)
Algoritmos , Mutación Missense , Polimorfismo de Nucleótido Simple , Sustitución de Aminoácidos , Humanos
5.
Stem Cell Reports ; 16(5): 1381-1390, 2021 05 11.
Artículo en Inglés | MEDLINE | ID: mdl-33891873

RESUMEN

Controlling cell fate has great potential for regenerative medicine, drug discovery, and basic research. Although transcription factors are able to promote cell reprogramming and transdifferentiation, methods based on their upregulation often show low efficiency. Small molecules that can facilitate conversion between cell types can ameliorate this problem working through safe, rapid, and reversible mechanisms. Here, we present DECCODE, an unbiased computational method for identification of such molecules based on transcriptional data. DECCODE matches a large collection of drug-induced profiles for drug treatments against a large dataset of primary cell transcriptional profiles to identify drugs that either alone or in combination enhance cell reprogramming and cell conversion. Extensive validation in the context of human induced pluripotent stem cells shows that DECCODE is able to prioritize drugs and drug combinations enhancing cell reprogramming. We also provide predictions for cell conversion with single drugs and drug combinations for 145 different cell types.


Asunto(s)
Reprogramación Celular , Bibliotecas de Moléculas Pequeñas/farmacología , Algoritmos , Animales , Automatización , Reprogramación Celular/efectos de los fármacos , Análisis por Conglomerados , Células Madre Pluripotentes Inducidas/citología , Células Madre Pluripotentes Inducidas/metabolismo , Ratones , Reproducibilidad de los Resultados
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA