Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 6 de 6
Filtrar
Mais filtros

Base de dados
Tipo de documento
País de afiliação
Intervalo de ano de publicação
1.
Nucleic Acids Res ; 49(18): 10397-10418, 2021 10 11.
Artigo em Inglês | MEDLINE | ID: mdl-34508352

RESUMO

Understanding how epigenetic variation in non-coding regions is involved in distal gene-expression regulation is an important problem. Regulatory regions can be associated to genes using large-scale datasets of epigenetic and expression data. However, for regions of complex epigenomic signals and enhancers that regulate many genes, it is difficult to understand these associations. We present StitchIt, an approach to dissect epigenetic variation in a gene-specific manner for the detection of regulatory elements (REMs) without relying on peak calls in individual samples. StitchIt segments epigenetic signal tracks over many samples to generate the location and the target genes of a REM simultaneously. We show that this approach leads to a more accurate and refined REM detection compared to standard methods even on heterogeneous datasets, which are challenging to model. Also, StitchIt REMs are highly enriched in experimentally determined chromatin interactions and expression quantitative trait loci. We validated several newly predicted REMs using CRISPR-Cas9 experiments, thereby demonstrating the reliability of StitchIt. StitchIt is able to dissect regulation in superenhancers and predicts thousands of putative REMs that go unnoticed using peak-based approaches suggesting that a large part of the regulome might be uncharted water.


Assuntos
Cromatina/metabolismo , Análise de Dados , Elementos Facilitadores Genéticos , Epigênese Genética , Regulação da Expressão Gênica , Células Endoteliais da Veia Umbilical Humana , Humanos
2.
J Chem Phys ; 154(24): 244114, 2021 Jun 28.
Artigo em Inglês | MEDLINE | ID: mdl-34241352

RESUMO

Drug efficacy depends on its capacity to permeate across the cell membrane. We consider the prediction of passive drug-membrane permeability coefficients. Beyond the widely recognized correlation with hydrophobicity, we additionally consider the functional relationship between passive permeation and acidity. To discover easily interpretable equations that explain the data well, we use the recently proposed sure-independence screening and sparsifying operator (SISSO), an artificial-intelligence technique that combines symbolic regression with compressed sensing. Our study is based on a large in silico dataset of 0.4 × 106 small molecules extracted from coarse-grained simulations. We rationalize the equation suggested by SISSO via an analysis of the inhomogeneous solubility-diffusion model in several asymptotic acidity regimes. We further extend our analysis to the dependence on lipid-membrane composition. Lipid-tail unsaturation plays a key role but surprisingly contributes stepwise rather than proportionally. Our results are in line with previously observed changes in permeability, suggesting the distinction between liquid-disordered and liquid-ordered permeation. Together, compressed sensing with analytically derived asymptotes establish and validate an accurate, broadly applicable, and interpretable equation for passive permeability across both drug and lipid-tail chemistry.


Assuntos
Membrana Celular/química , Preparações Farmacêuticas/química , Permeabilidade
3.
Bioinformatics ; 34(17): 3050-3051, 2018 09 01.
Artigo em Inglês | MEDLINE | ID: mdl-29659721

RESUMO

Motivation: Genome-wide measurements of paired miRNA and gene expression data have enabled the prediction of competing endogenous RNAs (ceRNAs). It has been shown that the sponge effect mediated by protein-coding as well as non-coding ceRNAs can play an important regulatory role in the cell in health and disease. Therefore, many computational methods for the computational identification of ceRNAs have been suggested. In particular, methods based on Conditional Mutual Information (CMI) have shown promising results. However, the currently available implementation is slow and cannot be used to perform computations on a large scale. Results: Here, we present JAMI, a Java tool that uses a non-parametric estimator for CMI values from gene and miRNA expression data. We show that JAMI speeds up the computation of ceRNA networks by a factor of ∼70 compared to currently available implementations. Further, JAMI supports multi-threading to make use of common multi-core architectures for further performance gain. Requirements: Java 8. Availability and implementation: JAMI is available as open-source software from https://github.com/SchulzLab/JAMI. Supplementary information: Supplementary data are available at Bioinformatics online.


Assuntos
Redes Reguladoras de Genes , RNA/genética , Software
5.
Nat Commun ; 11(1): 4428, 2020 09 04.
Artigo em Inglês | MEDLINE | ID: mdl-32887879

RESUMO

Although machine learning (ML) models promise to substantially accelerate the discovery of novel materials, their performance is often still insufficient to draw reliable conclusions. Improved ML models are therefore actively researched, but their design is currently guided mainly by monitoring the average model test error. This can render different models indistinguishable although their performance differs substantially across materials, or it can make a model appear generally insufficient while it actually works well in specific sub-domains. Here, we present a method, based on subgroup discovery, for detecting domains of applicability (DA) of models within a materials class. The utility of this approach is demonstrated by analyzing three state-of-the-art ML models for predicting the formation energy of transparent conducting oxides. We find that, despite having a mutually indistinguishable and unsatisfactory average error, the models have DAs with distinctive features and notably improved performance.

6.
J Proteomics ; 74(8): 1201-17, 2011 Aug 12.
Artigo em Inglês | MEDLINE | ID: mdl-21443973

RESUMO

To understand physiological processes, insight into protein complexes is very important. Through a combination of blue native gel electrophoresis and LC-MS/MS, we were able to isolate protein complexes and identify their potential subunits from Nicotiana tabacum cv. Bright Yellow-2. For this purpose, a bioanalytical approach was used that works without a priori knowledge of the interacting proteins. Different clustering methods (e.g., k-means and hierarchical clustering) and a biclustering approach were evaluated according to their ability to group proteins by their migration profile and to correlate the proteins to a specific complex. The biclustering approach was identified as a very powerful tool for the exploration of protein complexes of whole cell lysates since it allows for the promiscuous nature of proteins. Furthermore, it searches for associations between proteins that co-occur frequently throughout the BN gel, which increases the confidence of the putative associations between co-migrating proteins. The statistical significance and biological relevance of the profile clusters were verified using functional gene ontology annotation. The proof of concept for identifying protein complexes by our BN PAGE/LC-MS/MS approach is provided through the analysis of known protein complexes. Both well characterized long-lived protein complexes as well as potential temporary sequential multi-enzyme complexes were characterized.


Assuntos
Complexos Multiproteicos/isolamento & purificação , Nicotiana/química , Proteínas de Plantas/isolamento & purificação , Cromatografia Líquida/métodos , Análise por Conglomerados , Eletroforese em Gel Bidimensional/métodos , Eletroforese em Gel de Poliacrilamida , Espectrometria de Massas em Tandem/métodos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA