Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 14 de 14
Filtrar
1.
Am J Physiol Lung Cell Mol Physiol ; 324(3): L245-L258, 2023 03 01.
Artigo em Inglês | MEDLINE | ID: mdl-36625483

RESUMO

The most common preclinical, in vivo model to study lung fibrosis is the bleomycin-induced lung fibrosis model in 2- to 3-mo-old mice. Although this model resembles key aspects of idiopathic pulmonary fibrosis (IPF), there are limitations in its predictability for the human disease. One of the main differences is the juvenile age of animals that are commonly used in experiments, resembling humans of around 20 yr. Because IPF patients are usually older than 60 yr, aging appears to play an important role in the pathogenesis of lung fibrosis. Therefore, we compared young (3 months) and old mice (21 months) 21 days after intratracheal bleomycin instillation. Analyzing lung transcriptomics (mRNAs and miRNAs) and proteomics, we found most pathways to be similarly regulated in young and old mice. However, old mice show imbalanced protein homeostasis as well as an increased inflammatory state in the fibrotic phase compared to young mice. Comparisons with published human transcriptomic data sets (GSE47460, GSE32537, and GSE24206) revealed that the gene signature of old animals correlates significantly better with IPF patients, and it also turned human healthy individuals better into "IPF patients" using an approach based on predictive disease modeling. Both young and old animals show similar molecular hallmarks of IPF in the bleomycin-induced lung fibrosis model, although old mice more closely resemble several features associated with IPF in comparison to young animals.


Assuntos
Bleomicina , Fibrose Pulmonar Idiopática , Humanos , Camundongos , Animais , Bleomicina/farmacologia , Transcriptoma , Proteômica , Pulmão/metabolismo , Fibrose Pulmonar Idiopática/patologia , Modelos Animais de Doenças , Camundongos Endogâmicos C57BL
2.
Bioinformatics ; 37(6): 845-852, 2021 05 05.
Artigo em Inglês | MEDLINE | ID: mdl-33070187

RESUMO

MOTIVATION: Network diffusion and label propagation are fundamental tools in computational biology, with applications like gene-disease association, protein function prediction and module discovery. More recently, several publications have introduced a permutation analysis after the propagation process, due to concerns that network topology can bias diffusion scores. This opens the question of the statistical properties and the presence of bias of such diffusion processes in each of its applications. In this work, we characterized some common null models behind the permutation analysis and the statistical properties of the diffusion scores. We benchmarked seven diffusion scores on three case studies: synthetic signals on a yeast interactome, simulated differential gene expression on a protein-protein interaction network and prospective gene set prediction on another interaction network. For clarity, all the datasets were based on binary labels, but we also present theoretical results for quantitative labels. RESULTS: Diffusion scores starting from binary labels were affected by the label codification and exhibited a problem-dependent topological bias that could be removed by the statistical normalization. Parametric and non-parametric normalization addressed both points by being codification-independent and by equalizing the bias. We identified and quantified two sources of bias-mean value and variance-that yielded performance differences when normalizing the scores. We provided closed formulae for both and showed how the null covariance is related to the spectral properties of the graph. Despite none of the proposed scores systematically outperformed the others, normalization was preferred when the sought positive labels were not aligned with the bias. We conclude that the decision on bias removal should be problem and data-driven, i.e. based on a quantitative analysis of the bias and its relation to the positive entities. AVAILABILITY: The code is publicly available at https://github.com/b2slab/diffuBench and the data underlying this article are available at https://github.com/b2slab/retroData. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Biologia Computacional , Mapas de Interação de Proteínas , Difusão , Estudos Prospectivos , Proteínas/genética
3.
Bioinformatics ; 37(1): 137-139, 2021 Apr 09.
Artigo em Inglês | MEDLINE | ID: mdl-33367476

RESUMO

SUMMARY: High-throughput screening yields vast amounts of biological data which can be highly challenging to interpret. In response, knowledge-driven approaches emerged as possible solutions to analyze large datasets by leveraging prior knowledge of biomolecular interactions represented in the form of biological networks. Nonetheless, given their size and complexity, their manual investigation quickly becomes impractical. Thus, computational approaches, such as diffusion algorithms, are often employed to interpret and contextualize the results of high-throughput experiments. Here, we present MultiPaths, a framework consisting of two independent Python packages for network analysis. While the first package, DiffuPy, comprises numerous commonly used diffusion algorithms applicable to any generic network, the second, DiffuPath, enables the application of these algorithms on multi-layer biological networks. To facilitate its usability, the framework includes a command line interface, reproducible examples and documentation. To demonstrate the framework, we conducted several diffusion experiments on three independent multi-omics datasets over disparate networks generated from pathway databases, thus, highlighting the ability of multi-layer networks to integrate multiple modalities. Finally, the results of these experiments demonstrate how the generation of harmonized networks from disparate databases can improve predictive performance with respect to individual resources. AVAILABILITY AND IMPLEMENTATION: DiffuPy and DiffuPath are publicly available under the Apache License 2.0 at https://github.com/multipaths. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

4.
Anal Chem ; 93(31): 10772-10778, 2021 08 10.
Artigo em Inglês | MEDLINE | ID: mdl-34320315

RESUMO

Untargeted metabolomics using liquid chromatography coupled to mass spectrometry (LC-MS) allows the detection of thousands of metabolites in biological samples. However, LC-MS data annotation is still considered a major bottleneck in the metabolomics pipeline since only a small fraction of the metabolites present in the sample can be annotated with the required confidence level. Here, we introduce mWISE (metabolomics wise inference of speck entities), an R package for context-based annotation of LC-MS data. The algorithm consists of three main steps aimed at (i) matching mass-to-charge ratio values to the Kyoto Encyclopedia of Genes and Genomes (KEGG) database, (ii) clustering and filtering the potential KEGG candidates, and (iii) building a final prioritized list using diffusion in graphs. The algorithm performance is evaluated with three publicly available studies using both positive and negative ionization modes. We have also compared mWISE to other available annotation algorithms in terms of their performance and computation time. In particular, we explored four different configurations for mWISE, and all four of them outperform xMSannotator (a state-of-the-art annotator) in terms of both performance and computation time. Using a diffusion configuration that combines the biological network obtained from the FELLA R package and raw scores, mWISE shows a sensitivity mean (standard deviation) across data sets of 0.63 (0.07), while xMSannotator achieves a sensitivity of 0.55 (0.19). We have also shown that the chemical structures of the compounds proposed by mWISE are closer to the original compounds than those proposed by xMSannotator. Finally, we explore the diffusion prioritization separately, showing its key role in the annotation process. mWISE is freely available on GitHub (https://github.com/b2slab/mWISE) under a GPL license.


Assuntos
Algoritmos , Metabolômica , Cromatografia Líquida , Difusão , Espectrometria de Massas , Software
5.
J Chem Inf Model ; 61(4): 1657-1669, 2021 04 26.
Artigo em Inglês | MEDLINE | ID: mdl-33779173

RESUMO

In silico analysis of biological activity data has become an essential technique in pharmaceutical development. Specifically, the so-called proteochemometric models aim to share information between targets in machine learning ligand-target activity prediction models. However, bioactivity data sets used in proteochemometric modeling are usually imbalanced, which could potentially affect the performance of the models. In this work, we explored the effect of different balancing strategies in deep learning proteochemometric target-compound activity classification models while controlling for the compound series bias through clustering. These strategies were (1) no_resampling, (2) resampling_after_clustering, (3) resampling_before_clustering, and (4) semi_resampling. These schemas were evaluated in kinases, GPCRs, nuclear receptors, and proteases from BindingDB. We observed that the predicted proportion of positives was driven by the actual data balance in the test set. Additionally, it was confirmed that data balance had an impact on the performance estimates of the proteochemometric model. We recommend a combination of data augmentation and clustering in the training set (semi_resampling) to mitigate the data imbalance effect in a realistic scenario. The code of this analysis is publicly available at https://github.com/b2slab/imbalance_pcm_benchmark.


Assuntos
Aprendizado Profundo , Simulação por Computador , Ligantes , Aprendizado de Máquina
6.
PLoS Comput Biol ; 15(9): e1007276, 2019 09.
Artigo em Inglês | MEDLINE | ID: mdl-31479437

RESUMO

In-silico identification of potential target genes for disease is an essential aspect of drug target discovery. Recent studies suggest that successful targets can be found through by leveraging genetic, genomic and protein interaction information. Here, we systematically tested the ability of 12 varied algorithms, based on network propagation, to identify genes that have been targeted by any drug, on gene-disease data from 22 common non-cancerous diseases in OpenTargets. We considered two biological networks, six performance metrics and compared two types of input gene-disease association scores. The impact of the design factors in performance was quantified through additive explanatory models. Standard cross-validation led to over-optimistic performance estimates due to the presence of protein complexes. In order to obtain realistic estimates, we introduced two novel protein complex-aware cross-validation schemes. When seeding biological networks with known drug targets, machine learning and diffusion-based methods found around 2-4 true targets within the top 20 suggestions. Seeding the networks with genes associated to disease by genetics decreased performance below 1 true hit on average. The use of a larger network, although noisier, improved overall performance. We conclude that diffusion-based prioritisers and machine learning applied to diffusion-based features are suited for drug discovery in practice and improve over simpler neighbour-voting methods. We also demonstrate the large impact of choosing an adequate validation strategy and the definition of seed disease genes.


Assuntos
Biologia Computacional/métodos , Simulação por Computador , Descoberta de Drogas/métodos , Algoritmos , Benchmarking , Bases de Dados Genéticas , Doença/genética , Humanos , Aprendizado de Máquina
7.
Bioinformatics ; 34(3): 533-534, 2018 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-29029016

RESUMO

Summary: Label propagation and diffusion over biological networks are a common mathematical formalism in computational biology for giving context to molecular entities and prioritizing novel candidates in the area of study. There are several choices in conceiving the diffusion process-involving the graph kernel, the score definitions and the presence of a posterior statistical normalization-which have an impact on the results. This manuscript describes diffuStats, an R package that provides a collection of graph kernels and diffusion scores, as well as a parallel permutation analysis for the normalized scores, that eases the computation of the scores and their benchmarking for an optimal choice. Availability and implementation: The R package diffuStats is publicly available in Bioconductor, https://bioconductor.org, under the GPL-3 license. Contact: sergi.picart@upc.edu. Supplementary information: Supplementary data are available at Bioinformatics online.


Assuntos
Biologia Computacional/métodos , Software , Redes e Vias Metabólicas , Mapas de Interação de Proteínas , Leveduras/metabolismo
8.
BMC Bioinformatics ; 19(1): 538, 2018 Dec 22.
Artigo em Inglês | MEDLINE | ID: mdl-30577788

RESUMO

BACKGROUND: Pathway enrichment techniques are useful for understanding experimental metabolomics data. Their purpose is to give context to the affected metabolites in terms of the prior knowledge contained in metabolic pathways. However, the interpretation of a prioritized pathway list is still challenging, as pathways show overlap and cross talk effects. RESULTS: We introduce FELLA, an R package to perform a network-based enrichment of a list of affected metabolites. FELLA builds a hierarchical representation of an organism biochemistry from the Kyoto Encyclopedia of Genes and Genomes (KEGG), containing pathways, modules, enzymes, reactions and metabolites. In addition to providing a list of pathways, FELLA reports intermediate entities (modules, enzymes, reactions) that link the input metabolites to them. This sheds light on pathway cross talk and potential enzymes or metabolites as targets for the condition under study. FELLA has been applied to six public datasets -three from Homo sapiens, two from Danio rerio and one from Mus musculus- and has reproduced findings from the original studies and from independent literature. CONCLUSIONS: The R package FELLA offers an innovative enrichment concept starting from a list of metabolites, based on a knowledge graph representation of the KEGG database that focuses on interpretability. Besides reporting a list of pathways, FELLA suggests intermediate entities that are of interest per se. Its usefulness has been shown at several molecular levels on six public datasets, including human and animal models. The user can run the enrichment analysis through a simple interactive graphical interface or programmatically. FELLA is publicly available in Bioconductor under the GPL-3 license.


Assuntos
Biologia Computacional/métodos , Redes e Vias Metabólicas , Metabolômica/métodos , Software , Animais , Gráficos por Computador , Conjuntos de Dados como Assunto , Feminino , Humanos , Malária/metabolismo , Malária/patologia , Camundongos , Modelos Biológicos , Hepatopatia Gordurosa não Alcoólica/metabolismo , Hepatopatia Gordurosa não Alcoólica/patologia , Neoplasias Ovarianas/metabolismo , Neoplasias Ovarianas/patologia , Peixe-Zebra
9.
J Pharm Sci ; 113(4): 880-890, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-37924976

RESUMO

Sub-visible particles can be a quality concern in pharmaceutical products, especially parenteral preparations. To quantify and characterize these particles, liquid samples may be passed through a flow-imaging microscopy instrument that also generates images of each detected particle. Machine learning techniques have increasingly been applied to this kind of data to detect changes in experimental conditions or classify specific types of particles, primarily focusing on silicone oil. That technique generally requires manual labeling of particle images by subject matter experts, a time-consuming and complex task. In this study, we created artificial datasets of silicone oil, protein particles, and glass particles that mimicked complex datasets of particles found in biopharmaceutical products. We used unsupervised learning techniques to effectively describe particle composition by sample. We then trained independent one-class classifiers to detect specific particle populations: silicone oil and glass particles. We also studied the consistency of the particle labels used to evaluate these models. Our results show that one-class classifiers are a reasonable choice for handling heterogeneous flow-imaging microscopy data and that unsupervised learning can aid in the labeling process. However, we found agreement among experts to be rather low, especially for smaller particles (< 8 µm for our Micro-Flow Imaging data). Given the fact that particle label confidence is not usually reported in the literature, we recommend more careful assessment of this topic in the future.


Assuntos
Microscopia , Óleos de Silicone , Microscopia/métodos , Óleos de Silicone/análise , Aprendizado de Máquina , Vidro , Proteínas , Tamanho da Partícula
10.
Front Genet ; 13: 814093, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35360842

RESUMO

Indication expansion aims to find new indications for existing targets in order to accelerate the process of launching a new drug for a disease on the market. The rapid increase in data types and data sources for computational drug discovery has fostered the use of semantic knowledge graphs (KGs) for indication expansion through target centric approaches, or in other words, target repositioning. Previously, we developed a novel method to construct a KG for indication expansion studies, with the aim of finding and justifying alternative indications for a target gene of interest. In contrast to other KGs, ours combines human-curated full-text literature and gene expression data from biomedical databases to encode relationships between genes, diseases, and tissues. Here, we assessed the suitability of our KG for explainable target-disease link prediction using a glass-box approach. To evaluate the predictive power of our KG, we applied shortest path with tissue information- and embedding-based prediction methods to a graph constructed with information published before or during 2010. We also obtained random baselines by applying the shortest path predictive methods to KGs with randomly shuffled node labels. Then, we evaluated the accuracy of the top predictions using gene-disease links reported after 2010. In addition, we investigated the contribution of the KG's tissue expression entity to the prediction performance. Our experiments showed that shortest path-based methods significantly outperform the random baselines and embedding-based methods outperform the shortest path predictions. Importantly, removing the tissue expression entity from the KG severely impacts the quality of the predictions, especially those produced by the embedding approaches. Finally, since the interpretability of the predictions is crucial in indication expansion, we highlight the advantages of our glass-box model through the examination of example candidate target-disease predictions.

11.
Dis Model Mech ; 15(1)2022 01 01.
Artigo em Inglês | MEDLINE | ID: mdl-34845494

RESUMO

Alterations in metabolic pathways were recently recognized as potential underlying drivers of idiopathic pulmonary fibrosis (IPF), translating into novel therapeutic targets. However, knowledge of metabolic and lipid regulation in fibrotic lungs is limited. To comprehensively characterize metabolic perturbations in the bleomycin mouse model of IPF, we analyzed the metabolome and lipidome by mass spectrometry. We identified increased tissue turnover and repair, evident by enhanced breakdown of proteins, nucleic acids and lipids and extracellular matrix turnover. Energy production was upregulated, including glycolysis, the tricarboxylic acid cycle, glutaminolysis, lactate production and fatty acid oxidation. Higher eicosanoid synthesis indicated inflammatory processes. Because the risk of IPF increases with age, we investigated how age influences metabolomic and lipidomic changes in the bleomycin-induced pulmonary fibrosis model. Surprisingly, except for cytidine, we did not detect any significantly differential metabolites or lipids between old and young bleomycin-treated lungs. Together, we identified metabolomic and lipidomic changes in fibrosis that reflect higher energy demand, proliferation, tissue remodeling, collagen deposition and inflammation, which might serve to improve diagnostic and therapeutic options for fibrotic lung diseases in the future.


Assuntos
Bleomicina , Fibrose Pulmonar Idiopática , Animais , Bleomicina/efeitos adversos , Bleomicina/metabolismo , Fibrose , Lipidômica , Pulmão/patologia , Camundongos , Camundongos Endogâmicos C57BL
12.
Environ Toxicol Chem ; 38(5): 965-977, 2019 05.
Artigo em Inglês | MEDLINE | ID: mdl-30702171

RESUMO

The antidepressant amitriptyline is a widely used selective serotonin reuptake inhibitor that is found in the aquatic environment. The present study investigates alterations in the brain and the liver metabolome of gilt-head bream (Sparus aurata) after exposure at an environmentally relevant concentration (0.2 µg/L) of amitriptyline for 7 d. Analysis of variance-simultaneous component analysis is used to identify metabolites that distinguish exposed from control animals. Overall, alterations in lipid metabolism suggest the occurrence of oxidative stress in both the brain and the liver-a common adverse effect of xenobiotics. However, alterations in the amino acid arginine are also observed. These are likely related to the nitric oxide system that is known to be associated with the mechanism of action of antidepressants. In addition, changes in asparagine and methionine levels in the brain and pantothenate, uric acid, and formylisoglutamine/N-formimino-L-glutamate levels in the liver could indicate variation of amino acid metabolism in both tissues; and the perturbation of glutamate in the liver implies that the energy metabolism is also affected. These results reveal that environmentally relevant concentrations of amitriptyline perturb a fraction of the metabolome that is not typically associated with antidepressant exposure in fish. Environ Toxicol Chem 2019;00:1-13. © 2019 SETAC.


Assuntos
Amitriptilina/toxicidade , Monoaminas Biogênicas/metabolismo , Monitoramento Ambiental , Metaboloma , Dourada/metabolismo , Animais , Carnitina/metabolismo , Feminino , Fígado/efeitos dos fármacos , Fígado/metabolismo , Metaboloma/efeitos dos fármacos , Estresse Oxidativo/efeitos dos fármacos , Análise de Componente Principal , Poluentes Químicos da Água/toxicidade
13.
Chemosphere ; 211: 624-631, 2018 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-30098557

RESUMO

The extensive use of the organic UV filter oxybenzone has led to its ubiquitous occurrence in the aquatic environment, causing an ecotoxicological risk to biota. Although some studies reported adverse effects, such as reproductive toxicity, further research needs to be done in order to assess its molecular effects and mechanism of action. Therefore, in the present work, we investigated metabolic perturbations in juvenile gilt-head bream (Sparus aurata) exposed over 14 days via the water to oxybenzone (50 mg/L). The non-targeted analysis of brain, liver and plasma extracts was performed by means of UHPLC-qOrbitrap MS in positive and negative modes with both C18 and HILIC separation. Although there was no mortality or alterations in general physiological parameters during the experiment, and the metabolic profile of brain was not affected, the results of this study showed that oxybenzone could perturb both liver and plasma metabolome. The pathway enrichment suggested that different pathways in lipid metabolism (fatty acid elongation, α-linolenic acid metabolism, biosynthesis of unsaturated fatty acids and fatty acid metabolism) were significantly altered, as well as metabolites involved in phenylalanine and tyrosine metabolism. Overall, these changes are signs of possible oxidative stress and energy metabolism modification. Therefore, this research indicates that oxybenzone has adverse effects beyond the commonly studied hormonal activity, and demonstrates the sensitivity of metabolomics to assess molecular-level effects of emerging contaminants.


Assuntos
Benzofenonas/química , Fígado/metabolismo , Metabolômica/métodos , Animais , Metabolismo Energético , Feminino , Peixes , Metaboloma/efeitos dos fármacos
14.
PLoS One ; 12(12): e0189012, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-29211807

RESUMO

Metabolomics experiments identify metabolites whose abundance varies as the conditions under study change. Pathway enrichment tools help in the identification of key metabolic processes and in building a plausible biological explanation for these variations. Although several methods are available for pathway enrichment using experimental evidence, metabolomics does not yet have a comprehensive overview in a network layout at multiple molecular levels. We propose a novel pathway enrichment procedure for analysing summary metabolomics data based on sub-network analysis in a graph representation of a reference database. Relevant entries are extracted from the database according to statistical measures over a null diffusive process that accounts for network topology and pathway crosstalk. Entries are reported as a sub-pathway network, including not only pathways, but also modules, enzymes, reactions and possibly other compound candidates for further analyses. This provides a richer biological context, suitable for generating new study hypotheses and potential enzymatic targets. Using this method, we report results from cells depleted for an uncharacterised mitochondrial gene using GC and LC-MS data and employing KEGG as a knowledge base. Partial validation is provided with NMR-based tracking of 13C glucose labelling of these cells.


Assuntos
Metabolômica , Modelos Teóricos , Algoritmos , Espectroscopia de Ressonância Magnética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA