Búsqueda | Portal Regional de la BVS

1.

Detecting outliers in case-control cohorts for improving deep learning networks on Schizophrenia prediction.

Martins, Daniel; Abbasi, Maryam; Egas, Conceição; Arrais, Joel P.

J Integr Bioinform ; 2024 Jul 15.

Artículo en Inglés | MEDLINE | ID: mdl-39004922

RESUMEN

This study delves into the intricate genetic and clinical aspects of Schizophrenia, a complex mental disorder with uncertain etiology. Deep Learning (DL) holds promise for analyzing large genomic datasets to uncover new risk factors. However, based on reports of non-negligible misdiagnosis rates for SCZ, case-control cohorts may contain outlying genetic profiles, hindering compelling performances of classification models. The research employed a case-control dataset sourced from the Swedish populace. A gene-annotation-based DL architecture was developed and employed in two stages. First, the model was trained on the entire dataset to highlight differences between cases and controls. Then, samples likely to be misclassified were excluded, and the model was retrained on the refined dataset for performance evaluation. The results indicate that SCZ prevalence and misdiagnosis rates can affect case-control cohorts, potentially compromising future studies reliant on such datasets. However, by detecting and filtering outliers, the study demonstrates the feasibility of adapting DL methodologies to large-scale biological problems, producing results more aligned with existing heritability estimates for SCZ. This approach not only advances the comprehension of the genetic background of SCZ but also opens doors for adapting DL techniques in complex research for precision medicine in mental health.

2.

Ensemble-imbalance-based classification for amyotrophic lateral sclerosis prognostic prediction: identifying short-survival patients at diagnosis.

Papaiz, Fabiano; Dourado, Mario Emílio Teixeira; de Medeiros Valentim, Ricardo Alexsandro; Pinto, Rafael; de Morais, Antônio Higor Freire; Arrais, Joel Perdiz.

BMC Med Inform Decis Mak ; 24(1): 80, 2024 Mar 19.

Artículo en Inglés | MEDLINE | ID: mdl-38504285

RESUMEN

Prognosticating Amyotrophic Lateral Sclerosis (ALS) presents a formidable challenge due to patients exhibiting different onset sites, progression rates, and survival times. In this study, we have developed and evaluated Machine Learning (ML) algorithms that integrate Ensemble and Imbalance Learning techniques to classify patients into Short and Non-Short survival groups based on data collected during diagnosis. We aimed to identify individuals at high risk of mortality within 24 months of symptom onset through analysis of patient data commonly encountered in daily clinical practice. Our Ensemble-Imbalance approach underwent evaluation employing six ML algorithms as base classifiers. Remarkably, our results outperformed those of individual algorithms, achieving a Balanced Accuracy of 88% and a Sensitivity of 96%. Additionally, we used the Shapley Additive Explanations framework to elucidate the decision-making process of the top-performing model, pinpointing the most important features and their correlations with the target prediction. Furthermore, we presented helpful tools to visualize and compare patient similarities, offering valuable insights. Confirming the obtained results, our approach could aid physicians in devising personalized treatment plans at the time of diagnosis or serve as an inclusion/exclusion criterion in clinical trials.

Asunto(s)

Esclerosis Amiotrófica Lateral , Humanos , Esclerosis Amiotrófica Lateral/diagnóstico , Esclerosis Amiotrófica Lateral/tratamiento farmacológico , Pronóstico , Aprendizaje Automático

3.

Predicting drug activity against cancer through genomic profiles and SMILES.

Abbasi, Maryam; Carvalho, Filipa G; Ribeiro, Bernardete; Arrais, Joel P.

Artif Intell Med ; 150: 102820, 2024 Apr.

Artículo en Inglés | MEDLINE | ID: mdl-38553160

RESUMEN

Due to the constant increase in cancer rates, the disease has become a leading cause of death worldwide, enhancing the need for its detection and treatment. In the era of personalized medicine, the main goal is to incorporate individual variability in order to choose more precisely which therapy and prevention strategies suit each person. However, predicting the sensitivity of tumors to anticancer treatments remains a challenge. In this work, we propose two deep neural network models to predict the impact of anticancer drugs in tumors through the half-maximal inhibitory concentration (IC50). These models join biological and chemical data to apprehend relevant features of the genetic profile and the drug compounds, respectively. In order to predict the drug response in cancer cell lines, this study employed different DL methods, resorting to Recurrent Neural Networks (RNNs) and Convolutional Neural Networks (CNNs). In the first stage, two autoencoders were pre-trained with high-dimensional gene expression and mutation data of tumors. Afterward, this genetic background is transferred to the prediction models that return the IC50 value that portrays the potency of a substance in inhibiting a cancer cell line. When comparing RSEM Expected counts and TPM as methods for displaying gene expression data, RSEM has been shown to perform better in deep models and CNNs model can obtain better insight in these types of data. Moreover, the obtained results reflect the effectiveness of the extracted deep representations in the prediction of the IC50 value that portrays the potency of a substance in inhibiting a tumor, achieving a performance of a mean squared error of 1.06 and surpassing previous state-of-the-art models.

Asunto(s)

Perfil Genético , Neoplasias , Humanos , Redes Neurales de la Computación , Neoplasias/tratamiento farmacológico , Neoplasias/genética , Línea Celular , Genómica

4.

Enhancing reinforcement learning for de novo molecular design applying self-attention mechanisms.

Pereira, Tiago O; Abbasi, Maryam; Arrais, Joel P.

Brief Bioinform ; 24(6)2023 09 22.

Artículo en Inglés | MEDLINE | ID: mdl-37903414

RESUMEN

The drug discovery process can be significantly improved by applying deep reinforcement learning (RL) methods that learn to generate compounds with desired pharmacological properties. Nevertheless, RL-based methods typically condense the evaluation of sampled compounds into a single scalar value, making it difficult for the generative agent to learn the optimal policy. This work combines self-attention mechanisms and RL to generate promising molecules. The idea is to evaluate the relative significance of each atom and functional group in their interaction with the target, and to utilize this information for optimizing the Generator. Therefore, the framework for de novo drug design is composed of a Generator that samples new compounds combined with a Transformer-encoder and a biological affinity Predictor that evaluate the generated structures. Moreover, it takes the advantage of the knowledge encapsulated in the Transformer's attention weights to evaluate each token individually. We compared the performance of two output prediction strategies for the Transformer: standard and masked language model (MLM). The results show that the MLM Transformer is more effective in optimizing the Generator compared with the state-of-the-art works. Additionally, the evaluation models identified the most important regions of each molecule for the biological interaction with the target. As a case study, we generated synthesizable hit compounds that can be putative inhibitors of the enzyme ubiquitin-specific protein 7 (USP7).

Asunto(s)

Diseño de Fármacos , Aprendizaje , Descubrimiento de Drogas

5.

Artificial intelligence for prediction of biological activities and generation of molecular hits using stereochemical information.

Pereira, Tiago O; Abbasi, Maryam; Oliveira, Rita I; Guedes, Romina A; Salvador, Jorge A R; Arrais, Joel P.

J Comput Aided Mol Des ; 37(12): 791-806, 2023 12.

Artículo en Inglés | MEDLINE | ID: mdl-37847342

RESUMEN

In this work, we develop a method for generating targeted hit compounds by applying deep reinforcement learning and attention mechanisms to predict binding affinity against a biological target while considering stereochemical information. The novelty of this work is a deep model Predictor that can establish the relationship between chemical structures and their corresponding [Formula: see text] values. We thoroughly study the effect of different molecular descriptors such as ECFP4, ECFP6, SMILES and RDKFingerprint. Also, we demonstrated the importance of attention mechanisms to capture long-range dependencies in molecular sequences. Due to the importance of stereochemical information for the binding mechanism, this information was employed both in the prediction and generation processes. To identify the most promising hits, we apply the self-adaptive multi-objective optimization strategy. Moreover, to ensure the existence of stereochemical information, we consider all the possible enumerated stereoisomers to provide the most appropriate 3D structures. We evaluated this approach against the Ubiquitin-Specific Protease 7 (USP7) by generating putative inhibitors for this target. The predictor with SMILES notations as descriptor plus bidirectional recurrent neural network using attention mechanism has the best performance. Additionally, our methodology identify the regions of the generated molecules that are important for the interaction with the receptor's active site. Also, the obtained results demonstrate that it is possible to discover synthesizable molecules with high biological affinity for the target, containing the indication of their optimal stereochemical conformation.

Asunto(s)

Inteligencia Artificial , Diseño de Fármacos , Redes Neurales de la Computación , Estructura Molecular

6.

FSM-DDTR: End-to-end feedback strategy for multi-objective De Novo drug design using transformers.

Monteiro, Nelson R C; Pereira, Tiago O; Machado, Ana Catarina D; Oliveira, José L; Abbasi, Maryam; Arrais, Joel P.

Comput Biol Med ; 164: 107285, 2023 09.

Artículo en Inglés | MEDLINE | ID: mdl-37557054

RESUMEN

The design of compounds that target specific biological functions with relevant selectivity is critical in the context of drug discovery, especially due to the polypharmacological nature of most existing drug molecules. In recent years, in silico-based methods combined with deep learning have shown promising results in the de novo drug design challenge, leading to potential leads for biologically interesting targets. However, several of these methods overlook the importance of certain properties, such as validity rate and target selectivity, or simplify the generative process by neglecting the multi-objective nature of the pharmacological space. In this study, we propose a multi-objective Transformer-based architecture to generate drug candidates with desired molecular properties and increased selectivity toward a specific biological target. The framework consists of a Transformer-Decoder Generator that generates novel and valid compounds in the SMILES format notation, a Transformer-Encoder Predictor that estimates the binding affinity toward the biological target, and a feedback loop combined with a multi-objective optimization strategy to rank the generated molecules and condition the generating distribution around the targeted properties. The results demonstrate that the proposed architecture can generate novel and synthesizable small compounds with desired pharmacological properties toward a biologically relevant target. The unbiased Transformer-based Generator achieved superior performance in the novelty rate (97.38%) and comparable performance in terms of internal diversity, uniqueness, and validity against state-of-the-art baselines. The optimization of the unbiased Transformer-based Generator resulted in the generation of molecules exhibiting high binding affinity toward the Adenosine A2A Receptor (AA2AR) and possessing desirable physicochemical properties, where 99.36% of the generated molecules follow Lipinski's rule of five. Furthermore, the implementation of a feedback strategy, in conjunction with a multi-objective algorithm, effectively shifted the distribution of the generated molecules toward optimal values of molecular weight, molecular lipophilicity, topological polar surface area, synthetic accessibility score, and quantitative estimate of drug-likeness, without the necessity of prior training sets comprising molecules endowed with pharmacological properties of interest. Overall, this research study validates the applicability of a Transformer-based architecture in the context of drug design, capable of exploring the vast chemical representation space to generate novel molecules with improved pharmacological properties and target selectivity. The data and source code used in this study are available at: https://github.com/larngroup/FSM-DDTR.

Asunto(s)

Diseño de Fármacos , Descubrimiento de Drogas , Retroalimentación , Algoritmos , Programas Informáticos

7.

Correction to: Designing optimized drug candidates with Generative Adversarial Network.

Abbasi, Maryam; Santos, Beatriz P; Pereira, Tiago C; Sofa, Raul; Monteiro, Nelson R C; Simões, Carlos J V; Brito, Rui M M; Ribeiro, Bernardete; Oliveira, José L; Arrais, Joel P.

J Cheminform ; 14(1): 53, 2022 Aug 11.

Artículo en Inglés | MEDLINE | ID: mdl-35953869

8.

Deep generative model for therapeutic targets using transcriptomic disease-associated data-USP7 case study.

Pereira, Tiago; Abbasi, Maryam; Oliveira, Rita I; Guedes, Romina A; Salvador, Jorge A R; Arrais, Joel P.

Brief Bioinform ; 23(4)2022 07 18.

Artículo en Inglés | MEDLINE | ID: mdl-35789255

RESUMEN

The generation of candidate hit molecules with the potential to be used in cancer treatment is a challenging task. In this context, computational methods based on deep learning have been employed to improve in silico drug design methodologies. Nonetheless, the applied strategies have focused solely on the chemical aspect of the generation of compounds, disregarding the likely biological consequences for the organism's dynamics. Herein, we propose a method to implement targeted molecular generation that employs biological information, namely, disease-associated gene expression data, to conduct the process of identifying interesting hits. When applied to the generation of USP7 putative inhibitors, the framework managed to generate promising compounds, with more than 90% of them containing drug-like properties and essential active groups for the interaction with the target. Hence, this work provides a novel and reliable method for generating new promising compounds focused on the biological context of the disease.

Asunto(s)

Diseño de Fármacos , Transcriptoma , Peptidasa Específica de Ubiquitina 7

9.

DTITR: End-to-end drug-target binding affinity prediction with transformers.

Monteiro, Nelson R C; Oliveira, José L; Arrais, Joel P.

Comput Biol Med ; 147: 105772, 2022 08.

Artículo en Inglés | MEDLINE | ID: mdl-35777085

RESUMEN

The accurate identification of Drug-Target Interactions (DTIs) remains a critical turning point in drug discovery and understanding of the binding process. Despite recent advances in computational solutions to overcome the challenges of in vitro and in vivo experiments, most of the proposed in silico-based methods still focus on binary classification, overlooking the importance of characterizing DTIs with unbiased binding strength values to properly distinguish primary interactions from those with off-targets. Moreover, several of these methods usually simplify the entire interaction mechanism, neglecting the joint contribution of the individual units of each binding component and the interacting substructures involved, and have yet to focus on more explainable and interpretable architectures. In this study, we propose an end-to-end Transformer-based architecture for predicting drug-target binding affinity (DTA) using 1D raw sequential and structural data to represent the proteins and compounds. This architecture exploits self-attention layers to capture the biological and chemical context of the proteins and compounds, respectively, and cross-attention layers to exchange information and capture the pharmacological context of the DTIs. The results show that the proposed architecture is effective in predicting DTA, achieving superior performance in both correctly predicting the value of interaction strength and being able to correctly discriminate the rank order of binding strength compared to state-of-the-art baselines. The combination of multiple Transformer-Encoders was found to result in robust and discriminative aggregate representations of the proteins and compounds for binding affinity prediction, in which the addition of a Cross-Attention Transformer-Encoder was identified as an important block for improving the discriminative power of these representations. Overall, this research study validates the applicability of an end-to-end Transformer-based architecture in the context of drug discovery, capable of self-providing different levels of potential DTI and prediction understanding due to the nature of the attention blocks. The data and source code used in this study are available at: https://github.com/larngroup/DTITR.

Asunto(s)

Proteínas , Programas Informáticos , Desarrollo de Medicamentos , Descubrimiento de Drogas/métodos , Proteínas/química

10.

Designing optimized drug candidates with Generative Adversarial Network.

Abbasi, Maryam; Santos, Beatriz P; Pereira, Tiago C; Sofia, Raul; Monteiro, Nelson R C; Simões, Carlos J V; Brito, Rui M M; Ribeiro, Bernardete; Oliveira, José L; Arrais, Joel P.

J Cheminform ; 14(1): 40, 2022 Jun 26.

Artículo en Inglés | MEDLINE | ID: mdl-35754029

RESUMEN

Drug design is an important area of study for pharmaceutical businesses. However, low efficacy, off-target delivery, time consumption, and high cost are challenges and can create barriers that impact this process. Deep Learning models are emerging as a promising solution to perform de novo drug design, i.e., to generate drug-like molecules tailored to specific needs. However, stereochemistry was not explicitly considered in the generated molecules, which is inevitable in targeted-oriented molecules. This paper proposes a framework based on Feedback Generative Adversarial Network (GAN) that includes optimization strategy by incorporating Encoder-Decoder, GAN, and Predictor deep models interconnected with a feedback loop. The Encoder-Decoder converts the string notations of molecules into latent space vectors, effectively creating a new type of molecular representation. At the same time, the GAN can learn and replicate the training data distribution and, therefore, generate new compounds. The feedback loop is designed to incorporate and evaluate the generated molecules according to the multiobjective desired property at every epoch of training to ensure a steady shift of the generated distribution towards the space of the targeted properties. Moreover, to develop a more precise set of molecules, we also incorporate a multiobjective optimization selection technique based on a non-dominated sorting genetic algorithm. The results demonstrate that the proposed framework can generate realistic, novel molecules that span the chemical space. The proposed Encoder-Decoder model correctly reconstructs 99% of the datasets, including stereochemical information. The model's ability to find uncharted regions of the chemical space was successfully shown by optimizing the unbiased GAN to generate molecules with a high binding affinity to the Kappa Opioid and Adenosine [Formula: see text] receptor. Furthermore, the generated compounds exhibit high internal and external diversity levels 0.88 and 0.94, respectively, and uniqueness.

11.

Explainable deep drug-target representations for binding affinity prediction.

Monteiro, Nelson R C; Simões, Carlos J V; Ávila, Henrique V; Abbasi, Maryam; Oliveira, José L; Arrais, Joel P.

BMC Bioinformatics ; 23(1): 237, 2022 Jun 17.

Artículo en Inglés | MEDLINE | ID: mdl-35715734

RESUMEN

BACKGROUND: Several computational advances have been achieved in the drug discovery field, promoting the identification of novel drug-target interactions and new leads. However, most of these methodologies have been overlooking the importance of providing explanations to the decision-making process of deep learning architectures. In this research study, we explore the reliability of convolutional neural networks (CNNs) at identifying relevant regions for binding, specifically binding sites and motifs, and the significance of the deep representations extracted by providing explanations to the model's decisions based on the identification of the input regions that contributed the most to the prediction. We make use of an end-to-end deep learning architecture to predict binding affinity, where CNNs are exploited in their capacity to automatically identify and extract discriminating deep representations from 1D sequential and structural data. RESULTS: The results demonstrate the effectiveness of the deep representations extracted from CNNs in the prediction of drug-target interactions. CNNs were found to identify and extract features from regions relevant for the interaction, where the weight associated with these spots was in the range of those with the highest positive influence given by the CNNs in the prediction. The end-to-end deep learning model achieved the highest performance both in the prediction of the binding affinity and on the ability to correctly distinguish the interaction strength rank order when compared to baseline approaches. CONCLUSIONS: This research study validates the potential applicability of an end-to-end deep learning architecture in the context of drug discovery beyond the confined space of proteins and ligands with determined 3D structure. Furthermore, it shows the reliability of the deep representations extracted from the CNNs by providing explainability to the decision-making process.

Asunto(s)

Redes Neurales de la Computación , Proteínas , Sitios de Unión , Extractos Vegetales , Proteínas/química , Reproducibilidad de los Resultados

12.

The Road to Personalized Medicine in Alzheimer's Disease: The Use of Artificial Intelligence.

Silva-Spínola, Anuschka; Baldeiras, Inês; Arrais, Joel P; Santana, Isabel.

Biomedicines ; 10(2)2022 Jan 29.

Artículo en Inglés | MEDLINE | ID: mdl-35203524

RESUMEN

Dementia remains an extremely prevalent syndrome among older people and represents a major cause of disability and dependency. Alzheimer's disease (AD) accounts for the majority of dementia cases and stands as the most common neurodegenerative disease. Since age is the major risk factor for AD, the increase in lifespan not only represents a rise in the prevalence but also adds complexity to the diagnosis. Moreover, the lack of disease-modifying therapies highlights another constraint. A shift from a curative to a preventive approach is imminent and we are moving towards the application of personalized medicine where we can shape the best clinical intervention for an individual patient at a given point. This new step in medicine requires the most recent tools and analysis of enormous amounts of data where the application of artificial intelligence (AI) plays a critical role on the depiction of disease-patient dynamics, crucial in reaching early/optimal diagnosis, monitoring and intervention. Predictive models and algorithms are the key elements in this innovative field. In this review, we present an overview of relevant topics regarding the application of AI in AD, detailing the algorithms and their applications in the fields of drug discovery, and biomarkers.

13.

Optimizing blood-brain barrier permeation through deep reinforcement learning for de novo drug design.

Pereira, Tiago; Abbasi, Maryam; Oliveira, José Luis; Ribeiro, Bernardete; Arrais, Joel.

Bioinformatics ; 37(Suppl_1): i84-i92, 2021 07 12.

Artículo en Inglés | MEDLINE | ID: mdl-34252946

RESUMEN

MOTIVATION: The process of placing new drugs into the market is time-consuming, expensive and complex. The application of computational methods for designing molecules with bespoke properties can contribute to saving resources throughout this process. However, the fundamental properties to be optimized are often not considered or conflicting with each other. In this work, we propose a novel approach to consider both the biological property and the bioavailability of compounds through a deep reinforcement learning framework for the targeted generation of compounds. We aim to obtain a promising set of selective compounds for the adenosine A2A receptor and, simultaneously, that have the necessary properties in terms of solubility and permeability across the blood-brain barrier to reach the site of action. The cornerstone of the framework is based on a recurrent neural network architecture, the Generator. It seeks to learn the building rules of valid molecules to sample new compounds further. Also, two Predictors are trained to estimate the properties of interest of the new molecules. Finally, the fine-tuning of the Generator was performed with reinforcement learning, integrated with multi-objective optimization and exploratory techniques to ensure that the Generator is adequately biased. RESULTS: The biased Generator can generate an interesting set of molecules, with approximately 85% having the two fundamental properties biased as desired. Thus, this approach has transformed a general molecule generator into a model focused on optimizing specific objectives. Furthermore, the molecules' synthesizability and drug-likeness demonstrate the potential applicability of the de novo drug design in medicinal chemistry. AVAILABILITY AND IMPLEMENTATION: All code is publicly available in the https://github.com/larngroup/De-Novo-Drug-Design. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Asunto(s)

Barrera Hematoencefálica , Diseño de Fármacos , Transporte Biológico , Redes Neurales de la Computación

14.

Diversity oriented Deep Reinforcement Learning for targeted molecule generation.

Pereira, Tiago; Abbasi, Maryam; Ribeiro, Bernardete; Arrais, Joel P.

J Cheminform ; 13(1): 21, 2021 Mar 09.

Artículo en Inglés | MEDLINE | ID: mdl-33750461

RESUMEN

In this work, we explore the potential of deep learning to streamline the process of identifying new potential drugs through the computational generation of molecules with interesting biological properties. Two deep neural networks compose our targeted generation framework: the Generator, which is trained to learn the building rules of valid molecules employing SMILES strings notation, and the Predictor which evaluates the newly generated compounds by predicting their affinity for the desired target. Then, the Generator is optimized through Reinforcement Learning to produce molecules with bespoken properties. The innovation of this approach is the exploratory strategy applied during the reinforcement training process that seeks to add novelty to the generated compounds. This training strategy employs two Generators interchangeably to sample new SMILES: the initially trained model that will remain fixed and a copy of the previous one that will be updated during the training to uncover the most promising molecules. The evolution of the reward assigned by the Predictor determines how often each one is employed to select the next token of the molecule. This strategy establishes a compromise between the need to acquire more information about the chemical space and the need to sample new molecules, with the experience gained so far. To demonstrate the effectiveness of the method, the Generator is trained to design molecules with an optimized coefficient of partition and also high inhibitory power against the Adenosine [Formula: see text] and [Formula: see text] opioid receptors. The results reveal that the model can effectively adjust the newly generated molecules towards the wanted direction. More importantly, it was possible to find promising sets of unique and diverse molecules, which was the main purpose of the newly implemented strategy.

15.

Drug-Target Interaction Prediction: End-to-End Deep Learning Approach.

Monteiro, Nelson R C; Ribeiro, Bernardete; Arrais, Joel P.

IEEE/ACM Trans Comput Biol Bioinform ; 18(6): 2364-2374, 2021.

Artículo en Inglés | MEDLINE | ID: mdl-32142454

RESUMEN

The discovery of potential Drug-Target Interactions (DTIs) is a determining step in the drug discovery and repositioning process, as the effectiveness of the currently available antibiotic treatment is declining. Although putting efforts on the traditional in vivo or in vitro methods, pharmaceutical financial investment has been reduced over the years. Therefore, establishing effective computational methods is decisive to find new leads in a reasonable amount of time. Successful approaches have been presented to solve this problem but seldom protein sequences and structured data are used together. In this paper, we present a deep learning architecture model, which exploits the particular ability of Convolutional Neural Networks (CNNs) to obtain 1D representations from protein sequences (amino acid sequence) and compounds SMILES (Simplified Molecular Input Line Entry System) strings. These representations can be interpreted as features that express local dependencies or patterns that can then be used in a Fully Connected Neural Network (FCNN), acting as a binary classifier. The results achieved demonstrate that using CNNs to obtain representations of the data, instead of the traditional descriptors, lead to improved performance. The proposed end-to-end deep learning method outperformed traditional machine learning approaches in the correct classification of both positive and negative interactions.

Asunto(s)

Biología Computacional/métodos , Aprendizaje Profundo , Descubrimiento de Drogas/métodos , Reposicionamiento de Medicamentos/métodos , Algoritmos , Secuencia de Aminoácidos , Humanos , Aprendizaje Automático , Redes Neurales de la Computación , Preparaciones Farmacéuticas/química , Preparaciones Farmacéuticas/metabolismo , Proteínas/química , Proteínas/metabolismo

16.

CroP-Coordinated Panel visualization for biological networks analysis.

Cruz, António; Machado, Penousal; Arrais, Joel P.

Bioinformatics ; 36(4): 1298-1299, 2020 02 15.

Artículo en Inglés | MEDLINE | ID: mdl-31504214

RESUMEN

SUMMARY: CroP is a data visualization application that focuses on the analysis of relational data that changes over time. While it was specifically designed for addressing the preeminent need to interpret large scale time series from gene expression studies, CroP is prepared to analyze datasets from multiple contexts. Multiple datasets can be uploaded simultaneously and viewed through dynamic visualization models, which are contained within flexible panels that allow users to adapt the workspace to their data. Through clustering and the time curve visualization it is possible to quickly identify groups of data points with similar proprieties or behaviors, as well as temporal patterns across all points, such as periodic waves of expression. Additionally, it integrates a public biomedical database for gene annotation. CroP will be of major interest to biologists who seek to extract relations from complex sets of data. AVAILABILITY AND IMPLEMENTATION: CroP is freely available for download as an executable jar at https://cdv.dei.uc.pt/crop/.

Asunto(s)

Programas Informáticos , Análisis por Conglomerados , Bases de Datos Factuales , Expresión Génica , Anotación de Secuencia Molecular

17.

Handling Noise in Protein Interaction Networks.

Correia, Fernanda B; Coelho, Edgar D; Oliveira, José L; Arrais, Joel P.

Biomed Res Int ; 2019: 8984248, 2019.

Artículo en Inglés | MEDLINE | ID: mdl-31828144

RESUMEN

Protein-protein interactions (PPIs) can be conveniently represented as networks, allowing the use of graph theory for their study. Network topology studies may reveal patterns associated with specific organisms. Here, we propose a new methodology to denoise PPI networks and predict missing links solely based on the network topology, the organization measurement (OM) method. The OM methodology was applied in the denoising of the PPI networks of two Saccharomyces cerevisiae datasets (Yeast and CS2007) and one Homo sapiens dataset (Human). To evaluate the denoising capabilities of the OM methodology, two strategies were applied. The first strategy compared its application in random networks and in the reference set networks, while the second strategy perturbed the networks with the gradual random addition and removal of edges. The application of the OM methodology to the Yeast and Human reference sets achieved an AUC of 0.95 and 0.87, in Yeast and Human networks, respectively. The random removal of 80% of the Yeast and Human reference set interactions resulted in an AUC of 0.71 and 0.62, whereas the random addition of 80% interactions resulted in an AUC of 0.75 and 0.72, respectively. Applying the OM methodology to the CS2007 dataset yields an AUC of 0.99. We also perturbed the network of the CS2007 dataset by randomly inserting and removing edges in the same proportions previously described. The false positives identified and removed from the network varied from 97%, when inserting 20% more edges, to 89%, when 80% more edges were inserted. The true positives identified and inserted in the network varied from 95%, when removing 20% of the edges, to 40%, after the random deletion of 80% edges. The OM methodology is sensitive to the topological structure of the biological networks. The obtained results suggest that the present approach can efficiently be used to denoise PPI networks.

Asunto(s)

Biología Computacional/métodos , Mapeo de Interacción de Proteínas/métodos , Mapas de Interacción de Proteínas , Área Bajo la Curva , Bases de Datos de Proteínas , Humanos , Proteínas de Saccharomyces cerevisiae

18.

Interactive and coordinated visualization approaches for biological data analysis.

Cruz, António; Arrais, Joel P; Machado, Penousal.

Brief Bioinform ; 20(4): 1513-1523, 2019 07 19.

Artículo en Inglés | MEDLINE | ID: mdl-29590305

RESUMEN

The field of computational biology has become largely dependent on data visualization tools to analyze the increasing quantities of data gathered through the use of new and growing technologies. Aside from the volume, which often results in large amounts of noise and complex relationships with no clear structure, the visualization of biological data sets is hindered by their heterogeneity, as data are obtained from different sources and contain a wide variety of attributes, including spatial and temporal information. This requires visualization approaches that are able to not only represent various data structures simultaneously but also provide exploratory methods that allow the identification of meaningful relationships that would not be perceptible through data analysis algorithms alone. In this article, we present a survey of visualization approaches applied to the analysis of biological data. We focus on graph-based visualizations and tools that use coordinated multiple views to represent high-dimensional multivariate data, in particular time series gene expression, protein-protein interaction networks and biological pathways. We then discuss how these methods can be used to help solve the current challenges surrounding the visualization of complex biological data sets.

Asunto(s)

Biología Computacional/métodos , Análisis de Datos , Algoritmos , Animales , Gráficos por Computador/estadística & datos numéricos , Interpretación Estadística de Datos , Perfilación de la Expresión Génica/estadística & datos numéricos , Humanos , Modelos Biológicos , Análisis Multivariante , Mapas de Interacción de Proteínas , Interfaz Usuario-Computador

19.

SalivaPRINT Toolkit - Protein profile evaluation and phenotype stratification.

Cruz, Igor; Esteves, Eduardo; Fernandes, Mónica; Rosa, Nuno; Correia, Maria José; Arrais, Joel P; Barros, Marlene.

J Proteomics ; 171: 81-86, 2018 01 16.

Artículo en Inglés | MEDLINE | ID: mdl-28843534

RESUMEN

The value of the molecular information obtained from saliva is dependent on the use of in vitro and in silico techniques. The main proteins of saliva when separated by capillary electrophoresis enable the establishment of individual profiles with characteristic patterns reflecting each individual phenotype. Different physiological or pathological conditions may be identified by specific protein profiles. The association of each profile to the particular protein composition provides clues as to which biological processes are compromised in each situation. Patient stratification according to different phenotypes often within a particular disease spectrum is especially important for the management of individuals carrying multiple diseases and requiring personalized interventions. In this work we present the SalivaPRINT Toolkit, which enables the analysis of protein profile patterns and patient phenotyping. Additionally, the SalivaPRINT Toolkit allows the identification of molecular weight ranges altered in a particular condition and therefore potentially involved in the underlying dysregulated mechanisms. This tutorial introduces the use of the SalivaPRINT Toolkit command line interface (https://github.com/salivatec/SalivaPRINT) as an independent tool for electrophoretic protein profile evaluation. It provides a detailed overview of its functionalities, illustrated by the application to the analysis of profiles obtained from a healthy population versus a population affected with inflammatory conditions. BIOLOGICAL SIGNIFICANCE: We present SalivaPRINT, which serves as a patient characterization tool to identify molecular weights related with particular conditions and, from there, find proteins, which may be involved in the underlying dysregulated cellular mechanisms. The proposed analysis strategy has the potential to boost personalized diagnosis. To our knowledge this is the first independent tool for electrophoretic protein profile evaluation and is crucial when a large number of complex electrophoretic profiles needs to be compared and classified.

Asunto(s)

Biología Computacional/métodos , Proteoma/metabolismo , Saliva/metabolismo , Proteínas y Péptidos Salivales/metabolismo , Programas Informáticos , Enfermedad Celíaca/metabolismo , Bases de Datos de Proteínas , Humanos , Inflamación/metabolismo , Aprendizaje Automático , Peso Molecular , Fenotipo , Proteoma/clasificación

20.

New Targets for Zika Virus Determined by Human-Viral Interactomic: A Bioinformatics Approach.

Esteves, Eduardo; Rosa, Nuno; Correia, Maria José; Arrais, Joel P; Barros, Marlene.

Biomed Res Int ; 2017: 1734151, 2017.

Artículo en Inglés | MEDLINE | ID: mdl-29379794

RESUMEN

Identifying ZIKV factors interfering with human host pathways represents a major challenge in understanding ZIKV tropism and pathogenesis. The integration of proteomic, gene expression and Protein-Protein Interactions (PPIs) established between ZIKV and human host proteins predicted by the OralInt algorithm identified 1898 interactions with medium or high score (≥0.7). Targets implicated in vesicular traffic and docking were identified. New receptors involved in endocytosis pathways as ZIKV entry targets, using both clathrin-dependent (17 receptors) and independent (10 receptors) pathways, are described. New targets used by the ZIKV to undermine the host's antiviral immune response are proposed based on predicted interactions established between the virus and host cell receptors and/or proteins with an effector or signaling role in the immune response such as IFN receptors and TLR. Complement and cytokines are proposed as extracellular potential interacting partners of the secreted form of NS1 ZIKV protein. Altogether, in this article, 18 new human targets for structural and nonstructural ZIKV proteins are proposed. These results are of great relevance for the understanding of viral pathogenesis and consequently the development of preventive (vaccines) and therapeutic targets for ZIKV infection management.

Asunto(s)

Biología Computacional , Modelos Inmunológicos , Proteínas Virales/inmunología , Infección por el Virus Zika/inmunología , Virus Zika/inmunología , Femenino , Humanos , Masculino , Vacunas Virales/inmunología , Infección por el Virus Zika/patología , Infección por el Virus Zika/prevención & control

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA