Pesquisa | Biblioteca Virtual em Saúde

Polymer informatics for QSPR prediction of tensile mechanical properties. Case study: Strength at break.

Cravero, Fiorella; Díaz, Mónica F; Ponzoni, Ignacio.

J Chem Phys ; 156(20): 204903, 2022 May 28.

Artigo em Inglês | MEDLINE | ID: mdl-35649865

RESUMO

The artificial intelligence-based prediction of the mechanical properties derived from the tensile test plays a key role in assessing the application profile of new polymeric materials, especially in the design stage, prior to synthesis. This strategy saves time and resources when creating new polymers with improved properties that are increasingly demanded by the market. A quantitative structure-property relationship (QSPR) model for tensile strength at break is presented in this work. The QSPR methodology applied here is based on machine learning tools, visual analytics methods, and expert-in-the-loop strategies. From the whole study, a QSPR model composed of five molecular descriptors that achieved a correlation coefficient of 0.9226 is proposed. We applied visual analytics tools at two levels of analysis: a more general one in which models are discarded for redundant information metrics and a deeper one in which a chemistry expert can make decisions on the composition of the model in terms of subsets of molecular descriptors, from a physical-chemical point of view. In this way, with the present work, we close a contribution cycle to polymer informatics, providing QSPR models oriented to the prediction of mechanical properties related to the tensile test.

Assuntos

Inteligência Artificial , Polímeros , Informática , Polímeros/química , Relação Quantitativa Estrutura-Atividade

Feature Selection for Polymer Informatics: Evaluating Scalability and Robustness of the FS4RV_DD Algorithm Using Synthetic Polydisperse Data Sets.

Cravero, Fiorella; Schustik, Santiago A; Martínez, M Jimena; Vázquez, Gustavo E; Díaz, Mónica F; Ponzoni, Ignacio.

J Chem Inf Model ; 60(2): 592-603, 2020 02 24.

Artigo em Inglês | MEDLINE | ID: mdl-31790226

RESUMO

The feature selection (FS) process is a key step in the Quantitative Structure-Property Relationship (QSPR) modeling of physicochemical properties in cheminformatics. In particular, the inference of QSPR models for polymeric material properties constitutes a complex problem because of the uncertainty introduced by the polydispersity of these materials. The main challenge is how to capture the polydispersity information from the molecular weight distribution (MWD) curve to achieve a more effective computational representation of polymeric materials. To date, most of the existing QSPR techniques use only a single molecule to represent each of these materials, but polydispersity is not considered. Consequently, QSPR models obtained by these approaches are being oversimplified. For this reason, we introduced in a previous work a new FS algorithm called Feature Selection for Random Variables with Discrete Distribution (FS4RVDD), which allows dealing with polydisperse data. In the present paper, we evaluate both the scalability and the robustness of the FS4RVDD algorithm. In this sense, we generated synthetic data by varying and combining different parameters: the size of the database, the cardinality of the selected feature subsets, the presence of noise in the data, and the type of correlation (linear and nonlinear). Moreover, the performances obtained by FS4RVDD were contrasted with traditional FS techniques applied to different simplified representations of polymeric materials. The obtained results show that the FS4RVDD algorithm outperformed the traditional FS methods in all proposed scenarios, which suggest the need of an algorithm such as FS4RVDD to deal with the uncertainty that polydispersity introduces in human-made polymers.

Assuntos

Algoritmos , Polímeros/química , Modelos Moleculares , Conformação Molecular , Peso Molecular , Relação Quantitativa Estrutura-Atividade

QSPR models for predicting log P(liver) values for volatile organic compounds combining statistical methods and domain knowledge.

Palomba, Damián; Martínez, María J; Ponzoni, Ignacio; Díaz, Mónica F; Vazquez, Gustavo E; Soto, Axel J.

Molecules ; 17(12): 14937-53, 2012 Dec 17.

Artigo em Inglês | MEDLINE | ID: mdl-23247367

RESUMO

Volatile organic compounds (VOCs) are contained in a variety of chemicals that can be found in household products and may have undesirable effects on health. Thereby, it is important to model blood-to-liver partition coefficients (log P(liver)) for VOCs in a fast and inexpensive way. In this paper, we present two new quantitative structure-property relationship (QSPR) models for the prediction of log P(liver), where we also propose a hybrid approach for the selection of the descriptors. This hybrid methodology combines a machine learning method with a manual selection based on expert knowledge. This allows obtaining a set of descriptors that is interpretable in physicochemical terms. Our regression models were trained using decision trees and neural networks and validated using an external test set. Results show high prediction accuracy compared to previous log P(liver) models, and the descriptor selection approach provides a means to get a small set of descriptors that is in agreement with theoretical understanding of the target property.

Assuntos

Gases , Modelos Teóricos , Relação Quantitativa Estrutura-Atividade , Compostos Orgânicos Voláteis , Animais , Inteligência Artificial , Gases/química , Gases/toxicidade , Humanos , Fígado/efeitos dos fármacos , Ratos , Compostos Orgânicos Voláteis/química , Compostos Orgânicos Voláteis/toxicidade

Vehicular fleet characterisation and assessment of the on-road mobile source emission inventory of a Latin American intermediate city.

Grassi, Yamila S; Brignole, Nélida B; Díaz, Mónica F.

Sci Total Environ ; 792: 148255, 2021 Oct 20.

Artigo em Inglês | MEDLINE | ID: mdl-34144242

RESUMO

The management of urban mobile-source emissions is nowadays a challenging topic. This paper describes a comprehensive and practical analysis of the vehicular fleet characterisation and the 2018 on-road mobile source emission inventory for Bahía Blanca (Argentina), a medium-size Latin American city. An exhaustive segmentation of the vehicle fleet was done to obtain more real results from the emission inventory carried out by COPERT software. Results for 2018, allow us to conclude that motorcycles were the main source of CO, NMVOC, CO2 and CH4. While, light commercial vehicles were the ones that emitted the most amount of NOx. Finally, it was concluded that the polluting emissions observed in 2013 are higher than the 2018's ones, in spite of the growth of the vehicular fleet (12.45% in 2018 respect to 2013). This may mainly be due to the incorporation of a new, more efficient emission control technology in vehicular fleet of 2018. However, these improvements result in increased GHGs emissions, which is still a challenge in this area. Finally, the main trends for vehicle flow and emissions detected in 2020 are presented.

Assuntos

Poluentes Atmosféricos , Emissões de Veículos , Poluentes Atmosféricos/análise , Cidades , Monitoramento Ambiental , América Latina , Emissões de Veículos/análise

QSAR Classification Models for Predicting the Activity of Inhibitors of Beta-Secretase (BACE1) Associated with Alzheimer's Disease.

Ponzoni, Ignacio; Sebastián-Pérez, Víctor; Martínez, María J; Roca, Carlos; De la Cruz Pérez, Carlos; Cravero, Fiorella; Vazquez, Gustavo E; Páez, Juan A; Díaz, Mónica F; Campillo, Nuria E.

Sci Rep ; 9(1): 9102, 2019 06 24.

Artigo em Inglês | MEDLINE | ID: mdl-31235739

RESUMO

Alzheimer's disease is one of the most common neurodegenerative disorders in elder population. The ß-site amyloid cleavage enzyme 1 (BACE1) is the major constituent of amyloid plaques and plays a central role in this brain pathogenesis, thus it constitutes an auspicious pharmacological target for its treatment. In this paper, a QSAR model for identification of potential inhibitors of BACE1 protein is designed by using classification methods. For building this model, a database with 215 molecules collected from different sources has been assembled. This dataset contains diverse compounds with different scaffolds and physical-chemical properties, covering a wide chemical space in the drug-like range. The most distinctive aspect of the applied QSAR strategy is the combination of hybridization with backward elimination of models, which contributes to improve the quality of the final QSAR model. Another relevant step is the visual analysis of the molecular descriptors that allows guaranteeing the absence of information redundancy in the model. The QSAR model performances have been assessed by traditional metrics, and the final proposed model has low cardinality, and reaches a high percentage of chemical compounds correctly classified.

Assuntos

Doença de Alzheimer/tratamento farmacológico , Secretases da Proteína Precursora do Amiloide/antagonistas & inibidores , Inibidores de Proteases/química , Inibidores de Proteases/farmacologia , Relação Quantitativa Estrutura-Atividade , Doença de Alzheimer/enzimologia , Simulação por Computador , Aprendizado de Máquina , Inibidores de Proteases/uso terapêutico

Hybridizing Feature Selection and Feature Learning Approaches in QSAR Modeling for Drug Discovery.

Ponzoni, Ignacio; Sebastián-Pérez, Víctor; Requena-Triguero, Carlos; Roca, Carlos; Martínez, María J; Cravero, Fiorella; Díaz, Mónica F; Páez, Juan A; Arrayás, Ramón Gómez; Adrio, Javier; Campillo, Nuria E.

Sci Rep ; 7(1): 2403, 2017 05 25.

Artigo em Inglês | MEDLINE | ID: mdl-28546583

RESUMO

Quantitative structure-activity relationship modeling using machine learning techniques constitutes a complex computational problem, where the identification of the most informative molecular descriptors for predicting a specific target property plays a critical role. Two main general approaches can be used for this modeling procedure: feature selection and feature learning. In this paper, a performance comparative study of two state-of-art methods related to these two approaches is carried out. In particular, regression and classification models for three different issues are inferred using both methods under different experimental scenarios: two drug-like properties, such as blood-brain-barrier and human intestinal absorption, and enantiomeric excess, as a measurement of purity used for chiral substances. Beyond the contrastive analysis of feature selection and feature learning methods as competitive approaches, the hybridization of these strategies is also evaluated based on previous results obtained in material sciences. From the experimental results, it can be concluded that there is not a clear winner between both approaches because the performance depends on the characteristics of the compound databases used for modeling. Nevertheless, in several cases, it was observed that the accuracy of the models can be improved by combining both approaches when the molecular descriptor sets provided by feature selection and feature learning contain complementary information.

Assuntos

Descoberta de Drogas , Aprendizado de Máquina , Modelos Moleculares , Relação Quantitativa Estrutura-Atividade , Algoritmos , Barreira Hematoencefálica/efeitos dos fármacos , Barreira Hematoencefálica/metabolismo , Fenômenos Químicos , Descoberta de Drogas/métodos , Humanos , Absorção Intestinal/efeitos dos fármacos , Software

Visual analytics in cheminformatics: user-supervised descriptor selection for QSAR methods.

Martínez, María Jimena; Ponzoni, Ignacio; Díaz, Mónica F; Vazquez, Gustavo E; Soto, Axel J.

J Cheminform ; 7: 39, 2015.

Artigo em Inglês | MEDLINE | ID: mdl-26300983

RESUMO

BACKGROUND: The design of QSAR/QSPR models is a challenging problem, where the selection of the most relevant descriptors constitutes a key step of the process. Several feature selection methods that address this step are concentrated on statistical associations among descriptors and target properties, whereas the chemical knowledge is left out of the analysis. For this reason, the interpretability and generality of the QSAR/QSPR models obtained by these feature selection methods are drastically affected. Therefore, an approach for integrating domain expert's knowledge in the selection process is needed for increase the confidence in the final set of descriptors. RESULTS: In this paper a software tool, which we named Visual and Interactive DEscriptor ANalysis (VIDEAN), that combines statistical methods with interactive visualizations for choosing a set of descriptors for predicting a target property is proposed. Domain expertise can be added to the feature selection process by means of an interactive visual exploration of data, and aided by statistical tools and metrics based on information theory. Coordinated visual representations are presented for capturing different relationships and interactions among descriptors, target properties and candidate subsets of descriptors. The competencies of the proposed software were assessed through different scenarios. These scenarios reveal how an expert can use this tool to choose one subset of descriptors from a group of candidate subsets or how to modify existing descriptor subsets and even incorporate new descriptors according to his or her own knowledge of the target property. CONCLUSIONS: The reported experiences showed the suitability of our software for selecting sets of descriptors with low cardinality, high interpretability, low redundancy and high statistical performance in a visual exploratory way. Therefore, it is possible to conclude that the resulting tool allows the integration of a chemist's expertise in the descriptor selection process with a low cognitive effort in contrast with the alternative of using an ad-hoc manual analysis of the selected descriptors. Graphical abstractVIDEAN allows the visual analysis of candidate subsets of descriptors for QSAR/QSPR. In the two panels on the top, users can interactively explore numerical correlations as well as co-occurrences in the candidate subsets through two interactive graphs.

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA