Pesquisa | Biblioteca Virtual em Saúde

Exploring proteasome inhibition using atomic weighted vector indices and machine learning approaches.

Martínez-López, Yoan; Castillo-Garit, Juan A; Casanola-Martin, Gerardo M; Rasulev, Bakhtiyor; Rodríguez-Gonzalez, Ansel Y; Martínez-Santiago, Oscar; Barigye, Stephen J.

Mol Divers ; 2023 Apr 05.

Artigo em Inglês | MEDLINE | ID: mdl-37017875

RESUMO

Ubiquitin-proteasome system (UPS) is a highly regulated mechanism of intracellular protein degradation and turnover. The UPS is involved in different biological activities, such as the regulation of gene transcription and cell cycle. Several researchers have applied cheminformatics and artificial intelligence methods to study the inhibition of proteasomes, including the prediction of UPP inhibitors. Following this idea, we applied a new tool for obtaining molecular descriptors (MDs) for modeling proteasome Inhibition in terms of EC50 (µmol/L), in which a set of new MDs called atomic weighted vectors (AWV) and several prediction algorithms were used in cheminformatics studies. In the manuscript, a set of descriptors based on AWV are presented as datasets for training different machine learning techniques, such as linear regression, multiple linear regression (MLR), random forest (RF), K-nearest neighbors (IBK), multi-layer perceptron, best-first search, and genetic algorithm. The results suggest that these atomic descriptors allow adequate modeling of proteasome inhibitors despite artificial intelligence techniques, as a variant to build efficient models for the prediction of inhibitory activity.

When global and local molecular descriptors are more than the sum of its parts: Simple, But Not Simpler?

Martínez-López, Yoan; Marrero-Ponce, Yovani; Barigye, Stephen J; Teran, Enrique; Martínez-Santiago, Oscar; Zambrano, Cesar H; Torres, F Javier.

Mol Divers ; 24(4): 913-932, 2020 Nov.

Artigo em Inglês | MEDLINE | ID: mdl-31659696

RESUMO

In this report, we introduce a set of aggregation operators (AOs) to calculate global and local (group and atom type) molecular descriptors (MDs) as a generalization of the classical approach of molecular encoding using the sum of the atomic (or fragment) contributions. These AOs are implemented in a new and free software denominated MD-LOVIs ( http://tomocomd.com/md-lovis ), which allows for the calculation of MDs from atomic weights vector and LOVIs (local vertex invariants). This software was developed in Java programming language and employed the Chemical Development Kit (CDK) library for handling chemical structures and the calculation of atomic weights. An analysis of the complexities of the algorithms presented herein demonstrates that these aspects were efficiently implemented. The calculation speed experiments show that the MD-LOVIs software has satisfactory behavior when compared to software such as Padel, CDKDescriptor, DRAGON and Bluecal software. Shannon's entropy (SE)-based variability studies demonstrate that MD-LOVIs yields indices with greater information content when compared to those of popular academic and commercial software. A principal component analysis reveals that our approach captures chemical information orthogonal to that codified by the DRAGON, Padel and Mold2 software, as a result of the several generalizations in MD-LOVIs not used in other programs. Lastly, three QSARs were built using multiple linear regression with genetic algorithms, and the statistical parameters of these models demonstrate that the MD-LOVIs indices obtained with AOs yield better performance than those obtained when the summation operator is used exclusively. Moreover, it is also revealed that the MD-LOVIs indices yield models with comparable to superior performance when compared to other QSAR methodologies reported in the literature, despite their simplicity. The studies performed herein collectively demonstrated that MD-LOVIs software generates indices as simple as possible, but not simpler and that use of AOs enhances the diversity of the chemical information codified, which consequently improves the performance of traditional MDs.

Assuntos

Modelos Químicos , Bibliotecas de Moléculas Pequenas/química , Algoritmos , Modelos Lineares , Análise Multivariada , Relação Quantitativa Estrutura-Atividade , Software

Physico-Chemical and Structural Interpretation of Discrete Derivative Indices on N-Tuples Atoms.

Martínez-Santiago, Oscar; Marrero-Ponce, Yovani; Barigye, Stephen J; Le Thi Thu, Huong; Torres, F Javier; Zambrano, Cesar H; Muñiz Olite, Jorge L; Cruz-Monteagudo, Maykel; Vivas-Reyes, Ricardo; Vázquez Infante, Liliana; Artiles Martínez, Luis M.

Int J Mol Sci ; 17(6)2016 May 27.

Artigo em Inglês | MEDLINE | ID: mdl-27240357

RESUMO

This report examines the interpretation of the Graph Derivative Indices (GDIs) from three different perspectives (i.e., in structural, steric and electronic terms). It is found that the individual vertex frequencies may be expressed in terms of the geometrical and electronic reactivity of the atoms and bonds, respectively. On the other hand, it is demonstrated that the GDIs are sensitive to progressive structural modifications in terms of: size, ramifications, electronic richness, conjugation effects and molecular symmetry. Moreover, it is observed that the GDIs quantify the interaction capacity among molecules and codify information on the activation entropy. A structure property relationship study reveals that there exists a direct correspondence between the individual frequencies of atoms and Hückel's Free Valence, as well as between the atomic GDIs and the chemical shift in NMR, which collectively validates the theory that these indices codify steric and electronic information of the atoms in a molecule. Taking in consideration the regularity and coherence found in experiments performed with the GDIs, it is possible to say that GDIs possess plausible interpretation in structural and physicochemical terms.

Assuntos

Preparações Farmacêuticas/química , Algoritmos , Gráficos por Computador , Desenho de Fármacos , Entropia

Relations frequency hypermatrices in mutual, conditional and joint entropy-based information indices.

Barigye, Stephen J; Marrero-Ponce, Yovani; Martínez-López, Yoan; Torrens, Francisco; Artiles-Martínez, Luis Manuel; Pino-Urias, Ricardo W; Martínez-Santiago, Oscar.

J Comput Chem ; 34(4): 259-74, 2013 Feb 05.

Artigo em Inglês | MEDLINE | ID: mdl-23015467

RESUMO

Graph-theoretic matrix representations constitute the most popular and significant source of topological molecular descriptors (MDs). Recently, we have introduced a novel matrix representation, named the duplex relations frequency matrix, F, derived from the generalization of an incidence matrix whose row entries are connected subgraphs of a given molecular graph G. Using this matrix, a series of information indices (IFIs) were proposed. In this report, an extension of F is presented, introducing for the first time the concept of a hypermatrix in graph-theoretic chemistry. The hypermatrix representation explores the n-tuple participation frequencies of vertices in a set of connected subgraphs of G. In this study we, however, focus on triple and quadruple participation frequencies, generating triple and quadruple relations frequency matrices, respectively. The introduction of hypermatrices allows us to redefine the recently proposed MDs, that is, the mutual, conditional, and joint entropy-based IFIs, in a generalized way. These IFIs are implemented in GT-STAF (acronym for Graph Theoretical Thermodynamic STAte Functions), a new module of the TOMOCOMD-CARDD program. Information theoretic-based variability analysis of the proposed IFIs suggests that the use of hypermatrices enhances the entropy and, hence, the variability of the previously proposed IFIs, especially the conditional and mutual entropy based IFIs. The predictive capacity of the proposed IFIs was evaluated by the analysis of the regression models, obtained for physico-chemical properties the partition coefficient (Log P) and the specific rate constant (Log K) of 34 derivatives of 2-furylethylene. The statistical parameters, for the best models obtained for these properties, were compared to those reported in the literature depicting better performance. This result suggests that the use of the hypermatrix-based approach, in the redefinition of the previously proposed IFIs, avails yet other valuable tools beneficial in QSPR studies and diversity analysis.

Assuntos

Simulação por Computador , Etilenos/química , Modelos Químicos , Mineração de Dados , Entropia

Higher-Order and Mixed Discrete Derivatives such as a Novel Graph- Theoretical Invariant for Generating New Molecular Descriptors.

Martínez-Santiago, Oscar; Marrero-Ponce, Yovani; Vivas-Reyes, Ricardo; Ugarriza, Mauricio E O; Hurtado-Rodríguez, Elízabeth; Martínez-López, Yoan; Torres, F Javier; Zambrano, Cesar H; Pham-The, Hai.

Curr Top Med Chem ; 19(11): 944-956, 2019.

Artigo em Inglês | MEDLINE | ID: mdl-31074367

RESUMO

BACKGROUND: Recently, some authors have defined new molecular descriptors (MDs) based on the use of the Graph Discrete Derivative, known as Graph Derivative Indices (GDI). This new approach about discrete derivatives over various elements from a graph takes as outset the formation of subgraphs. Previously, these definitions were extended into the chemical context (N-tuples) and interpreted in structural/physicalchemical terms as well as applied into the description of several endpoints, with good results. OBJECTIVE: A generalization of GDIs using the definitions of Higher Order and Mixed Derivative for molecular graphs is proposed as a generalization of the previous works, allowing the generation of a new family of MDs. METHODS: An extension of the previously defined GDIs is presented, and for this purpose, the concept of Higher Order Derivatives and Mixed Derivatives is introduced. These novel approaches to obtaining MDs based on the concepts of discrete derivatives (finite difference) of the molecular graphs use the elements of the hypermatrices conceived from 12 different ways (12 events) of fragmenting the molecular structures. The result of applying the higher order and mixed GDIs over any molecular structure allows finding Local Vertex Invariants (LOVIs) for atom-pairs, for atoms-pairs-pairs and so on. All new families of GDIs are implemented in a computational software denominated DIVATI (acronym for Discrete DeriVAtive Type Indices), a module of KeysFinder Framework in TOMOCOMD-CARDD system. RESULTS: QSAR modeling of the biological activity (Log 1/K) of 31 steroids reveals that the GDIs obtained using the higher order and mixed GDIs approaches yield slightly higher performance compared to previously reported approaches based on the duplex, triplex and quadruplex matrix. In fact, the statistical parameters for models obtained with the higher-order and mixed GDI method are superior to those reported in the literature by using other 0-3D QSAR methods. CONCLUSION: It can be suggested that the higher-order and mixed GDIs, appear as a promissory tool in QSAR/QSPRs, similarity/dissimilarity analysis and virtual screening studies.

Assuntos

Relação Quantitativa Estrutura-Atividade , Modelos Moleculares

Prediction of aquatic toxicity of benzene derivatives using molecular descriptor from atomic weighted vectors.

Martínez-López, Yoan; Barigye, Stephen J; Martínez-Santiago, Oscar; Marrero-Ponce, Yovani; Green, James; Castillo-Garit, Juan A.

Environ Toxicol Pharmacol ; 56: 314-321, 2017 Dec.

Artigo em Inglês | MEDLINE | ID: mdl-29091819

RESUMO

Several descriptors from atom weighted vectors are used in the prediction of aquatic toxicity of set of organic compounds of 392 benzene derivatives to the protozoo ciliate Tetrahymena pyriformis (log(IGC50)-1). These descriptors are calculated using the MD-LOVIs software and various Aggregation Operators are examined with the aim comparing their performances in predicting aquatic toxicity. Variability analysis is used to quantify the information content of these molecular descriptors by means of an information theory-based algorithm. Multiple Linear Regression with Genetic Algorithms is used to obtain models of the structure-toxicity relationships; the best model shows values of Q2=0.830 and R2=0.837 using six variables. Our models compare favorably with other previously published models that use the same data set. The obtained results suggest that these descriptors provide an effective alternative for determining aquatic toxicity of benzene derivatives.

Assuntos

Derivados de Benzeno/toxicidade , Tetrahymena pyriformis/efeitos dos fármacos , Poluentes Químicos da Água/toxicidade , Algoritmos , Modelos Moleculares , Software

Generalized Molecular Descriptors Derived From Event-Based Discrete Derivative.

Martínez-Santiago, Oscar; Cabrera, Reisel Millán; Marrero-Ponce, Yovani; Barigye, Stephen J; Le-Thi-Thu, Huong; Torres, F Javier; Zambrano, Cesar H; Yaber-Goenaga, Ivan; Cruz-Monteagudo, Maykel; López, Yoan Martínez; Giménez, Facundo Pérez; Torrens, Francisco.

Curr Pharm Des ; 22(33): 5095-5113, 2016.

Artigo em Inglês | MEDLINE | ID: mdl-27852205

RESUMO

In the present study, a generalized approach for molecular structure characterization is introduced, based on the relation frequency matrix (F) representation of the molecular graph and the subsequent calculation of the corresponding discrete derivative (finite difference) over a pair of elements (atoms). In earlier publications (22- 24), an unique event, named connected subgraphs, (based on the Kier-Hall's subgraphs) was systematically employed for the computation of the matrix F. The present report is a generalization of this notion, in which eleven additional events are introduced, classified in three categories, namely, topological (terminal paths, vertex path incidence, quantum subgraphs, walks of length k, Sach's subgraphs), fingerprints (MACCs, E-state and substructure fingerprints) and atomic contributions (Ghose and Crippen atom-types for hydrophobicity and refractivity) for F generation. The events are intended to capture diverse information by the generation or search of different kinds of substructures from the graph representation of a molecule. The discrete derivative over duplex atom relations are calculated for each event, and the resulting derivatives, local vertex invariants (LOVIs) are finally obtained. These LOVIs are subsequently employed as the basis for the calculation of global and local indices over groups of atoms (heteroatoms, halogens, methyl carbons, etc.), by using norms, means, statistics and classical algorithms as aggregator (fusion) operators. These indices were implemented in our house software DIVATI (Derivative Type Indices, a new module of TOMOCOMDCARDD system). DIVATI provides a friendly and cross-platform graphical user interface, developed in the Java programming language and is freely available at: http: //www.tomocomd.com. Factor analysis shows that the presented events are rather orthogonal and collect diverse information about the chemical structure. Finally, QSPR models were built to describe the logP and logK of 34 furylethylenes derivatives using the eleven events. Generally, the equations obtained according to these events showed high correlations, with the Sach's sub-graphs and Multiplicity events showing the best behavior in the description of logK (Q2 LOO value of 99.06%) and logP (Q2 LOO value of 98.1 %), respectively. These results show that these new eventbased indices constitute a powerful approach for chemoinformatics studies.

Assuntos

Algoritmos , Furanos/química , Modelos Químicos , Software

Discrete Derivatives for Atom-Pairs as a Novel Graph-Theoretical Invariant for Generating New Molecular Descriptors: Orthogonality, Interpretation and QSARs/QSPRs on Benchmark Databases.

Martínez-Santiago, Oscar; Millán-Cabrera, Reisel; Marrero-Ponce, Yovani; Barigye, Stephen J; Martínez-López, Yoan; Torrens, Francisco; Pérez-Giménez, Facundo.

Mol Inform ; 33(5): 343-68, 2014 May.

Artigo em Inglês | MEDLINE | ID: mdl-27485891

RESUMO

This report presents a new mathematical method based on the concept of the derivative of a molecular graph (G) with respect to a given event (S) to codify chemical structure information. The derivate over each pair of atoms in the molecule is defined as ∂G/∂S(vi , vj )=(fi -2fij +fj )/fij , where fi (or fj ) and fij are the individual frequency of atom i (or j) and the reciprocal frequency of the atoms i and j, respectively. These frequencies characterize the participation intensity of atom pairs in S. Here, the event space is composed of molecular sub-graphs which participate in the formation of the G skeleton that could be complete (representing all possible connected sub-graphs) or comprised of sub-graphs of certain orders or types or combinations of these. The atom level graph derivative index, Δi , is expressed as a linear combination of all atom pair derivatives that include the atomic nuclei i. Global [total or local (group or atom-type)] indices are obtained by applying the so called invariants over a vector of Δi values. The novel MDs are validated using a data set of 28 alkyl-alcohols and other benchmark data sets proposed by the International Academy of Mathematical Chemistry. Also, the boiling point for the alcohols, the adrenergic blocking activity of N,N-dimethyl-2-halo-phenethylamines and physicochemical properties of polychlorinated biphenyls and octanes are modeled. These models exhibit satisfactory predictive power compared with other 0-3D indices implemented successfully by other researchers. In addition, tendencies of the proposed indices are investigated using examples of various types of molecular structures, including chain-lengthening, branching, heteroatoms-content, and multiple bonds. On the other hand, the relation of atom-based derivative indices with (17) O NMR of a series of ethers and carbonyls reflects that the new MDs encode electronic, topological and steric information. Linear independence between the graph derivative indices and other 0-3D MDs is demonstrated by using principal component analysis on a dataset of 41 heterogeneous molecules. It is concluded that the graph derivative indices are independent indices containing important structural information to be used in QSPR/QSAR and drug design studies, and permit obtaining easier, more interpretable and robust mathematical models than the majority of those reported in the literature.

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA