RESUMO
Protozoan parasites have been one of the most significant public health problems for centuries and several human infections caused by them have massive global impact. Most of the current drugs used to treat these illnesses have been used for decades and have many limitations such as the emergence of drug resistance, severe side-effects, low-to-medium drug efficacy, administration routes, cost, etc. These drugs have been largely neglected as models for drug development because they are majorly used in countries with limited resources and as a consequence with scarce marketing possibilities. Nowadays, there is a pressing need to identify and develop new drug-based antiprotozoan therapies. In an effort to overcome this problem, the main purpose of this study is to develop a QSARs-based ensemble classifier for antiprotozoan drug-like entities from a heterogeneous compounds collection. Here, we use some of the TOMOCOMD-CARDD molecular descriptors and linear discriminant analysis (LDA) to derive individual linear classification functions in order to discriminate between antiprotozoan and non-antiprotozoan compounds as a way to enable the computational screening of virtual combinatorial datasets and/or drugs already approved. Firstly, we construct a wide-spectrum benchmark database comprising of 680 organic chemicals with great structural variability (254 of them antiprotozoan agents and 426 to drugs having other clinical uses). This series of compounds was processed by a k-means cluster analysis in order to design training and predicting sets. In total, seven discriminant functions were obtained, by using the whole set of atom-based linear indices. All the LDA-based QSAR models show accuracies above 85% in the training set and values of Matthews correlation coefficients (C) vary from 0.70 to 0.86. The external validation set shows rather-good global classifications of around 80% (92.05% for best equation). Later, we developed a multi-agent QSAR classification system, in which the individual QSAR outputs are the inputs of the aforementioned fusion approach. Finally, the fusion model was used for the identification of a novel generation of lead-like antiprotozoan compounds by using ligand-based virtual screening of 'available' small molecules (with synthetic feasibility) in our 'in-house' library. A new molecular subsystem (quinoxalinones) was then theoretically selected as a promising lead series, and its derivatives subsequently synthesized, structurally characterized, and experimentally assayed by using in vitro screening that took into consideration a battery of five parasite-based assays. The chemicals 11(12) and 16 are the most active (hits) against apicomplexa (sporozoa) and mastigophora (flagellata) subphylum parasites, respectively. Both compounds depicted good activity in every protozoan in vitro panel and they did not show unspecific cytotoxicity on the host cells. The described technical framework seems to be a promising QSAR-classifier tool for the molecular discovery and development of novel classes of broad-antiprotozoan-spectrum drugs, which may meet the dual challenges posed by drug-resistant parasites and the rapid progression of protozoan illnesses.
Assuntos
Antiprotozoários/farmacologia , Quinoxalinas/síntese química , Ciclização , Estrutura Molecular , Relação Quantitativa Estrutura-Atividade , Quinoxalinas/químicaRESUMO
Trichomonas vaginalis (Tv) is the causative agent of the most common, non-viral, sexually transmitted disease in women and men worldwide. Since 1959, metronidazole (MTZ) has been the drug of choice in the systemic treatment of trichomoniasis. However, resistance to MTZ in some patients and the great cost associated with the development of new trichomonacidals make necessary the development of computational methods that shorten the drug discovery pipeline. Toward this end, bond-based linear indices, new TOMOCOMD-CARDD molecular descriptors, and linear discriminant analysis were used to discover novel trichomonacidal chemicals. The obtained models, using non-stochastic and stochastic indices, are able to classify correctly 89.01% (87.50%) and 82.42% (84.38%) of the chemicals in the training (test) sets, respectively. These results validate the models for their use in the ligand-based virtual screening. In addition, they show large Matthews' correlation coefficients (C) of 0.78 (0.71) and 0.65 (0.65) for the training (test) sets, correspondingly. The result of predictions on the 10% full-out cross-validation test also evidences the robustness of the obtained models. Later, both models are applied to the virtual screening of 12 compounds already proved against Tv. As a result, they correctly classify 10 out of 12 (83.33%) and 9 out of 12 (75.00%) of the chemicals, respectively; which is the most important criterion for validating the models. Besides, these classification functions are applied to a library of seven chemicals in order to find novel antitrichomonal agents. These compounds are synthesized and tested for in vitro activity against Tv. As a result, experimental observations approached to theoretical predictions, since it was obtained a correct classification of 85.71% (6 out of 7) of the chemicals. Moreover, out of the seven compounds that are screened, synthesized and biologically assayed, six compounds (VA7-34, VA7-35, VA7-37, VA7-38, VA7-68, VA7-70) show pronounced cytocidal activity at the concentration of 100 mug/ml at 24 h (48 h) within the range of 98.66%-100% (99.40%-100%), while only two molecules (chemicals VA7-37 and VA7-38) show high cytocidal activity at the concentration of 10 mug/ml at 24 h (48 h): 98.38% (94.23%) and 97.59% (98.10%), correspondingly. The LDA-assisted QSAR models presented here could significantly reduce the number of synthesized and tested compounds and could increase the chance of finding new chemical entities with anti-trichomonal activity.
Assuntos
Antitricômonas/química , Desenho de Fármacos , Relação Quantitativa Estrutura-Atividade , Algoritmos , Animais , Antitricômonas/farmacologia , Proliferação de Células/efeitos dos fármacos , Sobrevivência Celular/efeitos dos fármacos , Fenômenos Químicos , Físico-Química , Biologia Computacional/métodos , Análise Discriminante , Modelos Lineares , Metronidazol/farmacologia , Estrutura Molecular , Quinoxalinas/química , Quinoxalinas/farmacologia , Software , Validação de Programas de Computador , Processos Estocásticos , Trichomonas vaginalis/efeitos dos fármacosRESUMO
The recently introduced non-stochastic and stochastic bond-based linear indices are been generalized to codify chemical structure information for chiral drugs, making use of a trigonometric 3D-chirality correction factor. These improved modified descriptors are applied to several well-known data sets to validate each one of them. Particularly, Cramer's steroid data set has become a benchmark for the assessment of novel quantitative structure activity relationship methods. This data set has been used by several researchers using 3D-QSAR approaches such as Comparative Molecular Field Analysis, Molecular Quantum Similarity Measures, Comparative Molecular Moment Analysis, E-state, Mapping Property Distributions of Molecular Surfaces, and so on. For that reason, it is selected by us for the sake of comparability. In addition, to evaluate the effectiveness of this novel approach in drug design we model the angiotensin-converting enzyme inhibitory activity of perindoprilate's sigma-stereoisomers combinatorial library, as well as codify information related to a pharmacological property highly dependent on the molecular symmetry of a set of seven pairs of chiral N-alkylated 3-(3-hydroxyphenyl)-piperidines that bind sigma-receptors. The validation of this method is achieved by comparison with earlier publications applied to the same data sets. The non-stochastic and stochastic bond-based 3D-chiral linear indices appear to provide a very interesting alternative to other more common 3D-QSAR descriptors.
Assuntos
Inibidores da Enzima Conversora de Angiotensina/química , Desenho de Fármacos , Indóis/química , Modelos Químicos , Piperidinas/química , Inibidores da Enzima Conversora de Angiotensina/farmacologia , Técnicas de Química Combinatória , Indóis/farmacologia , Piperidinas/farmacologia , Relação Quantitativa Estrutura-Atividade , Receptores sigma/antagonistas & inibidores , Receptores sigma/metabolismo , Estereoisomerismo , Processos Estocásticos , TermodinâmicaRESUMO
The in vitro determination of the permeability through cultured Caco-2 cells is the most often-used in vitro model for drug absorption. In this report, we use the largest data set of measured P(Caco-2), consisting of 157 structurally diverse compounds. Linear discriminant analysis (LDA) was used to obtain quantitative models that discriminate higher absorption compounds from those with moderate-poorer absorption. The best LDA model has an accuracy of 90.58% and 84.21% for training and test set. The percentage of good correlation, in the virtual screening of 241 drugs with the reported values of the percentage of human intestinal absorption (HIA), was greater than 81%. In addition, multiple linear regression models were developed to predict Caco-2 permeability with determination coefficients of 0.71 and 0.72. Our method compares favorably with other approaches implemented in the Dragon software, as well as other methods from the international literature. These results suggest that the proposed method is a good tool for studying the oral absorption of drug candidates.
Assuntos
Desenho de Fármacos , Farmacocinética , Células CACO-2 , Humanos , Modelos Lineares , Permeabilidade , Relação Quantitativa Estrutura-Atividade , Curva ROC , Processos EstocásticosRESUMO
The great cost associated with the development of new anabolic-androgenic steroid (AASs) makes necessary the development of computational methods that shorten the drug discovery pipeline. Toward this end, quantum, and physicochemical molecular descriptors, plus linear discriminant analysis (LDA) were used to analyze the anabolic/androgenic activity of structurally diverse steroids and to discover novel AASs, as well as also to give a structural interpretation of their anabolic-androgenic ratio (AAR). The obtained models are able to correctly classify 91.67% (86.27%) of the AASs in the training (test) sets, respectively. The results of predictions on the 10% full-out cross-validation test also evidence the robustness of the obtained model. Moreover, these classification functions are applied to an "in house" library of chemicals, to find novel AASs. Two new AASs are synthesized and tested for in vivo activity. Although both AASs are less active than some commercially AASs, this result leaves a door open to a virtual variational study of the structure of the two compounds, to improve their biological activity. The LDA-assisted QSAR models presented here, could significantly reduce the number of synthesized and tested AASs, as well as could increase the chance of finding new chemical entities with higher AAR.