Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 73
Filtrar
Más filtros

Banco de datos
Tipo del documento
País de afiliación
Intervalo de año de publicación
1.
Chem Res Toxicol ; 34(2): 507-513, 2021 02 15.
Artículo en Inglés | MEDLINE | ID: mdl-33433197

RESUMEN

Recently developed computational models can estimate plasma, hepatic, and renal concentrations of industrial chemicals in rats. Typically, the input parameter values (i.e., the absorption rate constant, volume of systemic circulation, and hepatic intrinsic clearance) for simplified physiologically based pharmacokinetic (PBPK) model systems are calculated to give the best fit to measured or reported in vivo blood substance concentration values in animals. The purpose of the present study was to estimate in silico these three input pharmacokinetic parameters using a machine learning algorithm applied to a broad range of chemical properties obtained from several cheminformatics software tools. These in silico estimated parameters were then incorporated into PBPK models for predicting internal exposures in rats. Following this approach, simplified PBPK models were set up for 246 drugs, food components, and industrial chemicals with a broad range of chemical structures. We had previously generated PBPK models for 158 of these substances, whereas 88 for which concentration series data were available in the literature were newly modeled. The values for the absorption rate constant, volume of systemic circulation, and hepatic intrinsic clearance could be generated in silico by equations containing between 14 and 26 physicochemical properties. After virtual oral dosing, the output concentration values of the 246 compounds in plasma, liver, and kidney from rat PBPK models using traditionally determined and in silico estimated input parameters were well correlated (r ≥ 0.83). In summary, by using PBPK models consisting of chemical receptor (gut), metabolizing (liver), excreting (kidney), and central (main) compartments with in silico-derived input parameters, the forward dosimetry of new chemicals could provide the plasma/tissue concentrations of drugs and chemicals after oral dosing, thereby facilitating estimates of hematotoxic, hepatotoxic, or nephrotoxic potential as a part of risk assessment.


Asunto(s)
Simulación por Computador , Riñón/metabolismo , Hígado/metabolismo , Modelos Biológicos , Preparaciones Farmacéuticas/metabolismo , Administración Oral , Animales , Riñón/química , Hígado/química , Preparaciones Farmacéuticas/administración & dosificación , Preparaciones Farmacéuticas/química , Ratas
2.
Chem Res Toxicol ; 34(10): 2180-2183, 2021 10 18.
Artículo en Inglés | MEDLINE | ID: mdl-34586804

RESUMEN

Updated algorithms for predicting the volumes of systemic circulation (V1), along with absorption rate constants and hepatic intrinsic clearances, as input parameters for physiologically based pharmacokinetic (PBPK) models were established to improve the accuracy of estimated plasma and tissue concentrations of 323 chemicals after virtual oral administrations in rats. Using ridge regression with an enlarged set of chemical descriptors (up to 99), the estimated input V1 values resulted in an improved correlation coefficient (from 246 compounds) with the traditionally determined values. The PBPK model input parameters for rats of diverse compounds can be precisely estimated by increasing the number of descriptors.


Asunto(s)
Compuestos Orgánicos/farmacocinética , Administración Oral , Animales , Compuestos Orgánicos/administración & dosificación , Ratas , Distribución Tisular
3.
J Chem Inf Model ; 61(7): 3348-3360, 2021 07 26.
Artículo en Inglés | MEDLINE | ID: mdl-34264667

RESUMEN

The aim of scaffold hopping (SH) is to find compounds consisting of different scaffolds from those in already known active compounds, giving an opportunity for unexplored regions of chemical space. We previously demonstrated the usefulness of pharmacophore graphs (PhGs) for this purpose through proof-of-concept virtual screening experiments. PhGs consist of nodes and edges corresponding to pharmacophoric features (PFs) and their topological distances. Although PhGs were effective in SH, they are hard to interpret as they are complete graphs. Herein, we introduce an intuitive representation of a molecule, termed as sparse pharmacophore graphs (SPhG) by keeping the topological distances among PFs as much as possible while reducing the number of edges in the graphs. Several benchmark calculations quantitatively confirmed the sparseness of the graphs and the preservation of topological distances among pharmacophoric points. As proof-of-concept applications, virtual screening (VS) trials for SH were conducted using active and inactive compounds from ChEMBL and PubChem databases for three biological targets: thrombin, tyrosine kinase ABL1, and κ-opioid receptor. The performances of VS were comparable with using fully connected PhGs. Furthermore, highly ranked SPhGs were interpretable for the three biological targets, in particular for thrombin, for which selected SPhGs were in agreement with the structure-based interpretation.


Asunto(s)
Diseño de Fármacos , Receptores de Droga
4.
J Comput Aided Mol Des ; 35(2): 179-193, 2021 02.
Artículo en Inglés | MEDLINE | ID: mdl-33392949

RESUMEN

Quantitative structure-activity relationship (QSAR) and quantitative structure-property relationship (QSPR) models predict biological activity and molecular property based on the numerical relationship between chemical structures and activity (property) values. Molecular representations are of importance in QSAR/QSPR analysis. Topological information of molecular structures is usually utilized (2D representations) for this purpose. However, conformational information seems important because molecules are in the three-dimensional space. As a three-dimensional molecular representation applicable to diverse compounds, similarity between a test molecule and a set of reference molecules has been previously proposed. This 3D representation was found to be effective on virtual screening for early enrichment of active compounds. In this study, we introduced the 3D representation into QSAR/QSPR modeling (regression tasks). Furthermore, we investigated relative merits of 3D representations over 2D in terms of the diversity of training data sets. For the prediction task of quantum mechanics-based properties, the 3D representations were superior to 2D. For predicting activity of small molecules against specific biological targets, no consistent trend was observed in the difference of performance using the two types of representations, irrespective of the diversity of training data sets.


Asunto(s)
Compuestos Orgánicos/química , Bases de Datos Factuales , Evaluación Preclínica de Medicamentos , Aprendizaje Automático , Modelos Moleculares , Conformación Molecular , Relación Estructura-Actividad Cuantitativa , Análisis de Regresión
5.
Molecules ; 26(16)2021 Aug 13.
Artículo en Inglés | MEDLINE | ID: mdl-34443503

RESUMEN

Activity cliffs (ACs) are formed by two structurally similar compounds with a large difference in potency. Accurate AC prediction is expected to help researchers' decisions in the early stages of drug discovery. Previously, predictive models based on matched molecular pair (MMP) cliffs have been proposed. However, the proposed methods face a challenge of interpretability due to the black-box character of the predictive models. In this study, we developed interpretable MMP fingerprints and modified a model-specific interpretation approach for models based on a support vector machine (SVM) and MMP kernel. We compared important features highlighted by this SVM-based interpretation approach and the SHapley Additive exPlanations (SHAP) as a major model-independent approach. The model-specific approach could capture the difference between AC and non-AC, while SHAP assigned high weights to the features not present in the test instances. For specific MMPs, the feature weights mapped by the SVM-based interpretation method were in agreement with the previously confirmed binding knowledge from X-ray co-crystal structures, indicating that this method is able to interpret the AC prediction model in a chemically intuitive manner.

6.
AAPS PharmSciTech ; 22(1): 41, 2021 Jan 08.
Artículo en Inglés | MEDLINE | ID: mdl-33420526

RESUMEN

After the Food and Drug Association in the USA published guidelines on the enhanced use of process analytical technology (PAT) and continuous manufacturing, many studies regarding PAT and continuous manufacturing have been published. This paper describes a case study involving granulation and coating steps with ethenzamide to investigate interference for PAT model construction and model management. We investigated what factors should be considered and addressed when PAT is implemented for continuous manufacturing and how predictive models should be constructed. The product qualities that were monitored were moisture content and particle size in the granulation step and tablet weight and moisture content in the coating step. We have constructed models for the granulation step and validated the predictive capability of the models against an external dataset. A partial least squares (PLS) model with manual wavelength selection had the best predictive accuracy for loss on drying against the external validation set. We found that the prediction of loss on drying was accurate, but the prediction of particle size was not sufficiently accurate. In the coating step, because of the small amount of data, we performed three-fold cross-validation and y-scrambling 10 times, to select the optimal hyper-parameters and to check if the models were fitted to chance correlations. We confirmed that the coating agent weights, tablet weights, and water content could be accurately predicted based on the mean of the R2 score for cross-validation. Addition of other variables, as well as the absorbance, slightly improved the predictive accuracy.


Asunto(s)
Salicilamidas/química , Tecnología Farmacéutica/métodos , Composición de Medicamentos/métodos , Tamaño de la Partícula , Comprimidos
7.
Chem Res Toxicol ; 33(7): 1736-1751, 2020 07 20.
Artículo en Inglés | MEDLINE | ID: mdl-32500706

RESUMEN

Recently developed high-throughput in vitro assays in combination with computational models could provide alternatives to animal testing. The purpose of the present study was to model the plasma, hepatic, and renal pharmacokinetics of approximately 150 structurally varied types of drugs, food components, and industrial chemicals after virtual external oral dosing in rats and to determine the relationship between the simulated internal concentrations in tissue/plasma and their lowest-observed-effect levels. The model parameters were based on rat plasma data from the literature and empirically determined pharmacokinetics measured after oral administrations to rats carried out to evaluate hepatotoxic or nephrotic potentials. To ensure that the analyzed substances exhibited a broad diversity of chemical structures, their structure-based location in the chemical space underwent projection onto a two-dimensional plane, as reported previously, using generative topographic mapping. A high-throughput in silico one-compartment model and a physiologically based pharmacokinetic (PBPK) model consisting of chemical receptor (gut), metabolizing (liver), central (main), and excreting (kidney) compartments were developed in parallel. For 159 disparate chemicals, the maximum plasma concentrations and the areas under the concentration-time curves obtained by one-compartment models and modified simple PBPK models were closely correlated. However, there were differences between the PBPK modeled and empirically obtained hepatic/renal concentrations and plasma maximal concentrations/areas under the concentration-time curves of the 159 chemicals. For a few compounds, the lowest-observed-effect levels were available for hepatotoxicity and nephrotoxicity in the Hazard Evaluation Support System Integrated Platform in Japan. The areas under the renal or hepatic concentration-time curves estimated using PBPK modeling were inversely associated with these lowest-observed-effect levels. Using PBPK forward dosimetry could provide the plasma/tissue concentrations of drugs and chemicals after oral dosing, thereby facilitating estimates of nephrotoxic or hepatotoxic potential as a part of the risk assessment.


Asunto(s)
Riñón/metabolismo , Hígado/metabolismo , Modelos Biológicos , Preparaciones Farmacéuticas/metabolismo , Farmacocinética , Administración Oral , Animales , Simulación por Computador , Preparaciones Farmacéuticas/sangre , Ratas
8.
J Chem Inf Model ; 60(4): 2073-2081, 2020 04 27.
Artículo en Inglés | MEDLINE | ID: mdl-32202780

RESUMEN

The primary goal of ligand-based virtual screening is to identify active compounds consisting of a core scaffold that is not found in the current active compound pool. Scaffold hopping is the term used for this purpose. In the present study, topological representations of pharmacophore features on chemical graphs were investigated for scaffold hopping. Pharmacophore graphs (PhGs), which consist of pharmacophore features as nodes and their topological distances as edges, were used as a representation of important information on compounds being active. We investigated ranking methods for prioritizing PhGs for scaffold hopping. The proposed method, NScaffold, which ranks PhGs based on the number of scaffolds covered by the PhGs, outperforms other conventional methods. As a demonstrative case, using a thrombin inhibitor data set, we interpreted the highest-ranked PhGs by NScaffold from the protein-ligand interaction point of view. It resulted that the NScaffold method successfully retrieved three known important interactions, showing the potential for identifying scaffold-hopped compounds with interpretable PhGs.


Asunto(s)
Receptores de Droga , Ligandos
9.
BMC Bioinformatics ; 20(1): 728, 2019 Dec 23.
Artículo en Inglés | MEDLINE | ID: mdl-31870296

RESUMEN

BACKGROUND: Natural products are the source of various functional materials such as medicines, and understanding their biosynthetic pathways can provide information that is helpful for their effective production through the synthetic biology approach. A number of studies have aimed to predict biosynthetic pathways from their chemical structures in a retrosynthesis manner; however, sometimes the calculation finishes without reaching the starting material from the target molecule. In order to address this problem, the method to find suitable starting materials is required. RESULTS: In this study, we developed a predictive workflow named the Metabolic Disassembler that automatically disassembles the target molecule structure into relevant biosynthetic units (BUs), which are the substructures that correspond to the starting materials in the biosynthesis pathway. This workflow uses a biosynthetic unit library (BUL), which contains starting materials, key intermediates, and their derivatives. We obtained the starting materials from the KEGG PATHWAY database, and 765 BUs were registered in the BUL. We then examined the proposed workflow to optimize the combination of the BUs. To evaluate the performance of the proposed Metabolic Disassembler workflow, we used 943 molecules that are included in the secondary metabolism maps of KEGG PATHWAY. About 95.8% of them (903 molecules) were correctly disassembled by our proposed workflow. For comparison, we also implemented a genetic algorithm-based workflow, and found that the accuracy was only about 52.0%. In addition, for 90.7% of molecules, our workflow finished the calculation within one minute. CONCLUSIONS: The Metabolic Disassembler enabled the effective disassembly of natural products in terms of both correctness and computational time. It also outputs automatically highlighted color-coded substructures corresponding to the BUs to help users understand the calculation results. The users do not have to specify starting molecules in advance, and can input any target molecule, even if it is not in databases. Our workflow will be very useful for understanding and predicting the biosynthesis of natural products.


Asunto(s)
Productos Biológicos/química , Vías Biosintéticas/genética , Biología Sintética/métodos , Humanos
10.
Chem Res Toxicol ; 32(1): 211-218, 2019 01 22.
Artículo en Inglés | MEDLINE | ID: mdl-30511563

RESUMEN

Only a small fraction of chemicals possesses adequate in vivo toxicokinetic data for assessing potential hazards. The aim of the present study was to model the plasma and hepatic pharmacokinetics of more than 50 disparate types of chemicals and drugs after virtual oral administrations in rats. The models were based on reported pharmacokinetics determined after oral administration to rats. An inverse relationship was observed between no-observed-effect levels after oral administration and chemical absorbance rates evaluated for cell permeability ( r = -0.98, p < 0.001, n = 17). For a varied selection of more than 30 chemicals, the plasma concentration curves and the maximum concentrations obtained using a simple one-compartment model (recently recommended as a high-throughput toxicokinetic model) and a simple physiologically based pharmacokinetic (PBPK) model (consisting of chemical receptor, metabolizing, and central compartments) were highly consistent. The hepatic and plasma concentrations and the hepatic and plasma areas under the concentration-time curves of more than 50 chemicals were roughly correlated; however, differences were evident between the PBPK-modeled values in livers and empirically obtained values in plasma. Of the compounds selected for analysis, only seven had the lowest observed effect level (LOEL) values for hepatoxicity listed in the Hazard Evaluation Support System Integrated Platform in Japan. For these seven compounds, the LOEL values and the areas under the hepatic concentration-time curves estimated using PBPK modeling were inversely correlated ( r = -0.78, p < 0.05, n = 7). This study provides important information to help simulate the high hepatic levels of potent hepatotoxic compounds. Using suitable PBPK parameters, the present models could estimate the plasma/hepatic concentrations of chemicals and drugs after oral doses using both PBPK forward and reverse dosimetry, thereby indicating the potential value of this modeling approach in predicting hepatic toxicity as a part of risk assessments of chemicals absorbed in the human body.


Asunto(s)
Compuestos Orgánicos/análisis , Compuestos Orgánicos/farmacocinética , Administración Oral , Animales , Células CACO-2 , Humanos , Hígado/química , Hígado/metabolismo , Permeabilidad , Ratas , Distribución Tisular
11.
J Chem Inf Model ; 59(6): 2626-2641, 2019 06 24.
Artículo en Inglés | MEDLINE | ID: mdl-31058504

RESUMEN

Identification of chemical compounds having desirable properties is a central goal of screening campaigns. Iterative screening is a means of surveying a set of compounds, during which their property values are determined and used as feedback for regression models. Quantitative models that assess the relationships between chemical structures and property/activity are repeatedly updated through this type of cycle, and the efficient sampling of compounds for the subsequent test is a key factor in the early identification of target compounds. Nevertheless, methodological approaches to comparisons and to establishing the degree of extrapolation of sampled compounds, including the effects of applicability domains, are still required. In the present study, we conducted a series of virtual experiments to assess the characteristics of different iterative screening methods. Genetic algorithm-based partial least-squares regression, support vector regression, Bayesian optimization with Gaussian Process (GP), and batch-based Bayesian optimization with GP (GP_batch) were all compared, based on the analysis of one million compounds extracted from the ZINC database. Our results show that, irrespective of the diversity of the initial set of compounds, it was possible to identify a compound having the desired property value using the appropriate screening method. However, overall, the GP_batch method was found to be preferable when evaluating properties either which are difficult to predict or for which a key factor is present in the set of molecular descriptors.


Asunto(s)
Descubrimiento de Drogas/métodos , Preparaciones Farmacéuticas/química , Bibliotecas de Moléculas Pequeñas/química , Teorema de Bayes , Humanos , Análisis de los Mínimos Cuadrados , Distribución Normal , Farmacología , Relación Estructura-Actividad Cuantitativa , Bibliotecas de Moléculas Pequeñas/farmacología , Máquina de Vectores de Soporte
12.
J Chem Inf Model ; 59(6): 2656-2663, 2019 06 24.
Artículo en Inglés | MEDLINE | ID: mdl-31059251

RESUMEN

Molecular fingerprints are indispensable in medicinal chemistry for quantifying chemical structures. Fingerprints can be calculated for substructures with attachment points, which are positions where a substructure and a corresponding core structure connect. Because structures with attachment points can be crucial for understanding structure-activity relationships, fingerprints specialized for representing this structural feature are required. R-group fingerprints and R-group descriptors were proposed previously for this purpose; however, these molecular representations have limitations. Current R-group fingerprints do not emphasize information about attachment points, and R-group descriptors are too sensitive to changes in the topological path length from an attachment point. In the present work, we developed novel R-group fingerprints, termed R-path fingerprints, which contain substituent information from an attachment point without being sensitive to small differences in topological distances. The concept of the R-path fingerprints is to describe a chemical substructure from the viewpoint of an attachment point, to distinguish atomistic information around the attachment point and other parts of the substructure. This was achieved by considering all the paths on the shortest path between the attachment point and each atom in a substituent. Benchmark testing was conducted, including comparisons of similarity distributions and potency prediction for R-group substituents. The results showed that R-path fingerprints should be useful for classifying and comparing substructures with attachment points.


Asunto(s)
Descubrimiento de Drogas/métodos , Bibliotecas de Moléculas Pequeñas/química , Bibliotecas de Moléculas Pequeñas/farmacología , Algoritmos , Diseño de Fármacos , Humanos , Estructura Molecular , Preparaciones Farmacéuticas/química , Relación Estructura-Actividad
13.
J Chem Inf Model ; 59(3): 993-1004, 2019 03 25.
Artículo en Inglés | MEDLINE | ID: mdl-30485091

RESUMEN

Activity landscapes (ALs) integrate structural and potency data of active compounds and provide graphical access to structure-activity relationships (SARs) contained in compound data sets. Three-dimensional (3D) ALs can be conceptualized as a two-dimensional (2D) projection of chemical space with an interpolated activity surface added as a third dimension. Such 3D ALs are particularly intuitive for SAR visualization. In this work, 3D ALs were generated on the basis of different projection methods and fingerprint descriptors, and their topologies were compared. Moreover, going beyond qualitative analysis, the use of 3D ALs for semiquantitative and quantitative potency predictions was investigated. NeuroScale, a neural network variant of multidimensional scaling, combined with Gaussian process regression (GPR) was identified as a preferred approach for generating 3D ALs that accounted for training compounds and their SAR characteristics with high accuracy. On the other hand, GPR-induced overfitting generally limited the accuracy of potency value predictions regardless of the projection method applied. However, 3D ALs enabled reliable mapping of test compounds with varying potency levels to corresponding AL regions. The most accurate mapping was achieved with NeuroScale models. Taken together, the results of our analysis indicate the high potential of 3D ALs for graphical SAR exploration and the identification of potent test compounds.


Asunto(s)
Simulación por Computador , Preparaciones Farmacéuticas/química , Diseño de Fármacos , Ligandos , Estructura Molecular , Distribución Normal , Relación Estructura-Actividad Cuantitativa
14.
J Chem Inf Model ; 59(3): 983-992, 2019 03 25.
Artículo en Inglés | MEDLINE | ID: mdl-30547580

RESUMEN

Support vector regression (SVR) is a premier approach for the prediction of compound potency. Given the conceptual link between support vector machine (SVM) and SVR modeling, SVR is capable of accounting for continuous and discontinuous structure-activity relationships (SARs) in potency prediction, which further extends the classical quantitative SAR (QSAR) paradigm. In the context of virtual compound screening, compound potency prediction can be applied to identify the most potent compounds that are available or enrich database selection sets with potent compounds. To these ends, we have evaluated new potency prediction strategies. Conventional (direct) potency prediction using SVR was compared to two-stage SVM-SVR modeling and potency prediction using SVR models trained in the presence of active and inactive compounds, a previously unconsidered approach. The latter models were found to maximize the recall of potent compounds but were least accurate in predicting high potency values. For this purpose, direct SVR predictions were preferred. However, the best balance between accurate potency predictions and enrichment of potent compounds in database selection sets was achieved by combined SVM-SVR modeling. Taken together, our findings further extend current approaches for compound potency prediction in virtual compound screening.


Asunto(s)
Evaluación Preclínica de Medicamentos/métodos , Máquina de Vectores de Soporte , Relación Estructura-Actividad Cuantitativa , Análisis de Regresión
15.
J Comput Aided Mol Des ; 33(8): 729-743, 2019 08.
Artículo en Inglés | MEDLINE | ID: mdl-31435894

RESUMEN

In this work, computational compound screening strategies on the basis of two- and three-dimensional (2D and 3D) molecular representations were investigated including similarity searching and support vector machine (SVM) ranking. Calculations based on topological fingerprints and molecular shape queries and features were compared. A unique aspect of the analysis setting apart from previous comparisons of 2D and 3D virtual screening approaches has been the design of compound reference, training, and test data sets with controlled incremental increases in intra-set structural diversity and different categories of structural relationships between reference/training and test sets. The use of these data sets made it possible to assess the relative performance of 2D and 3D screening strategies under increasingly challenging conditions ultimately leading to the use of training and test sets with essentially unrelated structures. The results showed that 3D similarity searching had little advantage over 2D searching in identifying active compounds with remote structural relationships. However, 3D SVM models trained on the basis of shape features were superior to other approaches (including 2D SVM) when the detection of structure-activity relationships became increasingly challenging. Such 3D SVM methods has thus far only been little investigated in virtual screening, proving a wealth of opportunities for further analyses.


Asunto(s)
Química Computacional/métodos , Relación Estructura-Actividad , Máquina de Vectores de Soporte , Interfaz Usuario-Computador , Aprendizaje Automático , Conformación Molecular , Unión Proteica/genética
16.
J Fluoresc ; 28(2): 695-706, 2018 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-29680928

RESUMEN

The Quantitative Structure - Property Relationship (QSPR) approach was performed to study the fluorescence absorption wavelengths and emission wavelengths of 413 fluorescent dyes in different solvent conditions. The dyes included the chromophore derivatives of cyanine, xanthene, coumarin, pyrene, naphthalene, anthracene and etc., with the wavelength ranging from 250 nm to 800 nm. An ensemble method, random forest (RF), was employed to construct nonlinear prediction models compared with the results of linear partial least squares and nonlinear support vector machine regression models. Quantum chemical descriptors derived from density functional theory method and solvent information were also used by constructing models. The best prediction results were obtained from RF model, with the squared correlation coefficients [Formula: see text] of 0.940 and 0.905 for λabs and λem, respectively. The descriptors used in the models were discussed in detail in this report by comparing the feature importance of RF.

18.
AAPS PharmSciTech ; 18(3): 595-604, 2017 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-27170163

RESUMEN

This article proposes a novel concentration prediction model that requires little training data and is useful for rapid process understanding. Process analytical technology is currently popular, especially in the pharmaceutical industry, for enhancement of process understanding and process control. A calibration-free method, iterative optimization technology (IOT), was proposed to predict pure component concentrations, because calibration methods such as partial least squares, require a large number of training samples, leading to high costs. However, IOT cannot be applied to concentration prediction in non-ideal mixtures because its basic equation is derived from the Beer-Lambert law, which cannot be applied to non-ideal mixtures. We proposed a novel method that realizes prediction of pure component concentrations in mixtures from a small number of training samples, assuming that spectral changes arising from molecular interactions can be expressed as a function of concentration. The proposed method is named IOT with virtual molecular interaction spectra (IOT-VIS) because the method takes spectral change as a virtual spectrum x nonlin,i into account. It was confirmed through the two case studies that the predictive accuracy of IOT-VIS was the highest among existing IOT methods.


Asunto(s)
Preparaciones Farmacéuticas/química , Calibración , Industria Farmacéutica/métodos , Análisis de los Mínimos Cuadrados
19.
J Chem Inf Model ; 56(2): 286-99, 2016 Feb 22.
Artículo en Inglés | MEDLINE | ID: mdl-26818135

RESUMEN

Retrieving descriptor information (x information) from a value of an objective variable (y) is a fundamental problem in inverse quantitative structure-property relationship (inverse-QSPR) analysis but challenging because of the complexity of the preimage function. Herewith, we propose using a cluster-wise multiple linear regression (cMLR) model as a QSPR model for inverse-QSPR analysis. x information is acquired as a probability density function by combining cMLR and the prior distribution modeled with a mixture of Gaussians (GMMs). Three case studies were conducted to demonstrate various aspects of the potential of cMLR. It was found that the predictive power of cMLR was superior to that of MLR, especially for data with nonlinearity. Moreover, it turned out that the applicability domain could be considered since the posterior distribution inherits the prior distribution's feature (i.e., training data feature) and represents the possibility of having the desired property. Finally, a series of inverse analyses with the GMMs/cMLR was demonstrated with the aim to generate de novo structures having specific aqueous solubility.


Asunto(s)
Relación Estructura-Actividad Cuantitativa , Modelos Químicos , Estructura Molecular
20.
J Chem Inf Model ; 56(10): 1885-1893, 2016 10 24.
Artículo en Inglés | MEDLINE | ID: mdl-27632418

RESUMEN

To discover drug compounds in chemical space containing an enormous number of compounds, a structure generator is required to produce virtual drug-like chemical structures. The de novo design algorithm for exploring chemical space (DAECS) visualizes the activity distribution on a two-dimensional plane corresponding to chemical space and generates structures in a target area on a plane selected by the user. In this study, we modify the DAECS to enable the user to select a target area to consider properties other than activity and improve the diversity of the generated structures by visualizing the drug-likeness distribution and the activity distribution, generating structures by substructure-based structural changes, including addition, deletion, and substitution of substructures, as well as the slight structural changes used in the DAECS. Through case studies using ligand data for the human adrenergic alpha2A receptor and the human histamine H1 receptor, the modified DAECS can generate high diversity drug-like structures, and the usefulness of the modification of the DAECS is verified.


Asunto(s)
Algoritmos , Diseño de Fármacos , Bibliotecas de Moléculas Pequeñas/química , Bibliotecas de Moléculas Pequeñas/farmacología , Humanos , Ligandos , Simulación del Acoplamiento Molecular , Receptores Adrenérgicos alfa 2/química , Receptores Adrenérgicos alfa 2/metabolismo , Receptores Histamínicos H1/química , Receptores Histamínicos H1/metabolismo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA