RESUMO
Owing to its high natural abundance compared to the commonly used transition (precious) metals, as well as its high Lewis acidity and ability to change oxidation state, aluminium has recently been explored as the basis for a range of single-site catalysts. This paper aims to establish the ground rules for the development of a new type of cationic alkene oligomerisation catalyst containing two Al(III) ions, with the potential to act co-operatively in stereoselective assembly. Five new dimers of the type [R2Al(2-py')]2 (R=Me, iBu; py'=substituted pyridyl group) with different substituents on the Al atoms and pyridyl rings have been synthesised. The formation of the undesired cis isomers can be suppressed by the presence of substituents on the 6-position of the pyridyl ring due to steric congestion, with DFT calculations showing that the selection of the trans isomer is thermodynamically controlled. Calculations show that demethylation of the dimers [Me2Al(2-py')]2 with Ph3C+ to the cations [{MeAl(2-py')}2(µ-Me)]+ is highly favourable and that the desired trans disposition of the 2-pyridyl ring units is influenced by steric effects. Preliminary experimental studies confirm that demethylation of [Me2Al(6-MeO-2-py)]2 can be achieved using [Ph3C][B(C6F5)4].
RESUMO
C-H borylation is a high-value transformation in the synthesis of lead candidates for the pharmaceutical industry because a wide array of downstream coupling reactions is available. However, predicting its regioselectivity, especially in drug-like molecules that may contain multiple heterocycles, is not a trivial task. Using a data set of borylation reactions from Reaxys, we explored how a language model originally trained on USPTO_500_MT, a broad-scope set of patent data, can be used to predict the C-H borylation reaction product in different modes: product generation and site reactivity classification. Our fine-tuned T5Chem multitask language model can generate the correct product in 79% of cases. It can also classify the reactive aromatic C-H bonds with 95% accuracy and 88% positive predictive value, exceeding purpose-developed graph-based neural networks.
Assuntos
Hidrogênio , Hidrogênio/química , Modelos Químicos , Redes Neurais de ComputaçãoRESUMO
In the pursuit of improved compound identification and database search tasks, this study explores heteronuclear single quantum coherence (HSQC) spectra simulation and matching methodologies. HSQC spectra serve as unique molecular fingerprints, enabling a valuable balance of data collection time and information richness. We conducted a comprehensive evaluation of the following four HSQC simulation techniques: ACD/Labs (ACD), MestReNova (MNova), Gaussian NMR calculations (DFT), and a graph-based neural network (ML). For the latter two techniques, we developed a reconstruction logic to combine proton and carbon 1D spectra into HSQC spectra. The methodology involved the implementation of three peak-matching strategies (minimum-sum, Euclidean-distance, and Hungarian distance) combined with three padding strategies (zero-padding, peak-truncated, and nearest-neighbor double assignment). We found that coupling these strategies with a robust simulation technique facilitates the accurate identification of correct molecules from similar analogues (regio- and stereoisomers) and allows for fast and accurate large database searches. Furthermore, we demonstrated the efficacy of the best-performing methodology by rectifying the structures of a set of previously misidentified molecules. This research indicates that effective HSQC spectral simulation and matching methodologies significantly facilitate molecular structure elucidation. Furthermore, we offer a Google Colab notebook for researchers to use our methods on their own data (https://github.com/AstraZeneca/hsqc_structure_elucidation.git).
Assuntos
Simulação por Computador , Redes Neurais de ComputaçãoRESUMO
CONFPASS (Conformer Prioritizations and Analysis for DFT re-optimizations) has been developed to extract dihedral angle descriptors from conformational searching outputs, perform clustering, and return a priority list for density functional theory (DFT) re-optimizations. Evaluations were conducted with DFT data of the conformers for 150 structurally diverse molecules, most of which are flexible. CONFPASS gives a confidence estimate that the global minimum structure has been found, and based on our dataset, we can have 90% confidence after optimizing half of the FF structures. Re-optimizing conformers in order of the FF energy often generates duplicate results; using CONFPASS, the duplication rate is reduced by a factor of 2 for the first 30% of the re-optimizations, which include the global minimum structure about 80% of the time.
Assuntos
Conformação Molecular , TermodinâmicaRESUMO
Machine Learning (ML) is increasingly applied to fill data gaps in assessments to quantify impacts associated with chemical emissions and chemicals in products. However, the systematic application of ML-based approaches to fill chemical data gaps is still limited, and their potential for addressing a wide range of chemicals is unknown. We prioritized chemical-related parameters for chemical toxicity characterization to inform ML model development based on two criteria: (1) each parameter's relevance to robustly characterize chemical toxicity described by the uncertainty in characterization results attributable to each parameter and (2) the potential for ML-based approaches to predict parameter values for a wide range of chemicals described by the availability of chemicals with measured parameter data. We prioritized 13 out of 38 parameters for developing ML-based approaches, while flagging another nine with critical data gaps. For all prioritized parameters, we performed a chemical space analysis to assess further the potential for ML-based approaches to predict data for diverse chemicals considering the structural diversity of available measured data, showing that ML-based approaches can potentially predict 8-46% of marketed chemicals based on 1-10% with available measured data. Our results can systematically inform future ML model development efforts to address data gaps in chemical toxicity characterization.
Assuntos
Aprendizado de Máquina , Humanos , Medição de RiscoRESUMO
Computational reaction prediction has become a ubiquitous task in chemistry due to the potential value accurate predictions can bring to chemists. Boronic acids are widely used in industry; however, understanding how to avoid the protodeboronation side reaction remains a challenge. We have developed an algorithm for in silico prediction of the rate of protodeboronation of boronic acids. A general mechanistic model devised through kinetic studies of protodeboronation was found in the literature and forms the foundation on which the algorithm presented in this work is built. Protodeboronation proceeds through 7 distinct pathways, though for any particular boronic acid, only a subset of mechanistic pathways are active. The rate of each active mechanistic pathway is linearly correlated with its characteristic energy difference, which in turn can be determined using Density Functional Theory. We validated the algorithm using leave-one-out cross-validation on a data set of 50 boronic acids and made a further 50 rate predictions on academically and industrially important boronic acids out of sample. We believe this work will provide great assistance to chemists performing reactions that feature boronic acids, such as Suzuki-Miyaura and Chan-Evans-Lam couplings.
RESUMO
The epigenetic modification 5-methylcytosine plays a vital role in development, cell specific gene expression and disease states. The selective chemical modification of the 5-methylcytosine methyl group is challenging. Currently, no such chemistry exists. Direct functionalisation of 5-methylcytosine would improve the detection and study of this epigenetic feature. We report a xanthone-photosensitised process that introduces a 4-pyridine modification at a C(sp3 )-H bond in the methyl group of 5-methylcytosine. We propose a reaction mechanism for this type of reaction based on density functional calculations and apply transition state analysis to rationalise differences in observed reaction efficiencies between cyanopyridine derivatives. The reaction is initiated by single electron oxidation of 5-methylcytosine followed by deprotonation to generate the methyl group radical. Cross coupling of the methyl radical with 4-cyanopyridine installs a 4-pyridine label at 5-methylcytosine. We demonstrate use of the pyridination reaction to enrich 5-methylcytosine-containing ribonucleic acid.
Assuntos
5-Metilcitosina , Elétrons , 5-Metilcitosina/química , Oxirredução , Catálise , Epigênese GenéticaRESUMO
In recent times, machine learning has become increasingly prominent in predictive toxicology as it has shifted from in vivo studies toward in silico studies. Currently, in vitro methods together with other computational methods such as quantitative structure-activity relationship modeling and absorption, distribution, metabolism, and excretion calculations are being used. An overview of machine learning and its applications in predictive toxicology is presented here, including support vector machines (SVMs), random forest (RF) and decision trees (DTs), neural networks, regression models, naïve Bayes, k-nearest neighbors, and ensemble learning. The recent successes of these machine learning methods in predictive toxicology are summarized, and a comparison of some models used in predictive toxicology is presented. In predictive toxicology, SVMs, RF, and DTs are the dominant machine learning methods due to the characteristics of the data available. Lastly, this review describes the current challenges facing the use of machine learning in predictive toxicology and offers insights into the possible areas of improvement in the field.
Assuntos
Aprendizado de Máquina , Testes de Toxicidade , Humanos , Modelos Moleculares , Relação Quantitativa Estrutura-AtividadeRESUMO
In recent years, a growing number of organic reactions in the literature have shown selectivity controlled by reaction dynamics rather than by transition state theory. Such reactions are difficult to analyse because the transition state theory approach often does not capture the subtlety of the energy landscapes the compounds traverse and, therefore, cannot accurately predict the selectivity. We present an algorithm that can predict the major product and selectivity for a wide range of potential energy surfaces where the product distribution is influenced by reaction dynamics. The method requires as input calculation of the transition states, the intermediate (if present) and the product geometries. The algorithm is quick and simple to run and, except for two reactions with long alkyl chains, calculates selectivity more accurately than transition state theory alone.
RESUMO
N-Triflylphosphoramides (NTPA), have become increasingly popular catalysts in the development of enantioselective transformations as they are stronger Brønsted acids than the corresponding phosphoric acids (PA). Their highly acidic, asymmetric active site can activate difficult, unreactive substrates. In this review, we present an account of asymmetric transformations using this type of catalyst that have been reported in the past ten years and we classify these reactions using the enantio-determining step as the key criterion. This compendium of NTPA-catalysed reactions is organised into the following categories: (1) cycloadditions, (2) electrocyclisations, polyene and related cyclisations, (3) addition reactions to imines, (4) electrophilic aromatic substitutions, (5) addition reactions to carbocations, (6) aldol and related reactions, (7) addition reactions to double bonds, and (8) rearrangements and desymmetrisations. We highlight the use of NTPA in total synthesis and suggest mnemonics which account for their enantioselectivity.
RESUMO
BACKGROUND: Evidence suggests that health care data sharing may strengthen care coordination, improve quality and safety, and reduce costs. However, to achieve efficient and meaningful adoption of health care data-sharing initiatives, it is necessary to engage all stakeholders, from health care professionals to patients. Although previous work has assessed health care professionals' perceptions of data sharing, perspectives of the general public and particularly of seldom heard groups have yet to be fully assessed. OBJECTIVE: This study aims to explore the views of the public, particularly their hopes and concerns, around health care data sharing. METHODS: An original, immersive public engagement interactive experience was developed-The Can of Worms installation-in which participants were prompted to reflect about data sharing through listening to individual stories around health care data sharing. A multidisciplinary team with expertise in research, public involvement, and human-centered design developed this concept. The installation took place in three separate events between November 2018 and November 2019. A combination of convenience and snowball sampling was used in this study. Participants were asked to fill self-administered feedback cards and to describe their hopes and fears about the meaningful use of data in health care. The transcripts were compiled verbatim and systematically reviewed by four independent reviewers using the thematic analysis method to identify emerging themes. RESULTS: Our approach exemplifies the potential of using interdisciplinary expertise in research, public involvement, and human-centered design to tell stories, collect perspectives, and spark conversations around complex topics in participatory digital medicine. A total of 352 qualitative feedback cards were collected, each reflecting participants' hopes and fears for health care data sharing. Thematic analyses identified six themes under hopes: enablement of personal access and ownership, increased interoperability and collaboration, generation of evidence for better and safer care, improved timeliness and efficiency, delivery of more personalized care, and equality. The five main fears identified included inadequate security and exploitation, data inaccuracy, distrust, discrimination and inequality, and less patient-centered care. CONCLUSIONS: This study sheds new light on the main hopes and fears of the public regarding health care data sharing. Importantly, our results highlight novel concerns from the public, particularly in terms of the impact on health disparities, both at international and local levels, and on delivering patient-centered care. Incorporating the knowledge generated and focusing on co-designing solutions to tackle these concerns is critical to engage the public as active contributors and to fully leverage the potential of health care data use.
Assuntos
Medo/psicologia , Disseminação de Informação/métodos , Participação do Paciente/métodos , Assistência Centrada no Paciente/métodos , Adulto , Análise de Dados , Feminino , Humanos , Masculino , Pesquisa QualitativaRESUMO
A large number of organic reactions feature post-transition-state bifurcations. Selectivities in such reactions are difficult to analyze because they cannot be determined by comparing the energies of competing transition states. Molecular dynamics approaches can provide answers but are computationally very expensive. We present an algorithm that predicts the major products in bifurcating organic reactions with negligible computational cost. The method requires two transition states, two product geometries, and no additional information. The algorithm correctly predicts the major product for about 90% of the organic reactions investigated. For the remaining 10% of the reactions, the algorithm returns a warning indication that the conclusion may be uncertain. The method also reproduces the experimental and the molecular dynamics product ratios within 15% for more than 80% of the reactions. We have successfully applied the method to a trifurcating organic reaction, a carbocation rearrangement, and solvent-dependent Pummerer-like reactions, demonstrating the power of the algorithm to simplify and to help understand highly complex reactions.
RESUMO
The Minisci reaction is one of the most valuable methods for directly functionalizing basic heteroarenes to form carbon-carbon bonds. Use of prochiral, heteroatom-substituted radicals results in stereocenters being formed adjacent to the heteroaromatic system, generating motifs which are valuable in medicinal chemistry and chiral ligand design. Recently a highly enantioselective and regioselective protocol for the Minisci reaction was developed, using chiral phosphoric acid catalysis. However, the precise mechanism by which this process operated and the origin of selectivity remained unclear, making it challenging to develop the reaction more generally. Herein we report further experimental mechanistic studies which feed into detailed DFT calculations that probe the precise nature of the stereochemistry-determining step. Computational and experimental evidence together support Curtin-Hammett control in this reaction, with initial radical addition being quick and reversible, and enantioselectivity being achieved in the subsequent slower, irreversible deprotonation. A detailed survey via DFT calculations assessed a number of different possibilities for selectivity-determining deprotonation of the radical cation intermediate. Computations point to a clear preference for an initially unexpected mode of internal deprotonation enacted by the amide group, which is a crucial structural feature of the radical precursor, with the assistance of the associated chiral phosphate. This unconventional stereodetermining step underpins the high enantioselectivities and regioselectivities observed. The mechanistic model was further validated by applying it to a test set of substrates possessing varied structural features.
RESUMO
The aim of human toxicity risk assessment is to determine a safe dose or exposure to a chemical for humans. This requires an understanding of the exposure of a person to a chemical and how much of the chemical is required to cause an adverse effect. To do this computationally, we need to understand how much of a chemical is required to perturb normal biological function in an adverse outcome pathway (AOP). The molecular initiating event (MIE) is the first step in an adverse outcome pathway and can be considered as a chemical interaction between a chemical toxicant and a biological molecule. Key chemical characteristics can be identified and used to model the chemistry of these MIEs. In this study, we do just this by using chemical substructures to categorize chemicals and 3D quantitative structure-activity relationships (QSARs) based on comparative molecular field analysis (CoMFA) to calculate molecular activity. Models have been constructed across a variety of human biological targets, the glucocorticoid receptor, mu opioid receptor, cyclooxygenase-2 enzyme, human ether-à-go-go related gene channel, and dopamine transporter. These models tend to provide molecular activity estimation well within one log unit and electronic and steric fields that can be visualized to better understand the MIE and biological target of interest. The outputs of these fields can be used to identify key aspects of a chemical's chemistry which can be changed to reduce its ability to activate a given MIE. With this methodology, the quantitative chemical activity can be predicted for a wide variety of MIEs, which can feed into AOP-based chemical risk assessments, and understanding of the chemistry behind the MIE can be gained.
Assuntos
Compostos Orgânicos/análise , Relação Quantitativa Estrutura-Atividade , Bases de Dados de Compostos Químicos , Efeitos Colaterais e Reações Adversas Relacionados a Medicamentos , Humanos , Conformação Molecular , Medição de RiscoRESUMO
Having a measure of confidence in computational predictions of biological activity from in silico tools is vital when making predictions for new chemicals, for example, in chemical risk assessment. Where predictions of biological activity are used as an indicator of a potential hazard, false-negative predictions are the most concerning prediction; however, assigning confidence in inactive predictions is particularly challenging. How can one confidently identify the absence of activating features? In this study, we present methods for assigning confidence to both active and inactive predictions from structural alerts for protein-binding molecular initiating events (MIEs). Structural alerts were derived through an iterative statistical method. Confidence in the activity predictions is assigned by measuring the Tanimoto similarity between Morgan fingerprints of chemicals in the test set to relevant chemicals in the training set, and suitable cutoff values have been defined to give different confidence categories. To avoid a potential compound series bias in the test set and hence overestimate the performance of the method, we measured the biological activity of 27 compounds with 24 proteins, which gave us an additional 648 experimental measurements; many of the measurements are currently nonexistent in the literature and databases. This data set was complemented with newly measured biological activities published in ChEMBL25 and formed a combined independent validation data set. Applying the confidence categories to the computational predictions for the new data leads to the identification of chemicals for which one should be confident of either an inactive or active prediction, allowing model predictions to be used responsibly.
Assuntos
Compostos Orgânicos/química , Proteínas/química , Bases de Dados Factuais , Estrutura MolecularRESUMO
A molecular initiating event (MIE) is the gateway to an adverse outcome pathway (AOP), a sequence of events ending in an adverse effect. In silico predictions of MIEs are a vital tool in a modern, mechanism-focused approach to chemical risk assessment. For 90 biological targets representing important human MIEs, structural alert-based models have been constructed with an automated procedure that uses Bayesian statistics to iteratively select substructures. These models give impressive average performance statistics (an average of 92% correct predictions across targets), significantly improving on previous models. Random Forest models have been constructed from physicochemical features for the same targets, giving similarly impressive performance statistics (93% correct predictions). A key difference between the models is interpretation of predictions-the structural alert models are transparent and easy to interpret, while Random Forest models can only identify the most important physicochemical features for making predictions. The two complementary models have been combined in a consensus model, improving performance compared to each individual model (94% correct predictions) and increasing confidence in predictions. Variation in model performance has been explained by calculating a modelability index (MODI), using Tanimoto coefficient between Morgan fingerprints to identify nearest neighbor chemicals. This work is an important step toward building confidence in the use of in silico tools for assessment of toxicity.
Assuntos
Rotas de Resultados Adversos , Algoritmos , Simulação por Computador , Teorema de Bayes , Humanos , Estrutura Molecular , Relação Estrutura-AtividadeRESUMO
Molecular initiating events (MIEs) are key events in adverse outcome pathways that link molecular chemistry to target biology. As they are based on chemistry, these interactions are excellent targets for computational chemistry approaches to in silico modeling. In this work, we aim to link ligand chemical structures to MIEs for androgen receptor (AR) and glucocorticoid receptor (GR) binding using ToxCast data. This has been done using an automated computational algorithm to perform maximal common substructure searches on chemical binders for each target from the ToxCast dataset. The models developed show a high level of accuracy, correctly assigning 87.20% of AR binders and 96.81% of GR binders in a 25% test set using holdout cross-validation. The 2D structural alerts developed can be used as in silico models to predict these MIEs and as guidance for in vitro ToxCast assays to confirm hits. These models can target such experimental work, reducing the number of assays to be performed to gain required toxicological insight. Development of these models has also allowed some structural alerts to be identified as predictors for agonist or antagonist behavior at the receptor target. This work represents a first step in using computational methods to guide and target experimental approaches.
Assuntos
Androgênios , Receptores Androgênicos , Receptores de Glucocorticoides , Algoritmos , Simulação por Computador , Ligação Proteica , Testes de ToxicidadeRESUMO
AIMS: To test the reliability and validity of intravaginal pressure measurements acquired during pelvic floor muscle (PFM) tasks in different body positions using the FemFit®, a new intravaginal pressure device. METHODS: Twenty healthy adult women participated in this study. Two assessment sessions were conducted. Intravaginal pressure measurements using the FemFit® were repeated during PFM contraction and straining maneuvers while lying and standing. Maximal intravaginal pressures were collated and compared within and between sessions. They were also correlated to maximal force measurements obtained by dynamometry and vaginal digital palpation. Test-retest reliability was assessed using intraclass correlation coefficient, standard error of measurement and Bland-Altman plots. The validity of the pressure measurements was assessed using Pearson's correlation (dynamometry) and Spearman's rho (palpation). RESULTS: This test-retest study indicate excellent reliability for PFM contraction and straining maneuver both in lying and standing, within and between sessions. For the straining maneuver while standing, increased variability was suggested by a wider limit of agreement on Bland-Altman plots (spanning 31.3 to 43.3mm Hg). A significant moderate to strong correlation was found when comparing measurements of PFM contraction using the FemFit® and the dynamometer or the palpation (Pearson's coefficient = 0.72, P = .006; Spearman's rho = 0.68, P = .005, respectively). CONCLUSION: Our research findings suggest that intravaginal pressures can be reliably measured during PFM contraction and straining manoeuver while lying and standing, using the FemFit® device, both within and between sessions. A moderate to strong correlation between the FemFit® pressure and the force measurements obtained by dynamometry or palpation reinforce the validity of measurements.
Assuntos
Contração Muscular/fisiologia , Diafragma da Pelve/fisiologia , Vagina/fisiologia , Adulto , Idoso , Feminino , Humanos , Pessoa de Meia-Idade , Palpação , Pressão , Estudos Prospectivos , Reprodutibilidade dos Testes , Adulto JovemRESUMO
Many factors affect vaccine efficacy. One of the most salient is the frequency and intervals of vaccine administration. In this study, we assessed the vaccine administration modality for a recently reported polyanhydride-based vaccine formulation, shown to generate antitumor activity. Polyanhydride particles encapsulating ovalbumin (OVA) were prepared using a double-emulsion technique and subcutaneously delivered to mice either as a single-dose or as prime-boost vaccine regimens in which two different time intervals between prime and boost were assessed (7 or 21 days). This was followed by measurement of cellular and humoral immune responses, and subsequent challenge of the mice with a lethal dose of E.G7-OVA cells to evaluate tumor protection. Interestingly, a single dose of the polyanhydride particle-based formulation induced sustained OVA-specific cellular immune responses just as effectively as the prime-boost regimens. In addition, mice receiving single-dose vaccine had similar levels of protection against tumor challenge compared with mice administered prime-boosts. In contrast, measurements of OVA-specific IgG antibody titers indicated that a booster dose was required to stimulate strong humoral immune responses, since it was observed that mice administered a prime-boost vaccine had significantly higher OVA-specific IgG1 serum titers than mice administered a single dose. These findings indicate that the requirement for a booster dose using these particles appears unnecessary for the generation of effective cellular immunity.