Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 26
Filtrar
1.
Brief Bioinform ; 25(1)2023 11 22.
Artigo em Inglês | MEDLINE | ID: mdl-38233090

RESUMO

Immunologic recognition of peptide antigens bound to class I major histocompatibility complex (MHC) molecules is essential to both novel immunotherapeutic development and human health at large. Current methods for predicting antigen peptide immunogenicity rely primarily on simple sequence representations, which allow for some understanding of immunogenic features but provide inadequate consideration of the full scale of molecular mechanisms tied to peptide recognition. We here characterize contributions that unsupervised and supervised artificial intelligence (AI) methods can make toward understanding and predicting MHC(HLA-A2)-peptide complex immunogenicity when applied to large ensembles of molecular dynamics simulations. We first show that an unsupervised AI method allows us to identify subtle features that drive immunogenicity differences between a cancer neoantigen and its wild-type peptide counterpart. Next, we demonstrate that a supervised AI method for class I MHC(HLA-A2)-peptide complex classification significantly outperforms a sequence model on small datasets corrected for trivial sequence correlations. Furthermore, we show that both unsupervised and supervised approaches reveal determinants of immunogenicity based on time-dependent molecular fluctuations and anchor position dynamics outside the MHC binding groove. We discuss implications of these structural and dynamic immunogenicity correlates for the induction of T cell responses and therapeutic T cell receptor design.


Assuntos
Antígeno HLA-A2 , Simulação de Dinâmica Molecular , Humanos , Antígeno HLA-A2/metabolismo , Inteligência Artificial , Peptídeos/química , Antígenos de Histocompatibilidade Classe I/metabolismo , Ligação Proteica
2.
J Chem Inf Model ; 62(4): 801-816, 2022 02 28.
Artigo em Inglês | MEDLINE | ID: mdl-35130440

RESUMO

The application of deep learning to generative molecule design has shown early promise for accelerating lead series development. However, questions remain concerning how factors like training, data set, and seed bias impact the technology's utility to medicinal and computational chemists. In this work, we analyze the impact of seed and training bias on the output of an activity-conditioned graph-based variational autoencoder (VAE). Leveraging a massive, labeled data set corresponding to the dopamine D2 receptor, our graph-based generative model is shown to excel in producing desired conditioned activities and favorable unconditioned physical properties in generated molecules. We implement an activity-swapping method that allows for the activation, deactivation, or retention of activity of molecular seeds, and we apply independent deep learning classifiers to verify the generative results. Overall, we uncover relationships between noise, molecular seeds, and training set selection across a range of latent-space sampling procedures, providing important insights for practical AI-driven molecule generation.


Assuntos
Inteligência Artificial , Modelos Moleculares , Receptores de Dopamina D2 , Receptores de Dopamina D2/química
3.
J Chem Inf Model ; 62(2): 240-257, 2022 01 24.
Artigo em Inglês | MEDLINE | ID: mdl-34905358

RESUMO

Recent advances in deep learning have enabled the development of large-scale multimodal models for virtual screening and de novo molecular design. The human kinome with its abundant sequence and inhibitor data presents an attractive opportunity to develop proteochemometric models that exploit the size and internal diversity of this family of targets. Here, we challenge a standard practice in sequence-based affinity prediction models: instead of leveraging the full primary structure of proteins, each target is represented by a sequence of 29 discontiguous residues defining the ATP binding site. In kinase-ligand binding affinity prediction, our results show that the reduced active site sequence representation is not only computationally more efficient but consistently yields significantly higher performance than the full primary structure. This trend persists across different models, data sets, and performance metrics and holds true when predicting pIC50 for both unseen ligands and kinases. Our interpretability analysis reveals a potential explanation for the superiority of the active site models: whereas only mild statistical effects about the extraction of three-dimensional (3D) interaction sites take place in the full sequence models, the active site models are equipped with an implicit but strong inductive bias about the 3D structure stemming from the discontiguity of the active sites. Moreover, in direct comparisons, our models perform similarly or better than previous state-of-the-art approaches in affinity prediction. We then investigate a de novo molecular design task and find that the active site provides benefits in the computational efficiency, but otherwise, both kinase representations yield similar optimized affinities (for both SMILES- and SELFIES-based molecular generators). Our work challenges the assumption that the full primary structure is indispensable for modeling human kinases.


Assuntos
Proteínas , Sítios de Ligação , Domínio Catalítico , Humanos , Ligantes , Ligação Proteica , Proteínas/metabolismo
4.
J Chem Inf Model ; 62(18): 4295-4299, 2022 09 26.
Artigo em Inglês | MEDLINE | ID: mdl-36098536

RESUMO

Recent work showed that active site rather than full-protein-sequence information improves predictive performance in kinase-ligand binding affinity prediction. To refine the notion of an "active site", we here propose and compare multiple definitions. We report significant evidence that our novel definition is superior to previous definitions and better models of ATP-noncompetitive inhibitors. Moreover, we leverage the discontiguity of the active site sequence to motivate novel protein-sequence augmentation strategies and find that combining them further improves performance.


Assuntos
Trifosfato de Adenosina , Trifosfato de Adenosina/metabolismo , Sequência de Aminoácidos , Sítios de Ligação , Ligantes , Ligação Proteica
5.
J Comput Aided Mol Des ; 36(5): 391-404, 2022 05.
Artigo em Inglês | MEDLINE | ID: mdl-34817762

RESUMO

We here present a streamlined, explainable graph convolutional neural network (gCNN) architecture for small molecule activity prediction. We first conduct a hyperparameter optimization across nearly 800 protein targets that produces a simplified gCNN QSAR architecture, and we observe that such a model can yield performance improvements over both standard gCNN and RF methods on difficult-to-classify test sets. Additionally, we discuss how reductions in convolutional layer dimensions potentially speak to the "anatomical" needs of gCNNs with respect to radial coarse graining of molecular substructure. We augment this simplified architecture with saliency map technology that highlights molecular substructures relevant to activity, and we perform saliency analysis on nearly 100 data-rich protein targets. We show that resultant substructural clusters are useful visualization tools for understanding substructure-activity relationships. We go on to highlight connections between our models' saliency predictions and observations made in the medicinal chemistry literature, focusing on four case studies of past lead finding and lead optimization campaigns.


Assuntos
Redes Neurais de Computação , Proteínas
6.
J Chem Inf Model ; 60(9): 4170-4179, 2020 09 28.
Artigo em Inglês | MEDLINE | ID: mdl-32077698

RESUMO

We present a simple, modular graph-based convolutional neural network that takes structural information from protein-ligand complexes as input to generate models for activity and binding mode prediction. Complex structures are generated by a standard docking procedure and fed into a dual-graph architecture that includes separate subnetworks for the ligand bonded topology and the ligand-protein contact map. Recent work has indicated that data set bias drives many past promising results derived from combining deep learning and docking. Our dual-graph network allows contributions from ligand identity that give rise to such biases to be distinguished from effects of protein-ligand interactions on classification. We show that our neural network is capable of learning from protein structural information when, as in the case of binding mode prediction, an unbiased data set is constructed. We next develop a deep learning model for binding mode prediction that uses docking ranking as input in combination with docking structures. This strategy mirrors past consensus models and outperforms a baseline docking program (AutoDock Vina) in a variety of tests, including on cross-docking data sets that mimic real-world docking use cases. Furthermore, the magnitudes of network predictions serve as reliable measures of model confidence.


Assuntos
Aprendizado Profundo , Ligantes , Simulação de Acoplamento Molecular , Ligação Proteica , Proteínas/metabolismo
7.
Regul Toxicol Pharmacol ; 82: 94-98, 2016 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-27769827

RESUMO

Chronic (>3 months) preclinical toxicology studies are conducted to support the safe conduct of clinical trials exceeding 3 months in duration. We have conducted a review of 32 chronic toxicology studies in non-rodents (22 studies in dogs and 10 in non-human primates) and 27 chronic toxicology studies in rats dosed with Merck compounds to determine the frequency at which additional target organ toxicities are observed in chronic toxicology studies as compared to subchronic studies of 3 months in duration. Our review shows that majority of the findings are observed in the subchronic studies since additional target organs were not observed in 24 chronic non rodent studies and in 21 chronic rodent studies. However, 6 studies in non rodents and 6 studies in rodents yielded new findings that were not seen in studies of 3-month or shorter duration. For 3 compounds the new safety findings did contribute to termination of clinical development plans. Although the incidence of compound termination associated with chronic toxicology study observations is low (∼10%), the observations made in these studies can be important for evaluating human safety risk.


Assuntos
Testes de Toxicidade Crônica/métodos , Testes de Toxicidade Subcrônica/métodos , Animais , Bases de Dados Factuais , Relação Dose-Resposta a Droga , Indústria Farmacêutica , Humanos , Modelos Animais , Reprodutibilidade dos Testes , Medição de Risco , Especificidade da Espécie , Fatores de Tempo
8.
ChemMedChem ; 19(4): e202300202, 2024 02 16.
Artigo em Inglês | MEDLINE | ID: mdl-37574458

RESUMO

Molecular fragmentation has been frequently used for machine learning, molecular modeling, and drug discovery studies. However, the current molecular fragmentation tools often lead to large fragments that are useful to limited tasks. Specifically, long aliphatic chains, certain connected ring structures, fused rings, as well as various nitrogen-containing molecular entities often remain intact when using BRICS. With no known methods to solve this issue, we find that the fragments taken from BRICS are inflexible for tasks such as fragment-based machine learning, coarse-graining, and ligand-protein interaction assessment. In this work, a revised BRICS (r-BRICS) module is developed to allow more flexible fragmentation on a wider variety of molecules. It is shown that r-BRICS generates smaller fragments than BRICS, allowing localized fragment assessments. Furthermore, r-BRICS generates a fragment database with significantly more unique small fragments than BRICS, which is potentially useful for fragment-based drug discovery.


Assuntos
Carbono , Descoberta de Drogas , Modelos Moleculares , Ligantes
9.
Pediatr Clin North Am ; 71(2): 315-326, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38423723

RESUMO

When pediatricians, autistic people, and their families carefully consider and plan for the transition from pediatric care to adult care, there are better outcomes for patients. Pediatricians see their patients over time and are uniquely positioned to help prepare for the changes that come with the transition through adolescents to adulthood. Although programs such as Got Transition offer some guidance on how to navigate the transition from pediatric care to adult care, there is less information on how to help those on the autism spectrum and their families transition to adulthood in other ways.


Assuntos
Transtorno Autístico , Adulto , Criança , Humanos , Adolescente , Transtorno Autístico/terapia , Cuidadores , Pediatras
10.
Cell Rep Med ; 3(12): 100794, 2022 12 20.
Artigo em Inglês | MEDLINE | ID: mdl-36306797

RESUMO

Recent advances and accomplishments of artificial intelligence (AI) and deep generative models have established their usefulness in medicinal applications, especially in drug discovery and development. To correctly apply AI, the developer and user face questions such as which protocols to consider, which factors to scrutinize, and how the deep generative models can integrate the relevant disciplines. This review summarizes classical and newly developed AI approaches, providing an updated and accessible guide to the broad computational drug discovery and development community. We introduce deep generative models from different standpoints and describe the theoretical frameworks for representing chemical and biological structures and their applications. We discuss the data and technical challenges and highlight future directions of multimodal deep generative models for accelerating drug discovery.


Assuntos
Inteligência Artificial , Descoberta de Drogas , Descoberta de Drogas/métodos
11.
Nat Rev Chem ; 6(4): 287-295, 2022 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-35783295

RESUMO

One aspirational goal of computational chemistry is to predict potent and drug-like binders for any protein, such that only those that bind are synthesized. In this Roadmap, we describe the launch of Critical Assessment of Computational Hit-finding Experiments (CACHE), a public benchmarking project to compare and improve small molecule hit-finding algorithms through cycles of prediction and experimental testing. Participants will predict small molecule binders for new and biologically relevant protein targets representing different prediction scenarios. Predicted compounds will be tested rigorously in an experimental hub, and all predicted binders as well as all experimental screening data, including the chemical structures of experimentally tested compounds, will be made publicly available, and not subject to any intellectual property restrictions. The ability of a range of computational approaches to find novel binders will be evaluated, compared, and openly published. CACHE will launch 3 new benchmarking exercises every year. The outcomes will be better prediction methods, new small molecule binders for target proteins of importance for fundamental biology or drug discovery, and a major technological step towards achieving the goal of Target 2035, a global initiative to identify pharmacological probes for all human proteins.

12.
J Chem Inf Model ; 50(11): 2029-40, 2010 Nov 22.
Artigo em Inglês | MEDLINE | ID: mdl-20977231

RESUMO

One approach to estimating the "chemical tractability" of a candidate protein target where we know the atomic resolution structure is to examine the physical properties of potential binding sites. A number of other workers have addressed this issue. We characterize ~290,000 "pockets" from ~42,000 protein crystal structures in terms of a three parameter "pocket space": volume, buriedness, and hydrophobicity. A metric DLID (drug-like density) measures how likely a pocket is to bind a drug-like molecule. This is calculated from the count of other pockets in its local neighborhood in pocket space that contain drug-like cocrystallized ligands and the count of total pockets in the neighborhood. Surprisingly, despite being defined locally, a global trend in DLID can be predicted by a simple linear regression on log(volume), buriedness, and hydrophobicity. Two levels of simplification are necessary to relate the DLID of individual pockets to "targets": taking the best DLID per Protein Data Bank (PDB) entry (because any given crystal structure can have many pockets), and taking the median DLID over all PDB entries for the same target (because different crystal structures of the same protein can vary because of artifacts and real conformational changes). We can show that median DLIDs for targets that are detectably homologous in sequence are reasonably similar and that median DLIDs correlate with the "druggability" estimate of Cheng et al. (Nature Biotechnology 2007, 25, 71-75).


Assuntos
Bases de Dados de Proteínas , Descoberta de Drogas/métodos , Preparações Farmacêuticas/química , Preparações Farmacêuticas/metabolismo , Proteínas/química , Proteínas/metabolismo , Animais , Bovinos , Humanos , Interações Hidrofóbicas e Hidrofílicas , Ligantes , Camundongos , Modelos Moleculares , Ligação Proteica , Conformação Proteica
13.
Front Chem ; 8: 624163, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-33614597

RESUMO

Coronavirus disease 2019 (COVID-19) is an ongoing global pandemic caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), with very limited treatments so far. Demonstrated with good druggability, two major proteases of SARS-CoV-2, namely main protease (Mpro) and papain-like protease (PLpro) that are essential for viral maturation, have become the targets for many newly designed inhibitors. Unlike Mpro that has been heavily investigated, PLpro is not well-studied so far. Here, we carried out the in silico high-throughput screening of all FDA-approved drugs via the flexible docking simulation for potential inhibitors of PLpro and explored the molecular mechanism of binding between a known inhibitor rac5c and PLpro. Our results, from molecular dynamics simulation, show that the chances of drug repurposing for PLpro might be low. On the other hand, our long (about 450 ns) MD simulation confirms that rac5c can be bound stably inside the substrate-binding site of PLpro and unveils the molecular mechanism of binding for the rac5c-PLpro complex. The latter may help perform further structural optimization and design potent leads for inhibiting PLpro.

14.
Drug Discov Today ; 21(3): 473-80, 2016 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-26854423

RESUMO

Comparative effectiveness research (CER) provides evidence for the relative effectiveness and risks of different treatment options and informs decisions made by healthcare providers, payers, and pharmaceutical companies. CER data come from retrospective analyses as well as prospective clinical trials. Here, we describe the development of a text-mining pipeline based on natural language processing (NLP) that extracts key information from three different trial data sources: NIH ClinicalTrials.gov, WHO International Clinical Trials Registry Platform (ICTRP), and Citeline Trialtrove. The pipeline leverages tailored terminologies to produce an integrated and structured output, capturing any trials in which pharmaceutical products of interest are compared with another therapy. The timely information alerts generated by this system provide the earliest and most complete picture of emerging clinical research.


Assuntos
Pesquisa Comparativa da Efetividade , Mineração de Dados , Ensaios Clínicos como Assunto , Bases de Dados Factuais , Humanos , Processamento de Linguagem Natural , Sistema de Registros
15.
Drug Discov Today ; 21(5): 826-35, 2016 05.
Artigo em Inglês | MEDLINE | ID: mdl-26979546

RESUMO

External content sources such as MEDLINE(®), National Institutes of Health (NIH) grants and conference websites provide access to the latest breaking biomedical information, which can inform pharmaceutical and biotechnology company pipeline decisions. The value of the sites for industry, however, is limited by the use of the public internet, the limited synonyms, the rarity of batch searching capability and the disconnected nature of the sites. Fortunately, many sites now offer their content for download and we have developed an automated internal workflow that uses text mining and tailored ontologies for programmatic search and knowledge extraction. We believe such an efficient and secure approach provides a competitive advantage to companies needing access to the latest information for a range of use cases and complements manually curated commercial sources.


Assuntos
Mineração de Dados , Descoberta de Drogas , Processamento de Linguagem Natural , Sistemas de Informação
16.
J Med Chem ; 45(4): 753-7, 2002 Feb 14.
Artigo em Inglês | MEDLINE | ID: mdl-11831887

RESUMO

Inhibitors of histone deacetylase (HDAC) have been shown to induce terminal differentiation of human tumor cell lines and to have antitumor effects in vivo. We have prepared analogues of suberoylanilide hydroxamic acid (SAHA) and trichostatin A and have evaluated them in a human HDAC enzyme inhibition assay, a p21(waf1) (p21) promoter assay, and in monolayer growth inhibition assays. One compound, 4-(dimethylamino)-N-[7-(hydroxyamino)-7-oxoheptyl]-benzamide, was found to affect the growth of a panel of eight human tumor cell lines differentially.


Assuntos
Antineoplásicos/síntese química , Benzamidas/síntese química , Inibidores Enzimáticos/síntese química , Inibidores de Histona Desacetilases , Ácidos Hidroxâmicos/síntese química , Hidroxilaminas/síntese química , Antineoplásicos/química , Antineoplásicos/farmacologia , Benzamidas/química , Benzamidas/farmacologia , Inibidor de Quinase Dependente de Ciclina p21 , Ciclinas/metabolismo , Ensaios de Seleção de Medicamentos Antitumorais , Inibidores Enzimáticos/química , Inibidores Enzimáticos/farmacologia , Humanos , Ácidos Hidroxâmicos/química , Ácidos Hidroxâmicos/farmacologia , Hidroxilaminas/química , Hidroxilaminas/farmacologia , Modelos Moleculares , Regiões Promotoras Genéticas , Relação Estrutura-Atividade , Células Tumorais Cultivadas
17.
J Mol Graph Model ; 38: 360-2, 2012 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-23085175

RESUMO

Teach-Discover-Treat (TDT) is an initiative to promote the development and sharing of computational tools solicited through a competition with the aim to impact education and collaborative drug discovery for neglected diseases. Collaboration, multidisciplinary integration, and innovation are essential for successful drug discovery. This requires a workforce that is trained in state-of-the-art workflows and equipped with the ability to collaborate on platforms that are accessible and free. The TDT competition solicits high quality computational workflows for neglected disease targets, using freely available, open access tools.


Assuntos
Descoberta de Drogas/métodos , Comunicação Interdisciplinar , Simulação de Dinâmica Molecular , Software , Comportamento Cooperativo , Descoberta de Drogas/organização & administração , Humanos , Doenças Negligenciadas/tratamento farmacológico , Recursos Humanos
18.
Mol Inform ; 31(3-4): 231-45, 2012 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-27477094

RESUMO

A QSAR model for predicting passive permeability (Papp ) was derived from Papp values measured in the LLC-PK1 cell line. The QSAR method and descriptor set that performed best in terms of cross-validation was random forest with a combination of AP, DP, and MOE_2D descriptors. The QSAR model was used to predict the Caco-2 cell permeability for 313 compounds described in the literature with good success. We find that passive permeability for different cell lines can be predicted with similar molecular properties and descriptors. It is shown that the variation in experimental measurements of Papp is smaller than the error in QSAR predictions indicating that predictions are not quantitatively perfect, although qualitatively useful. We get better predictions if the training set is large and diverse, rather than smaller and more internally consistent. This is because prediction accuracy falls off quickly with decreasing similarity to the training set and it is therefore better to have as large a training set as possible. While single physical parameters are not as good as a full QSAR model in predicting Papp , logD seems the most important parameter. Intermediate values of logD are associated with higher Papp .

19.
Curr Top Med Chem ; 9(9): 844-53, 2009.
Artigo em Inglês | MEDLINE | ID: mdl-19754398

RESUMO

Steroid nuclear hormone binding receptors (SHRs) are ligand activated transcription factors involved in the regulation of target genes associated with key physiological and developmental processes. As such they are important targets for drug discovery. Crystal structures are now available for all members of the SHR family, however, earlier studies carried out using homology models proved to be quite valuable for understanding the binding of natural ligands and for designing novel therapeutic agents. The maleability of the binding pocket means that the crystal structure of an SHR in complex with one ligand may not suffice to explain the binding interactions of that same target with a different ligand. Consequently, induced fit docking and molecular dynamics are shown to be useful and necessary tools for understanding these receptors.


Assuntos
Descoberta de Drogas , Receptores Citoplasmáticos e Nucleares/química , Simulação por Computador , Ligantes , Simulação de Dinâmica Molecular , Estrutura Molecular , Ligação Proteica , Receptores Citoplasmáticos e Nucleares/efeitos dos fármacos , Relação Estrutura-Atividade
20.
J Chem Inf Model ; 49(8): 1974-85, 2009 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-19639957

RESUMO

We propose a direct QSAR methodology to predict how similar the inhibitor-binding profiles of two protein kinases are likely to be, based on the properties of the residues surrounding the ATP-binding site. We produce a random forest model for each of five data sets (one in-house, four from the literature) where multiple compounds are tested on many kinases. Each model is self-consistent by cross-validation, and all models point to only a few residues in the active site controlling the binding profiles. While all models include the "gatekeeper" as one of the important residues, consistent with previous literature, some models suggest other residues as being more important. We apply each model to predict the similarity in binding profile to all pairs in a set of 411 kinases from the human genome and get very different predictions from each model. This turns out not to be an issue with model-building but with the fact that the experimental data sets disagree about which kinases are similar to which others. It is possible to build a model combining all the data from the five data sets that is reasonably self-consistent but not surprisingly, given the disagreement between data sets, less self-consistent than the individual models.


Assuntos
Inibidores de Proteínas Quinases/metabolismo , Proteínas Quinases/metabolismo , Relação Quantitativa Estrutura-Atividade , Sítios de Ligação , Humanos , Modelos Moleculares , Ligação Proteica , Inibidores de Proteínas Quinases/química , Proteínas Quinases/química
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA