Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 10 de 10
Filtrar
1.
Bioinformatics ; 37(21): 3856-3864, 2021 11 05.
Artigo em Inglês | MEDLINE | ID: mdl-34213526

RESUMO

MOTIVATION: Clinical trials are the essential stage of every drug development program for the treatment to become available to patients. Despite the importance of well-structured clinical trial databases and their tremendous value for drug discovery and development such instances are very rare. Presently large-scale information on clinical trials is stored in clinical trial registers which are relatively structured, but the mappings to external databases of drugs and diseases are increasingly lacking. The precise production of such links would enable us to interrogate richer harmonized datasets for invaluable insights. RESULTS: We present a neural approach for medical concept normalization of diseases and drugs. Our two-stage approach is based on Bidirectional Encoder Representations from Transformers (BERT). In the training stage, we optimize the relative similarity of mentions and concept names from a terminology via triplet loss. In the inference stage, we obtain the closest concept name representation in a common embedding space to a given mention representation. We performed a set of experiments on a dataset of abstracts and a real-world dataset of trial records with interventions and conditions mapped to drug and disease terminologies. The latter includes mentions associated with one or more concepts (in-KB) or zero (out-of-KB, nil prediction). Experiments show that our approach significantly outperforms baseline and state-of-the-art architectures. Moreover, we demonstrate that our approach is effective in knowledge transfer from the scientific literature to clinical trial data. AVAILABILITY AND IMPLEMENTATION: We make code and data freely available at https://github.com/insilicomedicine/DILBERT.


Assuntos
Desenvolvimento de Medicamentos , Publicações , Humanos , Descoberta de Drogas
2.
Phys Chem Chem Phys ; 24(42): 25853-25863, 2022 Nov 02.
Artigo em Inglês | MEDLINE | ID: mdl-36279016

RESUMO

Electronic wave function calculation is a fundamental task of computational quantum chemistry. Knowledge of the wave function parameters allows one to compute physical and chemical properties of molecules and materials. Unfortunately, it is infeasible to compute the wave functions analytically even for simple molecules. Classical quantum chemistry approaches such as the Hartree-Fock method or density functional theory (DFT) allow to compute an approximation of the wave function but are very computationally expensive. One way to lower the computational complexity is to use machine learning models that can provide sufficiently good approximations at a much lower computational cost. In this work we: (1) introduce a new curated large-scale dataset of electron structures of drug-like molecules, (2) establish a novel benchmark for the estimation of molecular properties in the multi-molecule setting, and (3) evaluate a wide range of methods with this benchmark. We show that the accuracy of recently developed machine learning models deteriorates significantly when switching from the single-molecule to the multi-molecule setting. We also show that these models lack generalization over different chemistry classes. In addition, we provide experimental evidence that larger datasets lead to better ML models in the field of quantum chemistry.

3.
Mol Pharm ; 15(10): 4378-4385, 2018 10 01.
Artigo em Inglês | MEDLINE | ID: mdl-29473756

RESUMO

Convolutional neural networks (CNN) have been successfully used to handle three-dimensional data and are a natural match for data with spatial structure such as 3D molecular structures. However, a direct 3D representation of a molecule with atoms localized at voxels is too sparse, which leads to poor performance of the CNNs. In this work, we present a novel approach where atoms are extended to fill other nearby voxels with a transformation based on the wave transform. Experimenting on 4.5 million molecules from the Zinc database, we show that our proposed representation leads to better performance of CNN-based autoencoders than either the voxel-based representation or the previously used Gaussian blur of atoms and then successfully apply the new representation to classification tasks such as MACCS fingerprint prediction.


Assuntos
Redes Neurais de Computação , Animais , Bases de Dados Factuais , Humanos
4.
Mol Pharm ; 15(10): 4398-4405, 2018 10 01.
Artigo em Inglês | MEDLINE | ID: mdl-30180591

RESUMO

Modern computational approaches and machine learning techniques accelerate the invention of new drugs. Generative models can discover novel molecular structures within hours, while conventional drug discovery pipelines require months of work. In this article, we propose a new generative architecture, entangled conditional adversarial autoencoder, that generates molecular structures based on various properties, such as activity against a specific protein, solubility, or ease of synthesis. We apply the proposed model to generate a novel inhibitor of Janus kinase 3, implicated in rheumatoid arthritis, psoriasis, and vitiligo. The discovered molecule was tested in vitro and showed good activity and selectivity.


Assuntos
Descoberta de Drogas/métodos , Aprendizado de Máquina , Animais , Humanos , Janus Quinase 3/metabolismo , Redes Neurais de Computação
5.
Mol Pharm ; 14(9): 3098-3104, 2017 09 05.
Artigo em Inglês | MEDLINE | ID: mdl-28703000

RESUMO

Deep generative adversarial networks (GANs) are the emerging technology in drug discovery and biomarker development. In our recent work, we demonstrated a proof-of-concept of implementing deep generative adversarial autoencoder (AAE) to identify new molecular fingerprints with predefined anticancer properties. Another popular generative model is the variational autoencoder (VAE), which is based on deep neural architectures. In this work, we developed an advanced AAE model for molecular feature extraction problems, and demonstrated its advantages compared to VAE in terms of (a) adjustability in generating molecular fingerprints; (b) capacity of processing very large molecular data sets; and (c) efficiency in unsupervised pretraining for regression model. Our results suggest that the proposed AAE model significantly enhances the capacity and efficiency of development of the new molecules with specific anticancer properties using the deep generative models.


Assuntos
Modelos Teóricos , Inteligência Artificial , Simulação por Computador , Formação de Conceito , Aprendizagem , Redes Neurais de Computação
6.
Sci Rep ; 14(1): 20612, 2024 Sep 04.
Artigo em Inglês | MEDLINE | ID: mdl-39232054

RESUMO

FeRh-based alloys have attracted significant attention due to their magnetic phase transition and significant magnetocaloric effects. These properties position them as promising candidates for fundamental research and practical applications, including magnetic cooling and targeted drug delivery. The study of FeRh alloys, particularly those where Rhodium or Iron atoms are substituted with other transition metals, is crucial as certain substitutions preserve the alloy's magnetocaloric properties. However, even within a specific structural type and without considering competing phases, determining which atom (Fe or Rh) is replaced upon introducing a third element remains unclear. This paper addresses this ambiguity through ab initio calculations. We propose an approach to predict whether a dopant will replace Fe or Rh, offering insights into the electronic and structural factors influencing the substitution. Additionally, we present a dataset of ab initio calculations on doped FeRh alloys, which will support future data-driven modeling efforts. Our findings not only advance the understanding of FeRh-based alloys but also contribute to the design of novel materials for experimental and industrial applications.

7.
Front Pharmacol ; 11: 269, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-32362822

RESUMO

Gene expression profiles are useful for assessing the efficacy and side effects of drugs. In this paper, we propose a new generative model that infers drug molecules that could induce a desired change in gene expression. Our model-the Bidirectional Adversarial Autoencoder-explicitly separates cellular processes captured in gene expression changes into two feature sets: those related and unrelated to the drug incubation. The model uses related features to produce a drug hypothesis. We have validated our model on the LINCS L1000 dataset by generating molecular structures in the SMILES format for the desired transcriptional response. In the experiments, we have shown that the proposed model can generate novel molecular structures that could induce a given gene expression change or predict a gene expression difference after incubation of a given molecular structure. The code of the model is available at https://github.com/insilicomedicine/BiAAE.

9.
Front Pharmacol ; 11: 565644, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-33390943

RESUMO

Generative models are becoming a tool of choice for exploring the molecular space. These models learn on a large training dataset and produce novel molecular structures with similar properties. Generated structures can be utilized for virtual screening or training semi-supervized predictive models in the downstream tasks. While there are plenty of generative models, it is unclear how to compare and rank them. In this work, we introduce a benchmarking platform called Molecular Sets (MOSES) to standardize training and comparison of molecular generative models. MOSES provides training and testing datasets, and a set of metrics to evaluate the quality and diversity of generated structures. We have implemented and compared several molecular generation models and suggest to use our results as reference points for further advancements in generative chemistry research. The platform and source code are available at https://github.com/molecularsets/moses.

10.
Oncotarget ; 8(7): 10883-10890, 2017 Feb 14.
Artigo em Inglês | MEDLINE | ID: mdl-28029644

RESUMO

Recent advances in deep learning and specifically in generative adversarial networks have demonstrated surprising results in generating new images and videos upon request even using natural language as input. In this paper we present the first application of generative adversarial autoencoders (AAE) for generating novel molecular fingerprints with a defined set of parameters. We developed a 7-layer AAE architecture with the latent middle layer serving as a discriminator. As an input and output the AAE uses a vector of binary fingerprints and concentration of the molecule. In the latent layer we also introduced a neuron responsible for growth inhibition percentage, which when negative indicates the reduction in the number of tumor cells after the treatment. To train the AAE we used the NCI-60 cell line assay data for 6252 compounds profiled on MCF-7 cell line. The output of the AAE was used to screen 72 million compounds in PubChem and select candidate molecules with potential anti-cancer properties. This approach is a proof of concept of an artificially-intelligent drug discovery engine, where AAEs are used to generate new molecular fingerprints with the desired molecular properties.


Assuntos
Ensaios de Seleção de Medicamentos Antitumorais/métodos , Ensaios de Triagem em Larga Escala/métodos , Aprendizado de Máquina , Redes Neurais de Computação , Antineoplásicos/farmacologia , Antineoplásicos/uso terapêutico , Linhagem Celular Tumoral , Tratamento Farmacológico/métodos , Humanos , Células K562 , Células MCF-7 , Reprodutibilidade dos Testes
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA