Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 4 de 4
Filtrar
Mais filtros

Base de dados
Ano de publicação
Tipo de documento
Intervalo de ano de publicação
1.
J Am Chem Soc ; 2024 Sep 25.
Artigo em Inglês | MEDLINE | ID: mdl-39322561

RESUMO

The function of a protein is predicated upon its three-dimensional fold. Representing its complex structure as a series of repeating secondary structural elements is one of the most useful ways by which we study, characterize, and visualize a protein. Consequently, experimental methods that quantify the secondary structure content allow us to connect a protein's structure to its function. Here, we introduce an automated gradient descent-based method we refer to as secondary-structure distribution by NMR that allows for rapid quantification of the protein secondary structure composition of a protein from a single, 1D 13C NMR spectrum without chemical shift assignments. The analysis of nearly 900 proteins with known structure and chemical shifts demonstrates the capabilities of our approach. We show that these results rival alternative techniques such as FT-IR and circular dichroism that are commonly used to estimate secondary structure compositions. The resulting method requires only the primary sequence of the protein and its referenced 13C NMR spectrum. Each residue is modeled in an ensemble of secondary structures with percentage contributions from random coil, α-helix, and ß-sheet secondary structures obtained by minimizing the difference between a simulated and experimental 1D 13C NMR spectrum. The capabilities of the method are demonstrated as applied to samples at natural abundance or enriched in 13C, acquired by either solution or solid-state NMR, and even on low magnetic field benchtop NMR spectrometers. This approach allows for rapid characterization of protein secondary structure across traditionally challenging to characterize states including liquid-liquid phase-separated, membrane-bound, or aggregated states.

2.
Nat Commun ; 15(1): 7818, 2024 Sep 06.
Artigo em Inglês | MEDLINE | ID: mdl-39251606

RESUMO

Retrosynthesis, the strategy of devising laboratory pathways by working backwards from the target compound, is crucial yet challenging. Enhancing retrosynthetic efficiency requires overcoming the vast complexity of chemical space, the limited known interconversions between molecules, and the challenges posed by limited experimental datasets. This study introduces generative machine learning methods for retrosynthetic planning. The approach features three innovations: generating reaction templates instead of reactants or synthons to create novel chemical transformations, allowing user selection of specific bonds to change for human-influenced synthesis, and employing a conditional kernel-elastic autoencoder (CKAE) to measure the similarity between generated and known reactions for chemical viability insights. These features form a coherent retrosynthetic framework, validated experimentally by designing a 3-step synthetic pathway for a challenging small molecule, demonstrating a significant improvement over previous 5-9 step approaches. This work highlights the utility and robustness of generative machine learning in addressing complex challenges in chemical synthesis.

3.
PNAS Nexus ; 3(4): pgae168, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38689710

RESUMO

We introduce the kernel-elastic autoencoder (KAE), a self-supervised generative model based on the transformer architecture with enhanced performance for molecular design. KAE employs two innovative loss functions: modified maximum mean discrepancy (m-MMD) and weighted reconstruction (LWCEL). The m-MMD loss has significantly improved the generative performance of KAE when compared to using the traditional Kullback-Leibler loss of VAE, or standard maximum mean discrepancy. Including the weighted reconstruction loss LWCEL, KAE achieves valid generation and accurate reconstruction at the same time, allowing for generative behavior that is intermediate between VAE and autoencoder not available in existing generative approaches. Further advancements in KAE include its integration with conditional generation, setting a new state-of-the-art benchmark in constrained optimizations. Moreover, KAE has demonstrated its capability to generate molecules with favorable binding affinities in docking applications, as evidenced by AutoDock Vina and Glide scores, outperforming all existing candidates from the training dataset. Beyond molecular design, KAE holds promise to solve problems by generation across a broad spectrum of applications.

4.
ACS Cent Sci ; 9(9): 1768-1774, 2023 Sep 27.
Artigo em Inglês | MEDLINE | ID: mdl-37780365

RESUMO

Density functional theory (DFT) is a powerful tool to model transition state (TS) energies to predict selectivity in chemical synthesis. However, a successful multistep synthesis campaign must navigate energetically narrow differences in pathways that create some limits to rapid and unambiguous application of DFT to these problems. While powerful data science techniques may provide a complementary approach to overcome this problem, doing so with the relatively small data sets that are widespread in organic synthesis presents a significant challenge. Herein, we show that a small data set can be labeled with features from DFT TS calculations to train a feed-forward neural network for predicting enantioselectivity of a Negishi cross-coupling reaction with P-chiral hindered phosphines. This approach to modeling enantioselectivity is compared with conventional approaches, including exclusive use of DFT energies and data science approaches, using features from ligands or ground states with neural network architectures.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA