Search | VHL Regional Portal

Protein Condensate Atlas from predictive models of heteromolecular condensate composition.

Saar, Kadi L; Scrutton, Rob M; Bloznelyte, Kotryna; Morgunov, Alexey S; Good, Lydia L; Lee, Alpha A; Teichmann, Sarah A; Knowles, Tuomas P J.

Nat Commun ; 15(1): 5418, 2024 Jul 10.

Article in English | MEDLINE | ID: mdl-38987300

ABSTRACT

Biomolecular condensates help cells organise their content in space and time. Cells harbour a variety of condensate types with diverse composition, and many are likely yet to be discovered. Here, we develop a methodology to predict the composition of biomolecular condensates. We first analyse available proteomics data of cellular condensates and find that the biophysical features that determine protein localisation into condensates differ from known drivers of homotypic phase separation processes, with charge-mediated protein-RNA and hydrophobicity-mediated protein-protein interactions playing a key role in the former process. We then develop a machine learning model that links protein sequence to its propensity to localise into heteromolecular condensates. We apply the model across the proteome and find that many of the top-ranked targets outside the original training data localise into condensates, as confirmed by orthogonal immunohistochemical staining imaging. Finally, we segment the condensation-prone proteome into condensate types based on an overlap with biomolecular interaction profiles to generate a Protein Condensate Atlas. Several condensate clusters within the Atlas closely match the composition of experimentally characterised condensates or regions within them, suggesting that the Atlas can be valuable for identifying additional components within known condensate systems and discovering previously uncharacterised condensates.

Subject(s)

Biomolecular Condensates, Machine Learning, Proteome, Proteomics, Humans, Proteomics/methods, Biomolecular Condensates/metabolism, Biomolecular Condensates/chemistry, Proteome/metabolism, Hydrophobic and Hydrophilic Interactions

Deconvoluting low yield from weak potency in direct-to-biology workflows with machine learning.

McCorkindale, William; Filep, Mihajlo; London, Nir; Lee, Alpha A; King-Smith, Emma.

RSC Med Chem ; 15(3): 1015-1021, 2024 Mar 20.

Article in English | MEDLINE | ID: mdl-38516605

ABSTRACT

High-throughput and rapid biological evaluation of small molecules is an essential factor in drug discovery and development. Direct-to-biology (D2B), whereby compound purification is forgone, has emerged as a viable technique for time-efficient screening, specifically for PROTAC design and biological evaluation. However, one notable limitation is the prerequisite of high-yielding reactions to ensure the desired compound is indeed the compound responsible for biological activity. Herein, we report a machine-learning-based yield-assay deconfounder capable of deconvoluting low yield from low potency to identify false negatives. We validated this approach by identifying promising SARS-CoV-2 main protease inhibitors with nanomolar activity that rivaled the potency observed from the standard D2B workflow. Furthermore, we show how our framework can be utilized in a broad, in silico screen to produce compounds of similar potency to those from a D2B assay.

Probing the chemical 'reactome' with high-throughput experimentation data.

King-Smith, Emma; Berritt, Simon; Bernier, Louise; Hou, Xinjun; Klug-McLeod, Jacquelyn L; Mustakis, Jason; Sach, Neal W; Tucker, Joseph W; Yang, Qingyi; Howard, Roger M; Lee, Alpha A.

Nat Chem ; 16(4): 633-643, 2024 Apr.

Article in English | MEDLINE | ID: mdl-38168924

ABSTRACT

High-throughput experimentation (HTE) has the potential to improve our understanding of organic chemistry by systematically interrogating reactivity across diverse chemical spaces. Notable bottlenecks include the scarcity of publicly available large-scale datasets and the need for facile interpretation of the chemical insights hidden in these data. Here we report the development of a high-throughput experimentation analyser, a robust and statistically rigorous framework that is applicable to any HTE dataset regardless of size, scope or target reaction outcome and that yields interpretable correlations between starting material(s), reagents and outcomes. We improve the HTE data landscape with the disclosure of 39,000+ previously proprietary HTE reactions that cover a breadth of chemistry, including cross-coupling reactions and chiral salt resolutions. The high-throughput experimentation analyser was validated on cross-coupling and hydrogenation datasets, showcasing the elucidation of statistically significant hidden relationships between reaction components and outcomes, as well as highlighting areas of dataset bias and the specific reaction spaces that necessitate further investigation.

Predictive Minisci late stage functionalization with transfer learning.

King-Smith, Emma; Faber, Felix A; Reilly, Usa; Sinitskiy, Anton V; Yang, Qingyi; Liu, Bo; Hyek, Dennis; Lee, Alpha A.

Nat Commun ; 15(1): 426, 2024 Jan 15.

Article in English | MEDLINE | ID: mdl-38225239

ABSTRACT

Structural diversification of lead molecules is a key component of drug discovery to explore chemical space. Late-stage functionalizations (LSFs) are versatile methodologies capable of installing functional handles on richly decorated intermediates to deliver numerous diverse products in a single reaction. Predicting the regioselectivity of LSF is still an open challenge in the field. Numerous efforts from chemoinformatics and machine learning (ML) groups have made strides in this area. However, it is arduous to isolate and characterize the multitude of LSF products generated, limiting available data and hindering pure ML approaches. We report the development of an approach that combines a message passing neural network and 13C NMR-based transfer learning to predict the atom-wise probabilities of functionalization for Minisci and P450-based functionalizations. We validated our model both retrospectively and with a series of prospective experiments, showing that it accurately predicts the outcomes of Minisci-type and P450 transformations and outperforms the well-established Fukui-based reactivity indices and other machine learning reactivity-based algorithms.

Subject(s)

Drug Discovery, Neural Networks, Computer, Prospective Studies, Retrospective Studies, Drug Discovery/methods, Machine Learning
