ABSTRACT
SARS-CoV-2 has intricate mechanisms for initiating infection, immune evasion/suppression, and replication, which depend on the structure and dynamics of its constituent proteins. Many protein structures have been solved, but far less is known about their relevant conformational changes. To address this challenge, over a million citizen scientists banded together through the Folding@home distributed computing project to create the first exascale computer and simulate an unprecedented 0.1 seconds of the viral proteome. Our simulations capture dramatic opening of the apo Spike complex, far beyond that seen experimentally, which explains and successfully predicts the existence of cryptic epitopes. Different Spike homologues modulate the probabilities of open versus closed structures, balancing receptor binding and immune evasion. We also observe dramatic conformational changes across the proteome, which reveal over 50 cryptic pockets that expand targeting options for the design of antivirals. All data and models are freely available online, providing a quantitative structural atlas.
ABSTRACT
The COVID-19 pandemic is a stark reminder that a barren global antiviral pipeline has grave humanitarian consequences. Future pandemics could be prevented by accessible, easily deployable broad-spectrum oral antivirals and open knowledge bases that derisk and accelerate novel antiviral discovery and development. Here, we report the results of the COVID Moonshot, a fully open-science structure-enabled drug discovery campaign targeting the SARS-CoV-2 main protease. We discovered a novel chemical scaffold that is differentiated from current clinical candidates in terms of toxicity, resistance, and pharmacokinetics liabilities, and developed it into noncovalent orally-bioavailable nanomolar inhibitors with clinical potential. Our approach leveraged crowdsourcing, high-throughput structural biology, machine learning, and exascale molecular simulations. In the process, we generated a detailed map of the structural plasticity of the main protease, extensive structure-activity relationships for multiple chemotypes, and a wealth of biochemical activity data. In a first for a structure-based drug discovery campaign, all compound designs (>18,000 designs), crystallographic data (>500 ligand-bound X-ray structures), assay data (>10,000 measurements), and synthesized molecules (>2,400 compounds) for this campaign were shared rapidly and openly, creating a rich open and IP-free knowledgebase for future anti-coronavirus drug discovery.