Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 37
Filtrar
1.
J Chem Inf Model ; 63(18): 5834-5846, 2023 09 25.
Artigo em Inglês | MEDLINE | ID: mdl-37661856

RESUMO

Recent advances in cryo-electron microscopy (cryo-EM) have enabled modeling macromolecular complexes that are essential components of the cellular machinery. The density maps derived from cryo-EM experiments are often integrated with manual, knowledge-driven or artificial intelligence-driven and physics-guided computational methods to build, fit, and refine molecular structures. Going beyond a single stationary-structure determination scheme, it is becoming more common to interpret the experimental data with an ensemble of models that contributes to an average observation. Hence, there is a need to decide on the quality of an ensemble of protein structures on-the-fly while refining them against the density maps. We introduce such an adaptive decision-making scheme during the molecular dynamics flexible fitting (MDFF) of biomolecules. Using RADICAL-Cybertools, the new RADICAL augmented MDFF implementation (R-MDFF) is examined in high-performance computing environments for refinement of two prototypical protein systems, adenylate kinase and carbon monoxide dehydrogenase. For these test cases, use of multiple replicas in flexible fitting with adaptive decision making in R-MDFF improves the overall correlation to the density by 40% relative to the refinements of the brute-force MDFF. The improvements are particularly significant at high, 2-3 Å map resolutions. More importantly, the ensemble model captures key features of biologically relevant molecular dynamics that are inaccessible to a single-model interpretation. Finally, the pipeline is applicable to systems of growing sizes, which is demonstrated using ensemble refinement of capsid proteins from the chimpanzee adenovirus. The overhead for decision making remains low and robust to computing environments. The software is publicly available on GitHub and includes a short user guide to install R-MDFF on different computing environments, from local Linux-based workstations to high-performance computing environments.


Assuntos
Inteligência Artificial , Simulação de Dinâmica Molecular , Microscopia Crioeletrônica , Microscopia Eletrônica , Adenilato Quinase
2.
Int J High Perform Comput Appl ; 37(1): 28-44, 2023 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-36647365

RESUMO

We seek to completely revise current models of airborne transmission of respiratory viruses by providing never-before-seen atomic-level views of the SARS-CoV-2 virus within a respiratory aerosol. Our work dramatically extends the capabilities of multiscale computational microscopy to address the significant gaps that exist in current experimental methods, which are limited in their ability to interrogate aerosols at the atomic/molecular level and thus obscure our understanding of airborne transmission. We demonstrate how our integrated data-driven platform provides a new way of exploring the composition, structure, and dynamics of aerosols and aerosolized viruses, while driving simulation method development along several important axes. We present a series of initial scientific discoveries for the SARS-CoV-2 Delta variant, noting that the full scientific impact of this work has yet to be realized.

3.
J Chem Inf Model ; 62(1): 116-128, 2022 01 10.
Artigo em Inglês | MEDLINE | ID: mdl-34793155

RESUMO

Despite the recent availability of vaccines against the acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the search for inhibitory therapeutic agents has assumed importance especially in the context of emerging new viral variants. In this paper, we describe the discovery of a novel noncovalent small-molecule inhibitor, MCULE-5948770040, that binds to and inhibits the SARS-Cov-2 main protease (Mpro) by employing a scalable high-throughput virtual screening (HTVS) framework and a targeted compound library of over 6.5 million molecules that could be readily ordered and purchased. Our HTVS framework leverages the U.S. supercomputing infrastructure achieving nearly 91% resource utilization and nearly 126 million docking calculations per hour. Downstream biochemical assays validate this Mpro inhibitor with an inhibition constant (Ki) of 2.9 µM (95% CI 2.2, 4.0). Furthermore, using room-temperature X-ray crystallography, we show that MCULE-5948770040 binds to a cleft in the primary binding site of Mpro forming stable hydrogen bond and hydrophobic interactions. We then used multiple µs-time scale molecular dynamics (MD) simulations and machine learning (ML) techniques to elucidate how the bound ligand alters the conformational states accessed by Mpro, involving motions both proximal and distal to the binding site. Together, our results demonstrate how MCULE-5948770040 inhibits Mpro and offers a springboard for further therapeutic design.


Assuntos
COVID-19 , Inibidores de Proteases , Antivirais , Proteases 3C de Coronavírus , Humanos , Simulação de Acoplamento Molecular , Simulação de Dinâmica Molecular , Ácido Orótico/análogos & derivados , Piperazinas , SARS-CoV-2
4.
Int J High Perform Comput Appl ; 35(5): 432-451, 2021 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-38603008

RESUMO

We develop a generalizable AI-driven workflow that leverages heterogeneous HPC resources to explore the time-dependent dynamics of molecular systems. We use this workflow to investigate the mechanisms of infectivity of the SARS-CoV-2 spike protein, the main viral infection machinery. Our workflow enables more efficient investigation of spike dynamics in a variety of complex environments, including within a complete SARS-CoV-2 viral envelope simulation, which contains 305 million atoms and shows strong scaling on ORNL Summit using NAMD. We present several novel scientific discoveries, including the elucidation of the spike's full glycan shield, the role of spike glycans in modulating the infectivity of the virus, and the characterization of the flexible interactions between the spike and the human ACE2 receptor. We also demonstrate how AI can accelerate conformational sampling across different systems and pave the way for the future application of such methods to additional studies in SARS-CoV-2 and other molecular systems.

5.
BMC Bioinformatics ; 19(Suppl 18): 482, 2018 Dec 21.
Artigo em Inglês | MEDLINE | ID: mdl-30577753

RESUMO

BACKGROUND: Resistance to chemotherapy and molecularly targeted therapies is a major factor in limiting the effectiveness of cancer treatment. In many cases, resistance can be linked to genetic changes in target proteins, either pre-existing or evolutionarily selected during treatment. Key to overcoming this challenge is an understanding of the molecular determinants of drug binding. Using multi-stage pipelines of molecular simulations we can gain insights into the binding free energy and the residence time of a ligand, which can inform both stratified and personal treatment regimes and drug development. To support the scalable, adaptive and automated calculation of the binding free energy on high-performance computing resources, we introduce the High-throughput Binding Affinity Calculator (HTBAC). HTBAC uses a building block approach in order to attain both workflow flexibility and performance. RESULTS: We demonstrate close to perfect weak scaling to hundreds of concurrent multi-stage binding affinity calculation pipelines. This permits a rapid time-to-solution that is essentially invariant of the calculation protocol, size of candidate ligands and number of ensemble simulations. CONCLUSIONS: As such, HTBAC advances the state of the art of binding affinity calculations and protocols. HTBAC provides the platform to enable scientists to study a wide range of cancer drugs and candidate ligands in order to support personalized clinical decision making based on genome sequencing and drug discovery.


Assuntos
Ensaios de Triagem em Larga Escala/métodos , Ligação Proteica/fisiologia , Humanos
6.
J Chem Phys ; 149(18): 180901, 2018 Nov 14.
Artigo em Inglês | MEDLINE | ID: mdl-30441927

RESUMO

The field of computational molecular sciences (CMSs) has made innumerable contributions to the understanding of the molecular phenomena that underlie and control chemical processes, which is manifested in a large number of community software projects and codes. The CMS community is now poised to take the next transformative steps of better training in modern software design and engineering methods and tools, increasing interoperability through more systematic adoption of agreed upon standards and accepted best-practices, overcoming unnecessary redundancy in software effort along with greater reproducibility, and increasing the deployment of new software onto hardware platforms from in-house clusters to mid-range computing systems through to modern supercomputers. This in turn will have future impact on the software that will be created to address grand challenge science that we illustrate here: the formulation of diverse catalysts, descriptions of long-range charge and excitation transfer, and development of structural ensembles for intrinsically disordered proteins.

7.
J Comput Chem ; 35(9): 692-702, 2014 Apr 05.
Artigo em Inglês | MEDLINE | ID: mdl-24403093

RESUMO

The translocation of nucleotide molecules across biological and synthetic nanopores has attracted attention as a next generation technique for sequencing DNA. Computer simulations have the ability to provide atomistic-level insight into important states and processes, delivering a means to develop a fundamental understanding of the translocation event, for example, by extracting the free energy of the process. Even with current supercomputing facilities, the simulation of many-atom systems in fine detail is limited to shorter timescales than the real events they attempt to recreate. This imposes the need for enhanced simulation techniques that expand the scope of investigation in a given timeframe. There are numerous free energy calculation and translocation methodologies available, and it is by no means clear which method is best applied to a particular problem. This article explores the use of two popular free energy calculation methodologies in a nucleotide-nanopore translocation system, using the α-hemolysin nanopore. The first uses constant velocity-steered molecular dynamics (cv-SMD) in conjunction with Jarzynski's equality. The second applies an adaptive biasing force (ABF), which has not previously been applied to the nucleotide-nanpore system. The purpose of this study is to provide a comprehensive comparison of these methodologies, allowing for a detailed comparative assessment of the scientific merits, the computational cost, and the statistical quality of the data obtained from each technique. We find that the ABF method produces results that are closer to experimental measurements than those from cv-SMD, whereas the net errors are smaller for the same computational cost.


Assuntos
Nucleotídeos/metabolismo , Transporte Biológico , Simulação de Dinâmica Molecular
8.
PLoS Comput Biol ; 9(5): e1003069, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-23704854

RESUMO

Riboswitches sense cellular concentrations of small molecules and use this information to adjust synthesis rates of related metabolites. Riboswitches include an aptamer domain to detect the ligand and an expression platform to control gene expression. Previous structural studies of riboswitches largely focused on aptamers, truncating the expression domain to suppress conformational switching. To link ligand/aptamer binding to conformational switching, we constructed models of an S-adenosyl methionine (SAM)-I riboswitch RNA segment incorporating elements of the expression platform, allowing formation of an antiterminator (AT) helix. Using Anton, a computer specially developed for long timescale Molecular Dynamics (MD), we simulated an extended (three microseconds) MD trajectory with SAM bound to a modeled riboswitch RNA segment. Remarkably, we observed a strand migration, converting three base pairs from an antiterminator (AT) helix, characteristic of the transcription ON state, to a P1 helix, characteristic of the OFF state. This conformational switching towards the OFF state is observed only in the presence of SAM. Among seven extended trajectories with three starting structures, the presence of SAM enhances the trend towards the OFF state for two out of three starting structures tested. Our simulation provides a visual demonstration of how a small molecule (<500 MW) binding to a limited surface can trigger a large scale conformational rearrangement in a 40 kDa RNA by perturbing the Free Energy Landscape. Such a mechanism can explain minimal requirements for SAM binding and transcription termination for SAM-I riboswitches previously reported experimentally.


Assuntos
Biologia Computacional/métodos , Modelos Genéticos , Riboswitch/genética , S-Adenosilmetionina/metabolismo , Análise por Conglomerados , Ligação de Hidrogênio , Ligantes , Simulação de Dinâmica Molecular , Conformação de Ácido Nucleico , RNA Bacteriano/química , RNA Bacteriano/genética , RNA Bacteriano/metabolismo , S-Adenosilmetionina/química , S-Adenosilmetionina/genética
9.
Front Bioinform ; 4: 1280971, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38812660

RESUMO

Radiation exposure poses a significant threat to human health. Emerging research indicates that even low-dose radiation once believed to be safe, may have harmful effects. This perception has spurred a growing interest in investigating the potential risks associated with low-dose radiation exposure across various scenarios. To comprehensively explore the health consequences of low-dose radiation, our study employs a robust statistical framework that examines whether specific groups of genes, belonging to known pathways, exhibit coordinated expression patterns that align with the radiation levels. Notably, our findings reveal the existence of intricate yet consistent signatures that reflect the molecular response to radiation exposure, distinguishing between low-dose and high-dose radiation. Moreover, we leverage a pathway-constrained variational autoencoder to capture the nonlinear interactions within gene expression data. By comparing these two analytical approaches, our study aims to gain valuable insights into the impact of low-dose radiation on gene expression patterns, identify pathways that are differentially affected, and harness the potential of machine learning to uncover hidden activity within biological networks. This comparative analysis contributes to a deeper understanding of the molecular consequences of low-dose radiation exposure.

10.
Nat Commun ; 15(1): 352, 2024 Jan 08.
Artigo em Inglês | MEDLINE | ID: mdl-38191557

RESUMO

Heterogeneous response to Enzalutamide, a second-generation androgen receptor signaling inhibitor, is a central problem in castration-resistant prostate cancer (CRPC) management. Genome-wide systems investigation of mechanisms that govern Enzalutamide resistance promise to elucidate markers of heterogeneous treatment response and salvage therapies for CRPC patients. Focusing on the de novo role of MYC as a marker of Enzalutamide resistance, here we reconstruct a CRPC-specific mechanism-centric regulatory network, connecting molecular pathways with their upstream transcriptional regulatory programs. Mining this network with signatures of Enzalutamide response identifies NME2 as an upstream regulatory partner of MYC in CRPC and demonstrates that NME2-MYC increased activities can predict patients at risk of resistance to Enzalutamide, independent of co-variates. Furthermore, our experimental investigations demonstrate that targeting MYC and its partner NME2 is beneficial in Enzalutamide-resistant conditions and could provide an effective strategy for patients at risk of Enzalutamide resistance and/or for patients who failed Enzalutamide treatment.


Assuntos
Resistencia a Medicamentos Antineoplásicos , Neoplasias de Próstata Resistentes à Castração , Humanos , Masculino , Antagonistas de Receptores de Andrógenos , Benzamidas , Nucleosídeo NM23 Difosfato Quinases , Neoplasias de Próstata Resistentes à Castração/tratamento farmacológico , Neoplasias de Próstata Resistentes à Castração/genética , Transdução de Sinais
11.
Patterns (N Y) ; 4(11): 100875, 2023 Nov 10.
Artigo em Inglês | MEDLINE | ID: mdl-38035191

RESUMO

The need for efficient computational screening of molecular candidates that possess desired properties frequently arises in various scientific and engineering problems, including drug discovery and materials design. However, the enormous search space containing the candidates and the substantial computational cost of high-fidelity property prediction models make screening practically challenging. In this work, we propose a general framework for constructing and optimizing a high-throughput virtual screening (HTVS) pipeline that consists of multi-fidelity models. The central idea is to optimally allocate the computational resources to models with varying costs and accuracy to optimize the return on computational investment. Based on both simulated and real-world data, we demonstrate that the proposed optimal HTVS framework can significantly accelerate virtual screening without any degradation in terms of accuracy. Furthermore, it enables an adaptive operational strategy for HTVS, where one can trade accuracy for efficiency.

12.
Sci Rep ; 13(1): 2105, 2023 02 06.
Artigo em Inglês | MEDLINE | ID: mdl-36747041

RESUMO

Protein-ligand docking is a computational method for identifying drug leads. The method is capable of narrowing a vast library of compounds down to a tractable size for downstream simulation or experimental testing and is widely used in drug discovery. While there has been progress in accelerating scoring of compounds with artificial intelligence, few works have bridged these successes back to the virtual screening community in terms of utility and forward-looking development. We demonstrate the power of high-speed ML models by scoring 1 billion molecules in under a day (50 k predictions per GPU seconds). We showcase a workflow for docking utilizing surrogate AI-based models as a pre-filter to a standard docking workflow. Our workflow is ten times faster at screening a library of compounds than the standard technique, with an error rate less than 0.01% of detecting the underlying best scoring 0.1% of compounds. Our analysis of the speedup explains that another order of magnitude speedup must come from model accuracy rather than computing speed. In order to drive another order of magnitude of acceleration, we share a benchmark dataset consisting of 200 million 3D complex structures and 2D structure scores across a consistent set of 13 million "in-stock" molecules over 15 receptors, or binding sites, across the SARS-CoV-2 proteome. We believe this is strong evidence for the community to begin focusing on improving the accuracy of surrogate models to improve the ability to screen massive compound libraries 100 × or even 1000 × faster than current techniques and reduce missing top hits. The technique outlined aims to be a fast drop-in replacement for docking for screening billion-scale molecular libraries.


Assuntos
COVID-19 , SARS-CoV-2 , Humanos , SARS-CoV-2/metabolismo , Inteligência Artificial , Simulação de Acoplamento Molecular , Ligantes , Proteínas/metabolismo
13.
Biochemistry ; 51(33): 6487-9, 2012 Aug 21.
Artigo em Inglês | MEDLINE | ID: mdl-22866962

RESUMO

The development of resistance to different drugs remains a major problem for a wide range of infections. In particular, combinations of specific mutations, which individually demonstrate no effect, exhibit significant cooperativity. Here we show that changes to the energy of ligand binding in different resistant HIV-1 proteases are correlated with the creation of water binding sites in the active site. This correlation is conserved across two drugs (ritonavir and lopinavir). We propose that individual mutations induce changes in flap packing that are insufficient to allow water binding but in combination allow access, leading to the observed cooperative resistance.


Assuntos
Domínio Catalítico/genética , Inibidores da Protease de HIV/química , Protease de HIV/química , Lopinavir/farmacologia , Ritonavir/farmacologia , Água/farmacologia , Farmacorresistência Viral/genética , Protease de HIV/genética , Mutação , Termodinâmica
14.
Data Brief ; 40: 107824, 2022 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-35141367

RESUMO

This new dataset is an ensemble of solar photovoltaic energy production simulations over the continental US. The simulations are carried out in three steps. First, a weather forecast system is used for the predictions of incoming insolation; then, forecast ensembles with 21 members are generated using the Analog Ensemble technique; finally, each ensemble member is used to simulate 13 different solar panels. In total, there are 21 × 13 = 273 simulated scenarios. Simulations are carried out for the entire year 2019, with a temporal resolution of one hour, and a spatial resolution of 12 km. The data provide a high spatio-temporal analysis of the power production under different weather and engineering scenarios. The size of the entire dataset is about 1 TB but can be openly accessed by days and scenarios. Details on how to access and use such a dataset are provided in this article.

15.
Nucleic Acids Res ; 37(19): 6528-39, 2009 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-19720737

RESUMO

The S-adenosylmethionine-1 (SAM-I) riboswitch mediates expression of proteins involved in sulfur metabolism via formation of alternative conformations in response to binding by SAM. Models for kinetic trapping of the RNA in the bound conformation require annealing of nonadjacent mRNA segments during a transcriptional pause. The entropic cost required to bring nonadjacent segments together should slow the folding process. To address this paradox, we performed molecular dynamics simulations on the SAM-I riboswitch aptamer domain with and without SAM, starting with the X-ray coordinates of the SAM-bound RNA. Individual trajectories are 200 ns, among the longest reported for an RNA of this size. We applied principle component analysis (PCA) to explore the global dynamics differences between these two trajectories. We observed a conformational switch between a stacked and nonstacked state of a nonadjacent dinucleotide in the presence of SAM. In the absence of SAM the coordination between a bound magnesium ion and the phosphate of A9, one of the nucleotides involved in the dinucleotide stack, is destabilized. An electrostatic potential map reveals a 'hot spot' at the Mg binding site in the presence of SAM. These results suggest that SAM binding helps to position J1/2 in a manner that is favorable for P1 helix formation.


Assuntos
RNA/química , S-Adenosilmetionina/metabolismo , Sítios de Ligação , Ligantes , Magnésio/química , Modelos Moleculares , Nucleotídeos/química , Fosfatos/química
16.
Methods Mol Biol ; 2302: 335-356, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-33877636

RESUMO

Molecular dynamics or MD simulation is gradually maturing into a tool for constructing in vivo models of living cells in atomistic details. The feasibility of such models is bolstered by integrating the simulations with data from microscopic, tomographic and spectroscopic experiments on exascale supercomputers, facilitated by the use of deep learning technologies. Over time, MD simulation has evolved from tens of thousands of atoms to over 100 million atoms comprising an entire cell organelle, a photosynthetic chromatophore vesicle from a purple bacterium. In this chapter, we present a step-by-step outline for preparing, executing and analyzing such large-scale MD simulations of biological systems that are essential to life processes. All scripts are provided via GitHub.


Assuntos
Bactérias/citologia , Cromatóforos Bacterianos/química , Biologia Computacional/métodos , Bactérias/química , Aprendizado Profundo , Simulação de Dinâmica Molecular
17.
Interface Focus ; 11(6): 20210018, 2021 Dec 06.
Artigo em Inglês | MEDLINE | ID: mdl-34956592

RESUMO

The race to meet the challenges of the global pandemic has served as a reminder that the existing drug discovery process is expensive, inefficient and slow. There is a major bottleneck screening the vast number of potential small molecules to shortlist lead compounds for antiviral drug development. New opportunities to accelerate drug discovery lie at the interface between machine learning methods, in this case, developed for linear accelerators, and physics-based methods. The two in silico methods, each have their own advantages and limitations which, interestingly, complement each other. Here, we present an innovative infrastructural development that combines both approaches to accelerate drug discovery. The scale of the potential resulting workflow is such that it is dependent on supercomputing to achieve extremely high throughput. We have demonstrated the viability of this workflow for the study of inhibitors for four COVID-19 target proteins and our ability to perform the required large-scale calculations to identify lead antiviral compounds through repurposing on a variety of supercomputers.

18.
bioRxiv ; 2021 Nov 15.
Artigo em Inglês | MEDLINE | ID: mdl-34816263

RESUMO

We seek to completely revise current models of airborne transmission of respiratory viruses by providing never-before-seen atomic-level views of the SARS-CoV-2 virus within a respiratory aerosol. Our work dramatically extends the capabilities of multiscale computational microscopy to address the significant gaps that exist in current experimental methods, which are limited in their ability to interrogate aerosols at the atomic/molecular level and thus ob-scure our understanding of airborne transmission. We demonstrate how our integrated data-driven platform provides a new way of exploring the composition, structure, and dynamics of aerosols and aerosolized viruses, while driving simulation method development along several important axes. We present a series of initial scientific discoveries for the SARS-CoV-2 Delta variant, noting that the full scientific impact of this work has yet to be realized. ACM REFERENCE FORMAT: Abigail Dommer 1† , Lorenzo Casalino 1† , Fiona Kearns 1† , Mia Rosenfeld 1 , Nicholas Wauer 1 , Surl-Hee Ahn 1 , John Russo, 2 Sofia Oliveira 3 , Clare Morris 1 , AnthonyBogetti 4 , AndaTrifan 5,6 , Alexander Brace 5,7 , TerraSztain 1,8 , Austin Clyde 5,7 , Heng Ma 5 , Chakra Chennubhotla 4 , Hyungro Lee 9 , Matteo Turilli 9 , Syma Khalid 10 , Teresa Tamayo-Mendoza 11 , Matthew Welborn 11 , Anders Christensen 11 , Daniel G. A. Smith 11 , Zhuoran Qiao 12 , Sai Krishna Sirumalla 11 , Michael O'Connor 11 , Frederick Manby 11 , Anima Anandkumar 12,13 , David Hardy 6 , James Phillips 6 , Abraham Stern 13 , Josh Romero 13 , David Clark 13 , Mitchell Dorrell 14 , Tom Maiden 14 , Lei Huang 15 , John McCalpin 15 , Christo- pherWoods 3 , Alan Gray 13 , MattWilliams 3 , Bryan Barker 16 , HarindaRajapaksha 16 , Richard Pitts 16 , Tom Gibbs 13 , John Stone 6 , Daniel Zuckerman 2 *, Adrian Mulholland 3 *, Thomas MillerIII 11,12 *, ShantenuJha 9 *, Arvind Ramanathan 5 *, Lillian Chong 4 *, Rommie Amaro 1 *. 2021. #COVIDisAirborne: AI-Enabled Multiscale Computational Microscopy ofDeltaSARS-CoV-2 in a Respiratory Aerosol. In Supercomputing '21: International Conference for High Perfor-mance Computing, Networking, Storage, and Analysis . ACM, New York, NY, USA, 14 pages. https://doi.org/finalDOI.

19.
J Chem Theory Comput ; 16(12): 7915-7925, 2020 Dec 08.
Artigo em Inglês | MEDLINE | ID: mdl-33170696

RESUMO

The accurate sampling of protein dynamics is an ongoing challenge despite the utilization of high-performance computer (HPC) systems. Utilizing only "brute force" molecular dynamics (MD) simulations requires an unacceptably long time to solution. Adaptive sampling methods allow a more effective sampling of protein dynamics than standard MD simulations. Depending on the restarting strategy, the speed up can be more than 1 order of magnitude. One challenge limiting the utilization of adaptive sampling by domain experts is the relatively high complexity of efficiently running adaptive sampling on HPC systems. We discuss how the ExTASY framework can set up new adaptive sampling strategies and reliably execute resulting workflows at scale on HPC platforms. Here, the folding dynamics of four proteins are predicted with no a priori information.


Assuntos
Computadores , Simulação de Dinâmica Molecular , Proteínas/química
20.
bioRxiv ; 2020 Nov 20.
Artigo em Inglês | MEDLINE | ID: mdl-33236007

RESUMO

We develop a generalizable AI-driven workflow that leverages heterogeneous HPC resources to explore the time-dependent dynamics of molecular systems. We use this workflow to investigate the mechanisms of infectivity of the SARS-CoV-2 spike protein, the main viral infection machinery. Our workflow enables more efficient investigation of spike dynamics in a variety of complex environments, including within a complete SARS-CoV-2 viral envelope simulation, which contains 305 million atoms and shows strong scaling on ORNL Summit using NAMD. We present several novel scientific discoveries, including the elucidation of the spike's full glycan shield, the role of spike glycans in modulating the infectivity of the virus, and the characterization of the flexible interactions between the spike and the human ACE2 receptor. We also demonstrate how AI can accelerate conformational sampling across different systems and pave the way for the future application of such methods to additional studies in SARS-CoV-2 and other molecular systems.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA