Interface design of SARS-CoV-2 symmetrical nsp7 dimer and machine learning-guided nsp7 sequence prediction reveals physicochemical properties and hotspots for nsp7 stability, adaptation, and therapeutic design.

Yadav, Amar Jeet; Kumar, Shivank; Maurya, Shweata; Bhagat, Khushboo; Padhi, Aditya K

Yadav, Amar Jeet; Kumar, Shivank; Maurya, Shweata; Bhagat, Khushboo; Padhi, Aditya K.

Afiliación

Yadav AJ; Laboratory for Computational Biology & Biomolecular Design, School of Biochemical Engineering, Indian Institute of Technology (BHU), Varanasi 221005, Uttar Pradesh, India. aditya.bce@iitbhu.ac.in.
Kumar S; Laboratory for Computational Biology & Biomolecular Design, School of Biochemical Engineering, Indian Institute of Technology (BHU), Varanasi 221005, Uttar Pradesh, India. aditya.bce@iitbhu.ac.in.
Maurya S; Laboratory for Computational Biology & Biomolecular Design, School of Biochemical Engineering, Indian Institute of Technology (BHU), Varanasi 221005, Uttar Pradesh, India. aditya.bce@iitbhu.ac.in.
Bhagat K; Laboratory for Computational Biology & Biomolecular Design, School of Biochemical Engineering, Indian Institute of Technology (BHU), Varanasi 221005, Uttar Pradesh, India. aditya.bce@iitbhu.ac.in.
Padhi AK; Laboratory for Computational Biology & Biomolecular Design, School of Biochemical Engineering, Indian Institute of Technology (BHU), Varanasi 221005, Uttar Pradesh, India. aditya.bce@iitbhu.ac.in.

Phys Chem Chem Phys ; 26(18): 14046-14061, 2024 May 08.

Article en En | MEDLINE | ID: mdl-38686454

ABSTRACT

ABSTRACT

The COVID-19 pandemic, driven by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), necessitates a profound understanding of the virus and its lifecycle. As an RNA virus with high mutation rates, SARS-CoV-2 exhibits genetic variability leading to the emergence of variants with potential implications. Among its key proteins, the RNA-dependent RNA polymerase (RdRp) is pivotal for viral replication. Notably, RdRp forms dimers via non-structural protein (nsp) subunits, particularly nsp7, crucial for efficient viral RNA copying. Similar to the main protease (Mpro) of SARS-CoV-2, there is a possibility that the nsp7 might also undergo mutational selection events to generate more stable and adaptable versions of nsp7 dimer during virus evolution. However, efforts to obtain such cohesive and comprehensive information are lacking. To address this, we performed this study focused on deciphering the molecular intricacies of nsp7 dimerization using a multifaceted approach. Leveraging computational protein design (CPD), machine learning (ML), AlphaFold v2.0-based structural analysis, and several related computational approaches, we aimed to identify critical residues and mutations influencing nsp7 dimer stability and adaptation. Our methodology involved identifying potential hotspot residues within the dimeric nsp7 interface using an interface-based CPD approach. Through Rosetta-based symmetrical protein design, we designed and modulated nsp7 dimerization, considering selected interface residues. Analysis of physicochemical features revealed acceptable structural changes and several structural and residue-specific insights emphasizing the intricate nature of such protein-protein complexes. Our ML models, particularly the random forest regressor (RFR), accurately predicted binding affinities and ML-guided sequence predictions corroborated CPD findings, elucidating potential nsp7 mutations and their impact on binding affinity. Validation against clinical sequencing data demonstrated the predictive accuracy of our approach. Moreover, AlphaFold v2.0 structural analyses validated optimal dimeric configurations of affinity-enhancing designs, affirming methodological precision. Affinity-enhancing designs exhibited favourable energetics and higher binding affinity as compared to their counterparts. The obtained physicochemical properties, molecular interactions, and sequence predictions advance our understanding of SARS-CoV-2 evolution and inform potential avenues for therapeutic intervention against COVID-19.

Asunto(s)

ARN Polimerasa Dependiente de ARN de Coronavirus; Aprendizaje Automático; SARS-CoV-2; Humanos; Secuencia de Aminoácidos; ARN Polimerasa Dependiente de ARN de Coronavirus/genética; ARN Polimerasa Dependiente de ARN de Coronavirus/metabolismo; ARN Polimerasa Dependiente de ARN de Coronavirus/química; COVID-19/virología; Mutación; Multimerización de Proteína; SARS-CoV-2/genética; SARS-CoV-2/química; Proteínas no Estructurales Virales/genética; Proteínas no Estructurales Virales/química; Proteínas no Estructurales Virales/metabolismo

Texto completo

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Aprendizaje Automático / ARN Polimerasa Dependiente de ARN de Coronavirus / SARS-CoV-2 Límite: Humans Idioma: En Revista: Phys Chem Chem Phys Asunto de la revista: BIOFISICA / QUIMICA Año: 2024 Tipo del documento: Article País de afiliación: India

Texto completo

Imprimir

XML

PubMed Links

Buscar en Google