Your browser doesn't support javascript.
loading
Struct2Graph: a graph attention network for structure based predictions of protein-protein interactions.
Baranwal, Mayank; Magner, Abram; Saldinger, Jacob; Turali-Emre, Emine S; Elvati, Paolo; Kozarekar, Shivani; VanEpps, J Scott; Kotov, Nicholas A; Violi, Angela; Hero, Alfred O.
Afiliación
  • Baranwal M; Division of Data and Decision Sciences, Tata Consultancy Services Research, Mumbai, India. baranwal.mayank@tcs.com.
  • Magner A; Systems and Control Engineering Group, Indian Institute of Technology, Bombay, India. baranwal.mayank@tcs.com.
  • Saldinger J; Department of Computer Science, University of Albany, SUNY, Albany, USA.
  • Turali-Emre ES; Department of Chemical Engineering, University of Michigan, Ann Arbor, USA.
  • Elvati P; Department of Biomedical Engineering, University of Michigan, Ann Arbor, USA.
  • Kozarekar S; Department of Mechanical Engineering, University of Michigan, Ann Arbor, USA.
  • VanEpps JS; Department of Chemical Engineering, University of Michigan, Ann Arbor, USA.
  • Kotov NA; Department of Biomedical Engineering, University of Michigan, Ann Arbor, USA.
  • Violi A; Department of Emergency Medicine, University of Michigan, Ann Arbor, USA.
  • Hero AO; Biointerfaces Institute, University of Michigan, Ann Arbor, USA.
BMC Bioinformatics ; 23(1): 370, 2022 Sep 10.
Article en En | MEDLINE | ID: mdl-36088285
ABSTRACT

BACKGROUND:

Development of new methods for analysis of protein-protein interactions (PPIs) at molecular and nanometer scales gives insights into intracellular signaling pathways and will improve understanding of protein functions, as well as other nanoscale structures of biological and abiological origins. Recent advances in computational tools, particularly the ones involving modern deep learning algorithms, have been shown to complement experimental approaches for describing and rationalizing PPIs. However, most of the existing works on PPI predictions use protein-sequence information, and thus have difficulties in accounting for the three-dimensional organization of the protein chains.

RESULTS:

In this study, we address this problem and describe a PPI analysis based on a graph attention network, named Struct2Graph, for identifying PPIs directly from the structural data of folded protein globules. Our method is capable of predicting the PPI with an accuracy of 98.89% on the balanced set consisting of an equal number of positive and negative pairs. On the unbalanced set with the ratio of 110 between positive and negative pairs, Struct2Graph achieves a fivefold cross validation average accuracy of 99.42%. Moreover, Struct2Graph can potentially identify residues that likely contribute to the formation of the protein-protein complex. The identification of important residues is tested for two different interaction types (a) Proteins with multiple ligands competing for the same binding area, (b) Dynamic protein-protein adhesion interaction. Struct2Graph identifies interacting residues with 30% sensitivity, 89% specificity, and 87% accuracy.

CONCLUSIONS:

In this manuscript, we address the problem of prediction of PPIs using a first of its kind, 3D-structure-based graph attention network (code available at https//github.com/baranwa2/Struct2Graph ). Furthermore, the novel mutual attention mechanism provides insights into likely interaction sites through its unsupervised knowledge selection process. This study demonstrates that a relatively low-dimensional feature embedding learned from graph structures of individual proteins outperforms other modern machine learning classifiers based on global protein features. In addition, through the analysis of single amino acid variations, the attention mechanism shows preference for disease-causing residue variations over benign polymorphisms, demonstrating that it is not limited to interface residues.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Algoritmos / Proteínas Tipo de estudio: Prognostic_studies / Risk_factors_studies Idioma: En Revista: BMC Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2022 Tipo del documento: Article País de afiliación: India

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Algoritmos / Proteínas Tipo de estudio: Prognostic_studies / Risk_factors_studies Idioma: En Revista: BMC Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2022 Tipo del documento: Article País de afiliación: India