Your browser doesn't support javascript.
loading
GRAFENE: Graphlet-based alignment-free network approach integrates 3D structural and sequence (residue order) data to improve protein structural comparison.
Faisal, Fazle E; Newaz, Khalique; Chaney, Julie L; Li, Jun; Emrich, Scott J; Clark, Patricia L; Milenkovic, Tijana.
Afiliación
  • Faisal FE; Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN, 46556, USA.
  • Newaz K; Interdisciplinary Center for Network Science and Applications, University of Notre Dame, Notre Dame, IN, 46556, USA.
  • Chaney JL; Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN, 46556, USA.
  • Li J; Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN, 46556, USA.
  • Emrich SJ; Interdisciplinary Center for Network Science and Applications, University of Notre Dame, Notre Dame, IN, 46556, USA.
  • Clark PL; Eck Institute for Global Health, University of Notre Dame, Notre Dame, IN, 46556, USA.
  • Milenkovic T; Department of Chemistry and Biochemistry, University of Notre Dame, Notre Dame, IN, 46556, USA.
Sci Rep ; 7(1): 14890, 2017 11 02.
Article en En | MEDLINE | ID: mdl-29097661
ABSTRACT
Initial protein structural comparisons were sequence-based. Since amino acids that are distant in the sequence can be close in the 3-dimensional (3D) structure, 3D contact approaches can complement sequence approaches. Traditional 3D contact approaches study 3D structures directly and are alignment-based. Instead, 3D structures can be modeled as protein structure networks (PSNs). Then, network approaches can compare proteins by comparing their PSNs. These can be alignment-based or alignment-free. We focus on the latter. Existing network alignment-free approaches have drawbacks 1) They rely on naive measures of network topology. 2) They are not robust to PSN size. They cannot integrate 3) multiple PSN measures or 4) PSN data with sequence data, although this could improve comparison because the different data types capture complementary aspects of the protein structure. We address this by 1) exploiting well-established graphlet measures via a new network alignment-free approach, 2) introducing normalized graphlet measures to remove the bias of PSN size, 3) allowing for integrating multiple PSN measures, and 4) using ordered graphlets to combine the complementary PSN data and sequence (specifically, residue order) data. We compare synthetic networks and real-world PSNs more accurately and faster than existing network (alignment-free and alignment-based), 3D contact, or sequence approaches.
Asunto(s)

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Programas Informáticos / Proteínas Idioma: En Revista: Sci Rep Año: 2017 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Programas Informáticos / Proteínas Idioma: En Revista: Sci Rep Año: 2017 Tipo del documento: Article País de afiliación: Estados Unidos