Your browser doesn't support javascript.
loading
Hierarchical Structure of Protein Sequence.
Nekrasov, Alexei N; Kozmin, Yuri P; Kozyrev, Sergey V; Ziganshin, Rustam H; de Brevern, Alexandre G; Anashkina, Anastasia A.
Afiliação
  • Nekrasov AN; Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, The Russian Academy of Sciences, Miklukho-Maklaya St. 16/10, 117997 Moscow, Russia.
  • Kozmin YP; Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, The Russian Academy of Sciences, Miklukho-Maklaya St. 16/10, 117997 Moscow, Russia.
  • Kozyrev SV; Steklov Mathematical Institute and of Russian Academy of Sciences, 8 Gubkina St., 119991 Moscow, Russia.
  • Ziganshin RH; Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, The Russian Academy of Sciences, Miklukho-Maklaya St. 16/10, 117997 Moscow, Russia.
  • de Brevern AG; INSERM UMR S-1134, DSIMB, Univ. Paris, INTS, Lab. of Excellence GR-Ex 6, rue Alexandre Cabanel, CEDEX 15, 75739 Paris, France.
  • Anashkina AA; Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Vavilov St. 32, 119991 Moscow, Russia.
Int J Mol Sci ; 22(15)2021 Aug 03.
Article em En | MEDLINE | ID: mdl-34361104
Most non-communicable diseases are associated with dysfunction of proteins or protein complexes. The relationship between sequence and structure has been analyzed for a long time, and the analysis of the sequences organization in domains and motifs remains an actual research area. Here, we propose a mathematical method for revealing the hierarchical organization of protein sequences. The method is based on the pentapeptide as a unit of protein sequences. Employing the frequency of occurrence of pentapeptides in sequences of natural proteins and a special mathematical approach, this method revealed a hierarchical structure in the protein sequence. The method was applied to 24,647 non-homologous protein sequences with sizes ranging from 50 to 400 residues from the NRDB90 database. Statistical analysis of the branching points of the graphs revealed 11 characteristic values of y (the width of the inscribed function), showing the relationship of these multiple fragments of the sequences. Several examples illustrate how fragments of the protein spatial structure correspond to the elements of the hierarchical structure of the protein sequence. This methodology provides a promising basis for a mathematically-based classification of the elements of the spatial organization of proteins. Elements of the hierarchical structure of different levels of the hierarchy can be used to solve biotechnological and medical problems.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Conformação Proteica / Algoritmos / Proteínas / Bases de Dados de Proteínas Tipo de estudo: Prognostic_studies Limite: Humans Idioma: En Ano de publicação: 2021 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Conformação Proteica / Algoritmos / Proteínas / Bases de Dados de Proteínas Tipo de estudo: Prognostic_studies Limite: Humans Idioma: En Ano de publicação: 2021 Tipo de documento: Article