A novel descriptor of protein sequences and its application.
J Theor Biol
; 347: 109-17, 2014 Apr 21.
Article
in En
| MEDLINE
| ID: mdl-24412564
In this paper, a dynamic 3-D graphical representation of protein sequences is introduced based on three physical-chemical properties of amino acids. The coordinates of the graph have direct biological significance, which could reflect the innate structure of the proteins. The information of principal moments of inertia and range of axis coordinate are extracted as a novel mixed descriptor and proposed for the comparison of protein primary sequences. Meanwhile, the Euclidean distance of the normalized descriptor vectors which avoid the influence of the difference in length of protein sequences under consideration is employed as a quantitative measurement of the similarity of proteins. Finally, we take the nine ND5 (NADH dehydrogenase subunit 5) proteins for example and illustrate the effectiveness of our approach.
Key words
Full text:
1
Collection:
01-internacional
Database:
MEDLINE
Main subject:
Proteins
Language:
En
Journal:
J Theor Biol
Year:
2014
Document type:
Article
Country of publication: