Your browser doesn't support javascript.
loading
Protein fold recognition using segmentation conditional random fields (SCRFs).
Liu, Yan; Carbonell, Jaime; Weigele, Peter; Gopalakrishnan, Vanathi.
Affiliation
  • Liu Y; School of Computer Science, Carnegie Mellon University, Pittsburgh, PA 15213, USA. yanliu@cs.cmu.edu
J Comput Biol ; 13(2): 394-406, 2006 Mar.
Article in En | MEDLINE | ID: mdl-16597248
ABSTRACT
Protein fold recognition is an important step towards understanding protein three-dimensional structures and their functions. A conditional graphical model, i.e., segmentation conditional random fields (SCRFs), is proposed as an effective solution to this problem. In contrast to traditional graphical models, such as the hidden Markov model (HMM), SCRFs follow a discriminative approach. Therefore, it is flexible to include any features in the model, such as overlapping or long-range interaction features over the whole sequence. The model also employs a convex optimization function, which results in globally optimal solutions to the model parameters. On the other hand, the segmentation setting in SCRFs makes their graphical structures intuitively similar to the protein 3-D structures and more importantly provides a framework to model the long-range interactions between secondary structures directly. Our model is applied to predict the parallel beta-helix fold, an important fold in bacterial pathogenesis and carbohydrate binding/cleavage. The cross-family validation shows that SCRFs not only can score all known beta-helices higher than non-beta-helices in the Protein Data Bank (PDB), but also accurately locates rungs in known beta-helix proteins. Our method outperforms BetaWrap, a state-of-the-art algorithm for predicting beta-helix folds, and HMMER, a general motif detection algorithm based on HMM, and has the additional advantage of general application to other protein folds. Applying our prediction model to the Uniprot Database, we identify previously unknown potential beta-helices.
Subject(s)
Search on Google
Collection: 01-internacional Database: MEDLINE Main subject: Proteins / Sequence Alignment / Protein Folding / Databases, Protein Type of study: Clinical_trials / Prognostic_studies Limits: Animals / Humans Language: En Journal: J Comput Biol Journal subject: BIOLOGIA MOLECULAR / INFORMATICA MEDICA Year: 2006 Type: Article Affiliation country: United States
Search on Google
Collection: 01-internacional Database: MEDLINE Main subject: Proteins / Sequence Alignment / Protein Folding / Databases, Protein Type of study: Clinical_trials / Prognostic_studies Limits: Animals / Humans Language: En Journal: J Comput Biol Journal subject: BIOLOGIA MOLECULAR / INFORMATICA MEDICA Year: 2006 Type: Article Affiliation country: United States