RESUMEN
A prediction has been prepared ab initio for the secondary structure of the hydroxymethyldihydropterin pyrophosphokinase (HPPK) family of proteins starting from a set of aligned homologous protein sequences. Attempts to identify a fold by threading failed, judging by the inability to find a threading "hit" that had a secondary structure that was plausibly congruent to the predicted secondary structure for the HPPK family. Therefore, a set of tertiary structure models was assembled ab initio, where alternative models were built and used to select between alternative secondary structure models. This prediction report illustrates the importance of non-computational approaches to structure prediction at its present frontier, which is to obtain medium resolution models of tertiary structure.
Asunto(s)
Difosfotransferasas/química , Modelos Moleculares , Estructura Secundaria de Proteína , Secuencia de Aminoácidos , Animales , Difosfotransferasas/genética , Evolución Molecular , Datos de Secuencia Molecular , Conformación Proteica , Alineación de Secuencia , Análisis de SecuenciaRESUMEN
Two predictions have been prepared for the fold of initiation factor 5A (IF5A) starting from a set of homologous sequences. In the first, a secondary structural model was predicted for the protein in 1994, when only eleven homologs (and no eubacterial homologs) had been sequenced. The second was made recently, after genome projects had generated a total of 33 sequences for the protein family from species of all three kingdoms of life. With the second set of sequences, but not with the first, it was possible to predict that the N-terminal domain of the protein folds in a possibly open beta-barrel/sandwich core structure, with a short helix capping one side of the barrel. We place the pair of predictions in the public domain before an experimental structure is known. This example illustrates the impact of genome sequencing projects on structure prediction from sequence alignments.