RESUMO
The computational identification of all the low energy structures of a peptide given only its sequence is not an easy task even for small peptides,due to the multiple-minima problem and combinatorial explosion. We have developed an algorithm, called the MOLS technique,that addresses this problem, and have applied it to a number of different aspects of the study of peptide and protein structure. Conformational studies of oligopeptides, including loop sequences in proteins have been carried out using this technique. In general the calculations identified all the folds determined by previous studies,and in addition picked up other energetically favorable structures. The method was also used to map the energy surface of the peptides. In another application, we have combined the MOLS technique, using it to generate a library of low energy structures of an oligopeptide, with a genetic algorithm to predict protein structures. The method has also been applied to explore the conformational space of loops in protein structures.Further, it has been applied to the problem of docking a ligand in its receptor site, with encouraging results.
Assuntos
Algoritmos , Biologia Computacional , Modelos Moleculares , Peptídeos/química , Conformação Proteica , Análise de Sequência de Proteína , Animais , Técnicas de Química Combinatória , HumanosRESUMO
We have recently developed a computational technique that uses mutually orthogonal Latin square sampling to explore the conformational space of oligopeptides in an exhaustive manner. In this article, we report its use to analyze the conformational spaces of 120 protein loop sequences in proteins, culled from the PDB, having the length ranging from 5 to 10 residues. The force field used did not have any information regarding the sequences or structures that flanked the loop. The results of the analyses show that the native structure of the loop, as found in the PDB falls at one of the low energy points in the conformational landscape of the sequences. Thus, a large portion of the structural determinants of the loop may be considered intrinsic to the sequence, regardless of either adjacent sequences or structures, or the interactions that the atoms of the loop make with other residues in the protein or in neighboring proteins.
Assuntos
Proteínas/química , Sequência de Aminoácidos , Biologia Computacional , Cristalografia por Raios X , Glicina/química , Modelos Moleculares , Conformação Proteica , Estrutura Secundária de Proteína , Estrutura Terciária de Proteína , SolventesRESUMO
We combine a new, extremely fast technique to generate a library of low energy structures of an oligopeptide (by using mutually orthogonal Latin squares to sample its conformational space) with a genetic algorithm to predict protein structures. The protein sequence is divided into oligopeptides, and a structure library is generated for each. These libraries are used in a newly defined mutation operator that, together with variation, crossover, and diversity operators, is used in a modified genetic algorithm to make the prediction. Application to five small proteins has yielded near native structures.