Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 21
Filtrar
1.
Sci Rep ; 11(1): 14268, 2021 07 12.
Artículo en Inglés | MEDLINE | ID: mdl-34253785

RESUMEN

DNA-Damage Response (DDR) proteins are crucial for maintaining the integrity of the genome by identifying and repairing errors in DNA. Variants affecting their function can have severe consequences since failure to repair damaged DNA can result in cells turning cancerous. Here, we compare germline and somatic variants in DDR genes, specifically looking at their locations in the corresponding three-dimensional (3D) structures, Pfam domains, and protein-protein interaction interfaces. We show that somatic variants in metastatic cases are more likely to be found in Pfam domains and protein interaction interfaces than are pathogenic germline variants or variants of unknown significance (VUS). We also show that there are hotspots in the structures of ATM and BRCA2 proteins where pathogenic germline, and recurrent somatic variants from primary and metastatic tumours, cluster together in 3D. Moreover, in the ATM, BRCA1 and BRCA2 genes from prostate cancer patients, the distributions of germline benign, pathogenic, VUS, and recurrent somatic variants differ across Pfam domains. Together, these results provide a better characterisation of the most recurrent affected regions in DDRs and could help in the understanding of individual susceptibility to tumour development.


Asunto(s)
Biología Computacional/métodos , Reparación del ADN , Predisposición Genética a la Enfermedad , Variación Genética , Neoplasias/genética , Proteínas de la Ataxia Telangiectasia Mutada/genética , Proteína BRCA1/genética , Proteína BRCA2/genética , Daño del ADN , ADN Glicosilasas/genética , Mutación de Línea Germinal , Humanos , Metástasis de la Neoplasia , Dominios Proteicos , Mapeo de Interacción de Proteínas
2.
Sci Rep ; 11(1): 2938, 2021 02 03.
Artículo en Inglés | MEDLINE | ID: mdl-33536532

RESUMEN

Reducing the burden of late-life morbidity requires an understanding of the mechanisms of ageing-related diseases (ARDs), defined as diseases that accumulate with increasing age. This has been hampered by the lack of formal criteria to identify ARDs. Here, we present a framework to identify ARDs using two complementary methods consisting of unsupervised machine learning and actuarial techniques, which we applied to electronic health records (EHRs) from 3,009,048 individuals in England using primary care data from the Clinical Practice Research Datalink (CPRD) linked to the Hospital Episode Statistics admitted patient care dataset between 1 April 2010 and 31 March 2015 (mean age 49.7 years (s.d. 18.6), 51% female, 70% white ethnicity). We grouped 278 high-burden diseases into nine main clusters according to their patterns of disease onset, using a hierarchical agglomerative clustering algorithm. Four of these clusters, encompassing 207 diseases spanning diverse organ systems and clinical specialties, had rates of disease onset that clearly increased with chronological age. However, the ages of onset for these four clusters were strikingly different, with median age of onset 82 years (IQR 82-83) for Cluster 1, 77 years (IQR 75-77) for Cluster 2, 69 years (IQR 66-71) for Cluster 3 and 57 years (IQR 54-59) for Cluster 4. Fitting to ageing-related actuarial models confirmed that the vast majority of these 207 diseases had a high probability of being ageing-related. Cardiovascular diseases and cancers were highly represented, while benign neoplastic, skin and psychiatric conditions were largely absent from the four ageing-related clusters. Our framework identifies and clusters ARDs and can form the basis for fundamental and translational research into ageing pathways.


Asunto(s)
Envejecimiento , Enfermedades Cardiovasculares/epidemiología , Ciencia de los Datos , Trastornos Mentales/epidemiología , Neoplasias/epidemiología , Edad de Inicio , Anciano , Anciano de 80 o más Años , Análisis por Conglomerados , Costo de Enfermedad , Registros Electrónicos de Salud/estadística & datos numéricos , Femenino , Humanos , Masculino , Persona de Mediana Edad , Atención Primaria de Salud/estadística & datos numéricos , Factores de Riesgo , Aprendizaje Automático no Supervisado
3.
BMC Bioinformatics ; 21(1): 586, 2020 Dec 29.
Artículo en Inglés | MEDLINE | ID: mdl-33375946

RESUMEN

BACKGROUND: Proteases are key drivers in many biological processes, in part due to their specificity towards their substrates. However, depending on the family and molecular function, they can also display substrate promiscuity which can also be essential. Databases compiling specificity matrices derived from experimental assays have provided valuable insights into protease substrate recognition. Despite this, there are still gaps in our knowledge of the structural determinants. Here, we compile a set of protease crystal structures with bound peptide-like ligands to create a protocol for modelling substrates bound to protease structures, and for studying observables associated to the binding recognition. RESULTS: As an application, we modelled a subset of protease-peptide complexes for which experimental cleavage data are available to compare with informational entropies obtained from protease-specificity matrices. The modelled complexes were subjected to conformational sampling using the Backrub method in Rosetta, and multiple observables from the simulations were calculated and compared per peptide position. We found that some of the calculated structural observables, such as the relative accessible surface area and the interaction energy, can help characterize a protease's substrate recognition, giving insights for the potential prediction of novel substrates by combining additional approaches. CONCLUSION: Overall, our approach provides a repository of protease structures with annotated data, and an open source computational protocol to reproduce the modelling and dynamic analysis of the protease-peptide complexes.


Asunto(s)
Modelos Moleculares , Péptido Hidrolasas/metabolismo , Péptidos/química , Péptidos/metabolismo , Automatización , Ligandos , Péptido Hidrolasas/química , Conformación Proteica , Programas Informáticos , Especificidad por Sustrato
4.
Trends Endocrinol Metab ; 30(2): 118-131, 2019 02.
Artículo en Inglés | MEDLINE | ID: mdl-30581056

RESUMEN

Increasing human life expectancy has posed increasing challenges for healthcare systems. As people age, they become more susceptible to chronic diseases, with an increasing burden of multimorbidity, and the associated polypharmacy. Accumulating evidence from work with laboratory animals has shown that ageing is a malleable process that can be ameliorated by genetic and environmental interventions. Drugs that modulate the ageing process may delay or even prevent the incidence of multiple diseases of ageing. To identify novel, anti-ageing drugs, several studies have developed computational drug-repurposing strategies. We review published studies showing the potential of current drugs to modulate ageing. Future studies should integrate current knowledge with multi-omics, health records, and drug safety data to predict drugs that can improve health in late life.


Asunto(s)
Envejecimiento/fisiología , Longevidad/fisiología , Animales , Humanos , Esperanza de Vida
5.
Cell Rep ; 21(3): 641-653, 2017 Oct 17.
Artículo en Inglés | MEDLINE | ID: mdl-29045833

RESUMEN

Reduced activity of nutrient-sensing signaling networks can extend organismal lifespan, yet the underlying biology remains unclear. We show that the anti-aging effects of rapamycin and reduced intestinal insulin/insulin growth factor (IGF) signaling (IIS) require the Drosophila FoxA transcription factor homolog Fork Head (FKH). Intestinal FKH induction extends lifespan, highlighting a role for the gut. FKH binds to and is phosphorylated by AKT and Target of Rapamycin. Gut-specific FKH upregulation improves gut barrier function in aged flies. Additionally, it increases the expression of nutrient transporters, as does lowered IIS. Evolutionary conservation of this effect of lowered IIS is suggested by the upregulation of related nutrient transporters in insulin receptor substrate 1 knockout mouse intestine. Our study highlights a critical role played by FKH in the gut in mediating anti-aging effects of reduced IIS. Malnutrition caused by poor intestinal absorption is a major problem in the elderly, and a better understanding of the mechanisms involved will have important therapeutic implications for human aging.


Asunto(s)
Drosophila melanogaster/metabolismo , Drosophila melanogaster/fisiología , Alimentos , Factores de Transcripción Forkhead/metabolismo , Absorción Intestinal , Mucosa Intestinal/metabolismo , Longevidad , Proteínas Nucleares/metabolismo , Animales , Restricción Calórica , Diferenciación Celular/efectos de los fármacos , Núcleo Celular/efectos de los fármacos , Núcleo Celular/metabolismo , Proteínas de Drosophila/metabolismo , Enterocitos/efectos de los fármacos , Enterocitos/metabolismo , Femenino , Insulina/metabolismo , Proteínas Sustrato del Receptor de Insulina/metabolismo , Absorción Intestinal/efectos de los fármacos , Intestinos/citología , Longevidad/efectos de los fármacos , Proteínas de Transporte de Membrana/metabolismo , Estrés Oxidativo/efectos de los fármacos , Fosforilación/efectos de los fármacos , Unión Proteica/efectos de los fármacos , Transporte de Proteínas/efectos de los fármacos , Proteínas Proto-Oncogénicas c-akt/metabolismo , Sirolimus/farmacología , Somatomedinas/metabolismo , Transcripción Genética/efectos de los fármacos , Regulación hacia Arriba/efectos de los fármacos
6.
Nucleic Acids Res ; 45(20): 11559-11569, 2017 Nov 16.
Artículo en Inglés | MEDLINE | ID: mdl-29036576

RESUMEN

DNA methylation is an important epigenetic modification in many species that is critical for development, and implicated in ageing and many complex diseases, such as cancer. Many cost-effective genome-wide analyses of DNA modifications rely on restriction enzymes capable of digesting genomic DNA at defined sequence motifs. There are hundreds of restriction enzyme families but few are used to date, because no tool is available for the systematic evaluation of restriction enzyme combinations that can enrich for certain sites of interest in a genome. Herein, we present customised Reduced Representation Bisulfite Sequencing (cuRRBS), a novel and easy-to-use computational method that solves this problem. By computing the optimal enzymatic digestions and size selection steps required, cuRRBS generalises the traditional MspI-based Reduced Representation Bisulfite Sequencing (RRBS) protocol to all restriction enzyme combinations. In addition, cuRRBS estimates the fold-reduction in sequencing costs and provides a robustness value for the personalised RRBS protocol, allowing users to tailor the protocol to their experimental needs. Moreover, we show in silico that cuRRBS-defined restriction enzymes consistently out-perform MspI digestion in many biological systems, considering both CpG and CHG contexts. Finally, we have validated the accuracy of cuRRBS predictions for single and double enzyme digestions using two independent experimental datasets.


Asunto(s)
Biología Computacional/métodos , Metilación de ADN/genética , Análisis de Secuencia de ADN/economía , Análisis de Secuencia de ADN/métodos , Secuenciación Completa del Genoma/métodos , Animales , Arabidopsis/genética , Sitios de Unión/genética , Factor de Unión a CCCTC/genética , Factor de Unión a CCCTC/metabolismo , Islas de CpG/genética , Enzimas de Restricción del ADN/química , Humanos , Células Madre Pluripotentes Inducidas/citología , Ratones , Factor Nuclear 1 de Respiración/genética , Factor Nuclear 1 de Respiración/metabolismo
7.
Aging (Albany NY) ; 6(2): 98-117, 2014 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-24531613

RESUMEN

In C. elegans, increased lifespan in daf-2 insulin/IGF-1 receptor mutants is accompanied by up-regulation of the MDL-1 Mad basic helix-loop-helix leucine zipper transcription factor. Here we describe the role of mdl-1 in C. elegans germline proliferation and aging. The deletion allele mdl-1(tm311) shortened lifespan, and did so significantly more so in long-lived daf-2 mutants implying that mdl-1(+) contributes to effects of daf-2 on lifespan. mdl-1 mutant hermaphrodites also lay increased numbers of unfertilized oocytes. During aging, unfertilized oocytes in the uterus develop into tumors, whose development was accelerated by mdl-1(tm311). Opposite phenotypes were seen in daf-2 mutants, i.e. mdl-1 and daf-2 mutant germlines are hyperplastic and hypoplastic, respectively. Thus, MDL-1, like its mammalian orthologs, is an inhibitor of cell proliferation and growth that slows progression of an age-related pathology in C. elegans (uterine tumors). In addition, intestine-limited rescue of mdl-1 increased lifespan but not to wild type levels. Thus, mdl-1 likely acts both in the intestine and the germline to influence age-related mortality.


Asunto(s)
Envejecimiento/metabolismo , Proteínas de Caenorhabditis elegans/metabolismo , Caenorhabditis elegans/metabolismo , Proteínas de Unión al ADN/metabolismo , Factores de Transcripción/metabolismo , Animales , Caenorhabditis elegans/genética , Proteínas de Caenorhabditis elegans/genética , Proteínas de Unión al ADN/genética , Femenino , Factores de Transcripción Forkhead , Genes myc , Hiperplasia , Hipertrofia , Mucosa Intestinal/metabolismo , Oocitos/crecimiento & desarrollo , Neoplasias Uterinas/genética
8.
Proteins ; 78(3): 518-29, 2010 Feb 15.
Artículo en Inglés | MEDLINE | ID: mdl-19787769

RESUMEN

The process of natural selection acts both on individual organisms within a population and on individual cells within an organism as they develop into cancer. In this work, we have taken a first step toward understanding the differences in selection pressures exerted on the human genome under these disparate circumstances. Focusing on single amino acid substitutions, we have found that cancer-related mutations (CRMs) are frequent in evolutionarily conserved sites, whereas single amino acid polymorphisms (SAPs) tend to appear in sites having a more relaxed evolutionary pressure. Those CRMs classed as cancer driver mutations show greater enrichment for conserved sites than passenger mutations. Consistent with this, driver mutations are enriched for sites annotated as key functional residues and their neighbors, and are more likely to be located on the surface of proteins than expected by chance. Overall the pattern of CRM and polymorphism is remarkably similar, but we do see a clear signal indicative of diversifying selection for disruptive amino acid substitutions in the cancer driver mutations. The ultimate consequence of the appearance of those mutations must be advantageous for the tumor cell, leading to cell population-growth and migration events similar to those seen in natural ecosystems.


Asunto(s)
Sustitución de Aminoácidos , Mutación , Neoplasias/genética , Selección Genética , Sitios de Unión/genética , Análisis Mutacional de ADN/métodos , Humanos , Polimorfismo Genético
9.
Proteins ; 78(5): 1120-36, 2010 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-19927322

RESUMEN

Most function prediction methods that identify cognate ligands from binding site analyses work on the assumption of molecular complementarity. These approaches build on the conjectured complementarity of geometrical and physicochemical properties between ligands and binding sites so that similar binding sites will bind similar ligands. We found that this assumption does not generally hold for protein-ligand interactions and observed that it is not the chemical composition of ligand molecules that dictates the complementarity between protein and ligand molecules, but that the ligand's share within the functional mechanism of a protein determines the degree of complementarity. Here, we present for a set of cognate ligands a descriptive analysis and comparison of the physicochemical properties that each ligand experiences in various nonhomologous binding pockets. The comparisons in each ligand set reveal large variations in their experienced physicochemical properties, suggesting that the same ligand can bind to distinct physicochemical environments. In some protein ligand complexes, the variation was found to correlate with the electrochemical characteristic of ligand molecules, whereas in others it was disclosed as a prerequisite for the biochemical function of the protein. To achieve binding, proteins were observed to engage in subtle balancing acts between electrostatic and hydrophobic interactions to generate stabilizing free energies of binding. For the presented analysis, a new method for scoring hydrophobicity from molecular environments was developed showing high correlations with experimental determined desolvation energies. The presented results highlight the complexities of molecular recognition and underline the challenges of computational structural biology in developing methods to detect these important subtleties.


Asunto(s)
Ligandos , Conformación Proteica , Proteínas , Adenosina Trifosfato/química , Adenosina Trifosfato/metabolismo , Algoritmos , Animales , Sitios de Unión , Hemo/química , Hemo/metabolismo , Interacciones Hidrofóbicas e Hidrofílicas , Modelos Moleculares , NAD/química , NAD/metabolismo , Pliegue de Proteína , Proteínas/química , Proteínas/metabolismo , Electricidad Estática
10.
Science ; 326(5949): 140-4, 2009 Oct 02.
Artículo en Inglés | MEDLINE | ID: mdl-19797661

RESUMEN

Caloric restriction (CR) protects against aging and disease, but the mechanisms by which this affects mammalian life span are unclear. We show in mice that deletion of ribosomal S6 protein kinase 1 (S6K1), a component of the nutrient-responsive mTOR (mammalian target of rapamycin) signaling pathway, led to increased life span and resistance to age-related pathologies, such as bone, immune, and motor dysfunction and loss of insulin sensitivity. Deletion of S6K1 induced gene expression patterns similar to those seen in CR or with pharmacological activation of adenosine monophosphate (AMP)-activated protein kinase (AMPK), a conserved regulator of the metabolic response to CR. Our results demonstrate that S6K1 influences healthy mammalian life-span and suggest that therapeutic manipulation of S6K1 and AMPK might mimic CR and could provide broad protection against diseases of aging.


Asunto(s)
Envejecimiento/fisiología , Longevidad/fisiología , Proteínas Quinasas S6 Ribosómicas 90-kDa/metabolismo , Transducción de Señal , Proteínas Quinasas Activadas por AMP/metabolismo , Tejido Adiposo Blanco/metabolismo , Animales , Densidad Ósea , Restricción Calórica , Femenino , Eliminación de Gen , Expresión Génica , Regulación de la Expresión Génica , Insulina/metabolismo , Hígado/metabolismo , Masculino , Ratones , Ratones Endogámicos C57BL , Actividad Motora , Músculo Esquelético/metabolismo , Proteínas Quinasas/metabolismo , Proteínas Quinasas S6 Ribosómicas 90-kDa/genética , Subgrupos de Linfocitos T/inmunología , Serina-Treonina Quinasas TOR , Transcripción Genética
11.
J Mol Biol ; 390(3): 560-77, 2009 Jul 17.
Artículo en Inglés | MEDLINE | ID: mdl-19447117

RESUMEN

The MACiE database contains 223 distinct step-wise enzyme reaction mechanisms and holds representatives from each EC sub-subclass where there is a crystal structure and sufficient evidence in the literature to support a mechanism. Each catalytic step of every reaction sequence in MACiE is fully annotated so that it includes the function of the catalytic residues involved in the reaction and the mechanism by which substrates are transformed into products. Using MACiE as a knowledge base, we have seen that the top 10 most catalytic residues are histidine, aspartate, glutamate, lysine, cysteine, arginine, serine, threonine, tyrosine and tryptophan. Of these only seven (cysteine, histidine, aspartate, lysine, serine, threonine and tyrosine) dominate catalysis and provide essentially five functional roles that are essential. Stabilisation is the most common and essential role for all classes of enzyme, followed by general acid/base (proton acceptor and proton donor) functionality, with nucleophilic addition following closely behind (nucleophile and nucleofuge). We investigated the occurrence of these residues in MACiE and the Catalytic Site Atlas and found that, as expected, certain residue types are associated with each functional role, with some residue types able to perform diverse roles. In addition, it was seen that different EC classes of enzyme have a tendency to employ different residues for catalysis. Further, we show that whilst the differences between EC classes in catalytic residue composition are not immediately obvious from the general classes of Ingold mechanisms, there is some weak correlation between the mechanisms involved in a given EC class and the functions that the catalytic amino acid residues are performing. The analysis presented here provides a valuable insight into the functional roles of catalytic amino acid residues, which may have applications in many aspects of enzymology, from the design of novel enzymes to the prediction and validation of enzyme reaction mechanisms.


Asunto(s)
Aminoácidos/química , Aminoácidos/metabolismo , Biocatálisis , Enzimas/química , Enzimas/metabolismo
12.
J Mol Biol ; 388(2): 356-80, 2009 May 01.
Artículo en Inglés | MEDLINE | ID: mdl-19265704

RESUMEN

In metalloproteins, the protein environment modulates metal properties to achieve the required goal, which can be protein stabilization or function. The analysis of metal sites at the atomic level of detail provided by protein structures can thus be of benefit in functional and evolutionary studies of proteins. In this work, we propose a structural bioinformatics approach to the study of metalloproteins based on structural templates of metal sites that include the PDB coordinates of protein residues forming the first and the second coordination sphere of the metal. We have applied this approach to non-heme iron sites, which have been analyzed at various levels. Templates of sites located in different protein domains have been compared, showing that similar sites can be found in unrelated proteins as the result of convergent evolution. Templates of sites located in proteins of a large superfamily have been compared, showing possible mechanisms of divergent evolution of proteins to achieve different functions. Furthermore, template comparisons have been used to predict the function of uncharacterized proteins, showing that similarity searches focused on metal sites can be advantageously combined with typical whole-domain comparisons. Structural templates of metal sites, finally, may constitute the basis for a systematic classification of metalloproteins in databases.


Asunto(s)
Biología Computacional/métodos , Hierro/química , Proteínas de Hierro no Heme/química , Sitios de Unión , Bases de Datos de Proteínas , Evolución Molecular , Conformación Proteica
13.
Nat Biotechnol ; 27(2): 157-67, 2009 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-19204698

RESUMEN

Molecular recognition between proteins and their interacting partners underlies the biochemistry of living organisms. Specificity in this recognition is thought to be essential, whereas promiscuity is often associated with unwanted side effects, poor catalytic properties and errors in biological function. Recent experimental evidence suggests that promiscuity, not only in interactions but also in the actual function of proteins, is not as rare as was previously thought. This has implications not only for our fundamental understanding of molecular recognition and how protein function has evolved over time but also in the realm of biotechnology. Understanding protein promiscuity is becoming increasingly important not only to optimize protein engineering applications in areas as diverse as synthetic biology and metagenomics but also to lower attrition rates in drug discovery programs, identify drug interaction surfaces less susceptible to escape mutations and potentiate the power of polypharmacology.


Asunto(s)
Biotecnología/tendencias , Unión Proteica/fisiología , Conformación Proteica , Proteínas , Animales , Biología Computacional , Diseño de Fármacos , Ambiente , Evolución Molecular , Regulación de la Expresión Génica , Humanos , Ligandos , Modelos Moleculares , Docilidad , Ingeniería de Proteínas , Pliegue de Proteína , Multimerización de Proteína , Procesamiento Proteico-Postraduccional , Proteínas/química , Proteínas/fisiología , Solventes , Relación Estructura-Actividad
14.
J Mol Biol ; 372(5): 1261-77, 2007 Oct 05.
Artículo en Inglés | MEDLINE | ID: mdl-17727879

RESUMEN

We report, for the first time, on the statistics of chemical mechanisms and amino acid residue functions that occur in enzyme reaction sequences using the MACiE database of 202 distinct enzyme reaction mechanisms as a knowledge base. MACiE currently holds representatives from each Enzyme Commission sub-subclass where there is an available crystal structure and sufficient evidence in the primary literature for a mechanism. Each catalytic step of every reaction sequence in MACiE is fully annotated, so that it includes the function of the catalytic residues involved in the reaction and the chemical mechanisms by which substrates are transformed into products. We show that the most catalytic amino acid residues are histidine, cysteine and aspartate, which are also the residues whose side-chains are more likely to serve as reactants, and that have the greatest versatility of function. We show that electrophilic reactions in enzymes are very rare, and the majority of enzyme reactions rely upon nucleophilic and general acid/base chemistry. However, although rare, radical (homolytic) reactions are much more common than electrophilic reactions. Thus, the majority of amino acid residues perform stabilisation roles (as spectators) or proton shuttling roles (as reactants). The analysis presented provides a better understanding of the mechanisms of enzyme catalysis and may act as an initial step in the validation and prediction of mechanism in an enzyme active site.


Asunto(s)
Bases de Datos de Proteínas , Enzimas , Conformación Proteica , Aminoácidos/química , Aminoácidos/metabolismo , Sitios de Unión , Catálisis , Estabilidad de Enzimas , Enzimas/química , Enzimas/metabolismo , Estructura Molecular , Procesamiento Proteico-Postraduccional , Protones , Reproducibilidad de los Resultados
15.
J Mol Biol ; 364(4): 836-52, 2006 Dec 08.
Artículo en Inglés | MEDLINE | ID: mdl-17034815

RESUMEN

Here, we present an automatic assignment of potential cognate ligands to domains of enzymes in the CATH and SCOP protein domain classifications on the basis of structural data available in the wwPDB. This procedure involves two steps; firstly, we assign the binding of particular ligands to particular domains; secondly, we compare the chemical similarity of the PDB ligands to ligands in KEGG in order to assign cognate ligands. We find that use of the Enzyme Commission (EC) numbers is necessary to enable efficient and accurate cognate ligand assignment. The PROCOGNATE database currently has cognate ligand mapping for 3277 (4118) protein structures and 351 (302) superfamilies, as described by the CATH and (SCOP) databases, respectively. We find that just under half of all ligands are only and always bound by a single domain, with 16% bound by more than one domain and the remainder of the ligands showing a variety of binding modes. This finding has implications for domain recombination and the evolution of new protein functions. Domain architecture or context is also found to affect substrate specificity of particular domains, and we discuss example cases. The most popular PDB ligands are all found to be generic components of crystallisation buffers, highlighting the non-cognate ligand problem inherent in the PDB. In contrast, the most popular cognate ligands are all found to be universal cellular currencies of reducing power and energy such as NADH, FADH2 and ATP, respectively, reflecting the fact that the vast majority of enzymatic reactions utilise one of these popular co-factors. These ligands all share a common adenine ribonucleotide moiety, suggesting that many different domain superfamilies have converged to bind this chemical framework.


Asunto(s)
Enzimas/química , Mapeo de Interacción de Proteínas/métodos , Nucleótidos de Adenina , Sitios de Unión , Bases de Datos de Proteínas , Enzimas/metabolismo , Ligandos , Unión Proteica , Especificidad por Sustrato
16.
J Mol Biol ; 356(4): 928-44, 2006 Mar 03.
Artículo en Inglés | MEDLINE | ID: mdl-16405908

RESUMEN

The phenomenon of molecular recognition, which underpins almost all biological processes, is dynamic, complex and subtle. Establishing an interaction between a pair of molecules involves mutual structural rearrangements guided by a highly convoluted energy landscape, the accurate mapping of which continues to elude us. Increased understanding of the degree to which the conformational space of a ligand is restricted upon binding may have important implications for docking studies, structure refinement and for function prediction methods based on geometrical comparisons of ligands or their binding sites. Here, we present an analysis of the conformational variability exhibited by three of the most ubiquitous biological ligands in nature, ATP, NAD and FAD. First, we demonstrate qualitatively that these ligands bind to proteins in widely varying conformations, including several cases in which parts of the molecule assume energetically unfavourable orientations. Next, by comparing the distribution of bound ligand shapes with the set of all possible molecular conformations, we provide a quantitative assessment of previous observations that ligands tend to unfold when binding to proteins. We show that, while extended forms of ligands are indeed common in ligand-protein structures, instances of ligands in almost maximally compact arrangements can also be found. Thirdly, we compare the conformational variation in two sets of ligand molecules, those bound to homologous proteins, and those bound to unrelated proteins. Although most superfamilies bind ligands in a fairly conserved manner, we find several cases in which significant variation in ligand configuration is observed.


Asunto(s)
Adenosina Trifosfato , Flavina-Adenina Dinucleótido , NAD , Conformación Proteica , Proteínas/química , Adenosina Trifosfato/química , Adenosina Trifosfato/metabolismo , Análisis por Conglomerados , Flavina-Adenina Dinucleótido/química , Flavina-Adenina Dinucleótido/metabolismo , Modelos Moleculares , Estructura Molecular , NAD/química , NAD/metabolismo , Unión Proteica
17.
Proteins ; 57(2): 365-80, 2004 Nov 01.
Artículo en Inglés | MEDLINE | ID: mdl-15340924

RESUMEN

We present a method called DAVROS to detect, localize, and validate repeating motifs in protein structure allowing for insertions and deletions. DAVROS uses the score matrix from a structural alignment program (SAP) to search for repeating motifs using an algorithm based on concepts from signal processing and the statistical properties of the alignments. The method was tested against a nonredundant Protein Data Bank, and each chain was assigned a score. For the top 50 chains ranked by score, 70% contain repeating motifs detected without error. These represent 14 types of fold covering alpha, beta, and alphabeta protein classes. A second data set comprising protein chains in different sequence families for triosephosphate isomerase (TIM) barrel, leucine-rich repeat (LRR), trefoil, and alpha-alpha barrel folds was used to assess the ability of DAVROS to detect all motifs within a specific fold. For the second test set, the percentage of motifs detected was highest for the LRR chains (88.7%) and least for the TIM barrels (60%). This variability results from the regularity of the LRR motif compared to the alphabeta units of the TIM barrel, which generally have many more indels. These reduce the strength of the repeat signal in the SAP matrix, making repeat detection more difficult.


Asunto(s)
Proteínas/química , Secuencias Repetitivas de Aminoácido , Secuencias de Aminoácidos , Bases de Datos de Proteínas , Péptidos/química , Pliegue de Proteína , Estructura Terciaria de Proteína
18.
Proteins ; 52(3): 427-35, 2003 Aug 15.
Artículo en Inglés | MEDLINE | ID: mdl-12866053

RESUMEN

A detailed comparison of the active sites in beta-ketoacyl synthases (KAS) and related enzymes has been made. Using three-dimensional templates of the three catalytic residues to scan the protein structural database reveals differences in both the geometry and the catalytic role of equivalent residues in different members of the family. The template based on the catalytic cysteine and two histidines in the KAS I and II is totally specific for this family, with no false hits. However, the role of the histidines in catalysis is different between KAS I/II and thiolase on the one hand and KAS III/chalcone synthase on the other. In contrast, a template comprising only cysteine and one histidine is not specific with many hits including members of the KAS family, metal binding sites, other active sites in nonhomologous proteins, and some "random" nonactive sites.


Asunto(s)
3-Oxoacil-(Proteína Transportadora de Acil) Sintasa/química , Biología Computacional/métodos , 3-Oxoacil-(Proteína Transportadora de Acil) Sintasa/metabolismo , Sitios de Unión , Cisteína/química , Cisteína/metabolismo , Bases de Datos de Proteínas , Enzimas/química , Enzimas/metabolismo , Histidina/química , Histidina/metabolismo , Isoenzimas/química , Isoenzimas/metabolismo , Malonil Coenzima A/metabolismo , Estructura Molecular
19.
J Mol Biol ; 330(4): 719-34, 2003 Jul 18.
Artículo en Inglés | MEDLINE | ID: mdl-12850142

RESUMEN

Structural genomics projects aim to provide a sharp increase in the number of structures of functionally unannotated, and largely unstudied, proteins. Algorithms and tools capable of deriving information about the nature, and location, of functional sites within a structure are increasingly useful therefore. Here, a neural network is trained to identify the catalytic residues found in enzymes, based on an analysis of the structure and sequence. The neural network output, and spatial clustering of the highly scoring residues are then used to predict the location of the active site.A comparison of the performance of differently trained neural networks is presented that shows how information from sequence and structure come together to improve the prediction accuracy of the network. Spatial clustering of the network results provides a reliable way of finding likely active sites. In over 69% of the test cases the active site is correctly predicted, and a further 25% are partially correctly predicted. The failures are generally due to the poor quality of the automatically generated sequence alignments. We also present predictions identifying the active site, and potential functional residues in five recently solved enzyme structures, not used in developing the method. The method correctly identifies the putative active site in each case. In most cases the likely functional residues are identified correctly, as well as some potentially novel functional groups.


Asunto(s)
Sitios de Unión , Enzimas/química , Redes Neurales de la Computación , Algoritmos , Proteínas Bacterianas/química , Dominio Catalítico , Biología Computacional , Glicósido Hidrolasas/química , N-Metiltransferasa de Histona-Lisina/química , Intrones , Metiltransferasas/química , Modelos Moleculares , Pentosiltransferasa/química , Estructura Terciaria de Proteína , Programas Informáticos
20.
Structure ; 10(10): 1435-51, 2002 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-12377129

RESUMEN

To improve our understanding of the evolution of novel functions, we performed a sequence, structural, and functional analysis of homologous enzymes and nonenzymes of known three-dimensional structure. In most examples identified, the nonenzyme is derived from an ancestral catalytic precursor (as opposed to the reverse evolutionary scenario, nonenzyme to enzyme), and the active site pocket has been disrupted in some way, owing to the substitution of critical catalytic residues and/or steric interactions that impede substrate binding and catalysis. Pairwise sequence identity is typically insignificant, and almost one-half of the enzyme and nonenzyme pairs do not share any similarity in function. Heterooligomeric enzymes comprising homologous subunits in which one chain is catalytically inactive and enzyme polypeptides that contain internal catalytic and noncatalytic duplications of an ancient enzyme domain are also discussed.


Asunto(s)
Enzimas/química , Biopolímeros , Dominio Catalítico , Enzimas/genética , Evolución Molecular , Modelos Moleculares , Conformación Proteica
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA