Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 21
Filtrar
1.
Biochemistry ; 55(28): 3847-50, 2016 07 19.
Artículo en Inglés | MEDLINE | ID: mdl-27347689

RESUMEN

In addition to completing the Watson-Crick nucleobase matching "concept" (big pairs with small, hydrogen bond donors pair with hydrogen bond acceptors), artificially expanded genetic information systems (AEGIS) also challenge DNA polymerases with a complete set of mismatches, including wobble mismatches. Here, we explore wobble mismatches with AEGIS with DNA polymerase 1 from Escherichia coli. Remarkably, we find that the polymerase tolerates an AEGIS:standard wobble that has the same geometry as the G:T wobble that polymerases have evolved to exclude but excludes a wobble geometry that polymerases have never encountered in natural history. These results suggest certain limits to "structural analogy" and "evolutionary guidance" as tools to help synthetic biologists expand DNA alphabets.


Asunto(s)
Disparidad de Par Base , ADN Polimerasa I/metabolismo , ADN/genética , ADN/metabolismo , Evolución Molecular , Emparejamiento Base , ADN/química , Escherichia coli/enzimología , Unión Proteica
2.
Bioinformatics ; 31(3): 311-7, 2015 Feb 01.
Artículo en Inglés | MEDLINE | ID: mdl-25306399

RESUMEN

MOTIVATION: Despite advances in high-throughput sequencing, marine metagenomic samples remain largely opaque. A typical sample contains billions of microbial organisms from thousands of genomes and quadrillions of DNA base pairs. Its derived metagenomic dataset underrepresents this complexity by orders of magnitude because of the sparseness and shortness of sequencing reads. Read shortness and sequencing errors pose a major challenge to accurate species and functional annotation. This includes distinguishing known from novel species. Often the majority of reads cannot be annotated and thus cannot help our interpretation of the sample. RESULTS: Here, we demonstrate quantitatively how careful assembly of marine metagenomic reads within, but also across, datasets can alleviate this problem. For 10 simulated datasets, each with species complexity modeled on a real counterpart, chimerism remained within the same species for most contigs (97%). For 42 real pyrosequencing ('454') datasets, assembly increased the proportion of annotated reads, and even more so when datasets were pooled, by on average 1.6% (max 6.6%) for species, 9.0% (max 28.7%) for Pfam protein domains and 9.4% (max 22.9%) for PANTHER gene families. Our results outline exciting prospects for data sharing in the metagenomics community. While chimeric sequences should be avoided in other areas of metagenomics (e.g. biodiversity analyses), conservative pooled assembly is advantageous for annotation specificity and sensitivity. Intriguingly, our experiment also found potential prospects for (low-cost) discovery of new species in 'old' data. CONTACT: dgerloff@ffame.org SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Proteínas Arqueales/genética , Proteínas Bacterianas/genética , Quimerismo , Conjuntos de Datos como Asunto , Genoma Arqueal , Metagenómica , Anotación de Secuencia Molecular/métodos , Análisis de Secuencia de ADN/métodos , Genoma Bacteriano
3.
J Am Chem Soc ; 137(21): 6734-7, 2015 Jun 03.
Artículo en Inglés | MEDLINE | ID: mdl-25966323

RESUMEN

Axiomatically, the density of information stored in DNA, with just four nucleotides (GACT), is higher than in a binary code, but less than it might be if synthetic biologists succeed in adding independently replicating nucleotides to genetic systems. Such addition could also add functional groups not found in natural DNA, but useful for molecular performance. Here, we consider two new nucleotides (Z and P, 6-amino-5-nitro-3-(1'-ß-D-2'-deoxyribo-furanosyl)-2(1H)-pyridone and 2-amino-8-(1'-ß-D-2'-deoxyribofuranosyl)-imidazo[1,2-a]-1,3,5-triazin-4(8H)-one). These are designed to pair via complete Watson-Crick geometry. These were added to a library of oligonucleotides used in a laboratory in vitro evolution (LIVE) experiment; the GACTZP library was challenged to deliver molecules that bind selectively to liver cancer cells, but not to untransformed liver cells. Unlike in classical in vitro selection, low levels of mutation allow this system to evolve to create binding molecules not necessarily present in the original library. Over a dozen binding species were recovered. The best had Z and/or P in their sequences. Several had multiple, nearby, and adjacent Zs and Ps. Only the weaker binders contained no Z or P at all. This suggests that this system explored much of the sequence space available to this genetic system and that GACTZP libraries are richer reservoirs of functionality than standard libraries.


Asunto(s)
ADN/química , ADN/síntesis química , ADN/genética , Biblioteca de Genes , Células Hep G2 , Humanos , Modelos Moleculares , Reacción en Cadena de la Polimerasa
4.
PLoS Genet ; 8(4): e1002626, 2012.
Artículo en Inglés | MEDLINE | ID: mdl-22511878

RESUMEN

Biofilms are dense microbial communities. Although widely distributed and medically important, how biofilm cells interact with one another is poorly understood. Recently, we described a novel process whereby myxobacterial biofilm cells exchange their outer membrane (OM) lipoproteins. For the first time we report here the identification of two host proteins, TraAB, required for transfer. These proteins are predicted to localize in the cell envelope; and TraA encodes a distant PA14 lectin-like domain, a cysteine-rich tandem repeat region, and a putative C-terminal protein sorting tag named MYXO-CTERM, while TraB encodes an OmpA-like domain. Importantly, TraAB are required in donors and recipients, suggesting bidirectional transfer. By use of a lipophilic fluorescent dye, we also discovered that OM lipids are exchanged. Similar to lipoproteins, dye transfer requires TraAB function, gliding motility and a structured biofilm. Importantly, OM exchange was found to regulate swarming and development behaviors, suggesting a new role in cell-cell communication. A working model proposes TraA is a cell surface receptor that mediates cell-cell adhesion for OM fusion, in which lipoproteins/lipids are transferred by lateral diffusion. We further hypothesize that cell contact-dependent exchange helps myxobacteria to coordinate their social behaviors.


Asunto(s)
Proteínas de la Membrana Bacteriana Externa/genética , Comunicación Celular , Membrana Celular , Metabolismo de los Lípidos , Myxococcus xanthus/genética , Proteínas de la Membrana Bacteriana Externa/metabolismo , Proteínas Bacterianas/genética , Biopelículas/crecimiento & desarrollo , Adhesión Celular/genética , Comunicación Celular/genética , Membrana Celular/genética , Membrana Celular/metabolismo , Metabolismo de los Lípidos/genética , Proteínas Motoras Moleculares/genética , Myxococcus xanthus/citología , Conformación Proteica , Transporte de Proteínas/genética
5.
Bioinformatics ; 29(18): 2350-2, 2013 Sep 15.
Artículo en Inglés | MEDLINE | ID: mdl-23894142

RESUMEN

SUMMARY: The Malaria Genome Exploration Tool (MaGnET) is a software tool enabling intuitive 'exploration-style' visualization of functional genomics data relating to the malaria parasite, Plasmodium falciparum. MaGnET provides innovative integrated graphic displays for different datasets, including genomic location of genes, mRNA expression data, protein-protein interactions and more. Any selection of genes to explore made by the user is easily carried over between the different viewers for different datasets, and can be changed interactively at any point (without returning to a search). AVAILABILITY AND IMPLEMENTATION: Free online use (Java Web Start) or download (Java application archive and MySQL database; requires local MySQL installation) at http://malariagenomeexplorer.org CONTACT: joanna.sharman@ed.ac.uk or dgerloff@ffame.org SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Genoma de Protozoos , Plasmodium falciparum/genética , Programas Informáticos , Genómica , Plasmodium falciparum/metabolismo , Proteínas Protozoarias/genética , Proteínas Protozoarias/metabolismo
6.
Nat Commun ; 15(1): 4057, 2024 May 14.
Artículo en Inglés | MEDLINE | ID: mdl-38744910

RESUMEN

With just four building blocks, low sequence information density, few functional groups, poor control over folding, and difficulties in forming compact folds, natural DNA and RNA have been disappointing platforms from which to evolve receptors, ligands, and catalysts. Accordingly, synthetic biology has created "artificially expanded genetic information systems" (AEGIS) to add nucleotides, functionality, and information density. With the expected improvements seen in AegisBodies and AegisZymes, the task for synthetic biologists shifts to developing for expanded DNA the same analytical tools available to natural DNA. Here we report one of these, an enzyme-assisted sequencing of expanded genetic alphabet (ESEGA) method to sequence six-letter AEGIS DNA. We show how ESEGA analyses this DNA at single base resolution, and applies it to optimized conditions for six-nucleotide PCR, assessing the fidelity of various DNA polymerases, and extending this to AEGIS components with functional groups. This supports the renewed exploitation of expanded DNA alphabets in biotechnology.


Asunto(s)
ADN , Secuenciación de Nucleótidos de Alto Rendimiento , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , ADN/genética , ADN/metabolismo , Biología Sintética/métodos , ADN Polimerasa Dirigida por ADN/metabolismo , ADN Polimerasa Dirigida por ADN/genética , Reacción en Cadena de la Polimerasa/métodos , Secuencia de Bases , Análisis de Secuencia de ADN/métodos
7.
Nucleic Acids Res ; 39(Database issue): D705-11, 2011 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-21081561

RESUMEN

Binary subcomplexes in proteins database (BISC) is a new protein-protein interaction (PPI) database linking up the two communities most active in their characterization: structural biology and functional genomics researchers. The BISC resource offers users (i) a structural perspective and related information about binary subcomplexes (i.e. physical direct interactions between proteins) that are either structurally characterized or modellable entries in the main functional genomics PPI databases BioGRID, IntAct and HPRD; (ii) selected web services to further investigate the validity of postulated PPI by inspection of their hypothetical modelled interfaces. Among other uses we envision that this resource can help identify possible false positive PPI in current database records. BISC is freely available at http://bisc.cse.ucsc.edu.


Asunto(s)
Bases de Datos de Proteínas , Complejos Multiproteicos/química , Bases de Datos Genéticas , Genómica , Modelos Moleculares , Complejos Multiproteicos/genética , Mapeo de Interacción de Proteínas
8.
STAR Protoc ; 4(1): 101905, 2023 03 17.
Artículo en Inglés | MEDLINE | ID: mdl-36528856

RESUMEN

A significant number of proteins are annotated as functionally uncharacterized proteins. Within this protocol, we describe how to use protein family multiple sequence alignments and structural bioinformatics resources to design loss-of-function mutations of previously uncharacterized proteins within the glycosyltransferase family. We detail approaches to determine target protein active sites using three-dimensional modeling. We generate active site mutants and quantify any changes in enzymatic function by a glycosyltransferase assay. With modifications, this protocol could be applied to other metal-dependent enzymes. For complete details on the use and execution of this protocol, please refer to Ilina et al. (2022).1.


Asunto(s)
Bioensayo , Ingeniería de Proteínas , Biología Computacional , Glicosiltransferasas/genética , Mutación
9.
Res Sq ; 2023 Dec 21.
Artículo en Inglés | MEDLINE | ID: mdl-38196584

RESUMEN

Many efforts have sought to apply laboratory in vitro evolution (LIVE) to natural nucleic acid (NA) scaffolds to directly evolve functional molecules. However, synthetic biology can move beyond natural NA scaffolds to create molecular systems whose libraries are far richer reservoirs of functionality than natural NAs. For example, "artificially expanded genetic information systems" (AEGIS) add up to eight nucleotides to the four found in standard NA. Even in its simplest 6-letter versions, AEGIS adds functional groups, information density, and folding motifs that natural NA libraries lack. To complete this vision, however, tools are needed to sequence molecules that are created by AEGIS LIVE. Previous sequencing approaches, including approaches from our laboratories, exhibited limited performance and lost many sequences in diverse library mixtures. Here, we present a new approach that enzymatically transforms the target AEGIS DNA. With higher transliteration efficiency and fidelity, this Enzyme-Assisted Sequencing of Expanded Genetic Alphabet (ESEGA) approach produces substantially better sequences of 6-letter (AGCTZP) DNA than previous transliteration approaches. Therefore, ESEGA facilitates precise analysis of libraries, allowing 'next-generation deep sequencing' to accurately quantify the sequences of 6-letter DNA molecules at single base resolution. We then applied ESEGA to three tasks: (a) defining optimal conditions to perform 6-nucleotide PCR (b) evaluating the fidelity of 6-nucleotide PCR with various DNA polymerases, and (c) extending that evaluation to AEGIS components functionalized with alkynyl and aromatic groups. No other approach at present has this scope, allowing this work to be the next step towards exploiting the potential of expanded DNA alphabets in biotechnology.

10.
iScience ; 25(2): 103842, 2022 Feb 18.
Artículo en Inglés | MEDLINE | ID: mdl-35198895

RESUMEN

Glioblastoma (GBM) is the most aggressive primary brain tumor characterized by infiltrative growth of malignant glioma cells into the surrounding brain parenchyma. In this study, our analysis of GBM patient cohorts revealed a significantly higher expression of Glycosyltransferase 8 domain containing 1 (GLT8D1) compared to normal brain tissue and could be associated with impaired patient survival. Increased in vitro expression of GLT8D1 significantly enhanced migration of two different sphere-forming GBM cell lines. By in silico analysis we predicted the 3D-structure as well as the active site residues of GLT8D1. The introduction of point mutations in the predicted active site reduced its glycosyltransferase activity in vitro and consequently impaired GBM tumor cell migration. Examination of GLT8D1 interaction partners by LC-MS/MS implied proteins associated with cytoskeleton and intracellular transport as potential substrates. In conclusion, we demonstrated that the enzymatic activity of glycosyltransferase GLT8D1 promotes GBM cell migration.

11.
J Cell Biol ; 166(2): 179-91, 2004 Jul 19.
Artículo en Inglés | MEDLINE | ID: mdl-15249581

RESUMEN

The chromosomal passenger complex of Aurora B kinase, INCENP, and Survivin has essential regulatory roles at centromeres and the central spindle in mitosis. Here, we describe Borealin, a novel member of the complex. Approximately half of Aurora B in mitotic cells is complexed with INCENP, Borealin, and Survivin; and Borealin binds Survivin and INCENP in vitro. A second complex contains Aurora B and INCENP, but no Borealin or Survivin. Depletion of Borealin by RNA interference delays mitotic progression and results in kinetochore-spindle misattachments and an increase in bipolar spindles associated with ectopic asters. The extra poles, which apparently form after chromosomes achieve a bipolar orientation, severely disrupt the partitioning of chromosomes in anaphase. Borealin depletion has little effect on histone H3 serine10 phosphorylation. These results implicate the chromosomal passenger holocomplex in the maintenance of spindle integrity and suggest that histone H3 serine10 phosphorylation is performed by an Aurora B-INCENP subcomplex.


Asunto(s)
Proteínas de Ciclo Celular/metabolismo , Proteínas Nucleares/metabolismo , Huso Acromático , Secuencia de Aminoácidos , Animales , Sitios de Unión , Proteínas de Ciclo Celular/genética , Proteínas de Ciclo Celular/fisiología , División Celular , Proteínas Cromosómicas no Histona/metabolismo , Cromosomas , Humanos , Proteínas Inhibidoras de la Apoptosis , Sustancias Macromoleculares , Proteínas Asociadas a Microtúbulos/metabolismo , Mutación , Proteínas de Neoplasias , Proteínas Recombinantes de Fusión/genética , Alineación de Secuencia , Survivin
12.
Vaccine ; 35(2): 264-272, 2017 01 05.
Artículo en Inglés | MEDLINE | ID: mdl-27912985

RESUMEN

Pfs48/45 and Pfs25 are leading candidates for the development of Plasmodium falciparum transmission blocking vaccines (TBV). Expression of Pfs48/45 in the erythrocytic sexual stages and presentation to the immune system during infection in the human host also makes it ideal for natural boosting. However, it has been challenging to produce a fully folded, functionally active Pfs48/45, using various protein expression platforms. In this study, we demonstrate that full-length Pfs48/45 encoded by DNA plasmids is able to induce significant transmission reducing immune responses. DNA plasmids encoding Pfs48/45 based on native (WT), codon optimized (SYN), or codon optimized and mutated (MUT1 and MUT2), to prevent any asparagine (N)-linked glycosylation were compared with or without intramuscular electroporation (EP). EP significantly enhanced antibody titers and transmission blocking activity elicited by immunization with SYN Pfs48/45 DNA vaccine. Mosquito membrane feeding assays also revealed improved functional immunogenicity of SYN Pfs48/45 (N-glycosylation sites intact) as compared to MUT1 or MUT2 Pfs48/45 DNA plasmids (all N-glycosylation sites mutated). Boosting with recombinant Pfs48/45 protein after immunization with each of the different DNA vaccines resulted in significant boosting of antibody response and improved transmission reducing capabilities of all four DNA vaccines. Finally, immunization with a combination of DNA plasmids (SYN Pfs48/45 and SYN Pfs25) also provides support for the possibility of combining antigens targeting different life cycle stages in the parasite during transmission through mosquitoes.


Asunto(s)
Vacunas contra la Malaria/administración & dosificación , Vacunas contra la Malaria/inmunología , Malaria Falciparum/prevención & control , Glicoproteínas de Membrana/inmunología , Proteínas Protozoarias/inmunología , Vacunas de ADN/administración & dosificación , Vacunas de ADN/inmunología , Animales , Anticuerpos Antiprotozoarios/sangre , Antígenos de Protozoos/genética , Antígenos de Protozoos/inmunología , Transmisión de Enfermedad Infecciosa/prevención & control , Electroporación , Femenino , Malaria Falciparum/transmisión , Glicoproteínas de Membrana/genética , Ratones Endogámicos BALB C , Proteínas Protozoarias/genética
13.
Protein Eng Des Sel ; 18(8): 379-88, 2005 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-15976010

RESUMEN

Numerous mammalian proteins are constructed from a limited repertoire of module-types. Proteins belonging to the regulators of complement activation family--crucial for ensuring a complement-mediated immune response is targeted against infectious agents--are composed solely of complement control protein (CCP) modules. In the current study, CCP module sequences were grouped to allow selection of the most appropriate experimentally determined structures to serve as templates in an automated large-scale structure modelling procedure. The resulting 135 individual CCP module models, valuable in their own right, are available at the online database http://www.bru.ed.ac.uk/~dinesh/ccp-db.html. Comparisons of surface properties within a particular family of modules should be more informative than sequence alignments alone. A comparison of surface electrostatic features was undertaken for the first 28 CCP modules of complement receptor type 1 (CR1). Assignments to clusters based on surface properties differ from assignments to clusters based on sequences. This observation might reflect adaptive evolution of surface-exposed residues involved in protein-protein interactions. This illustrative example of a multiple surface-comparison was indeed able to pinpoint functional sites in CR1.


Asunto(s)
Modelos Moleculares , Receptores de Complemento/química , Secuencia de Aminoácidos , Animales , Bases de Datos de Proteínas , Humanos , Estructura Terciaria de Proteína , Alineación de Secuencia , Electricidad Estática , Homología Estructural de Proteína
14.
Open Biol ; 5(2): 150005, 2015 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-25716199

RESUMEN

SMC proteins are essential components of three protein complexes that are important for chromosome structure and function. The cohesin complex holds replicated sister chromatids together, whereas the condensin complex has an essential role in mitotic chromosome architecture. Both are involved in interphase genome organization. SMC-containing complexes are large (more than 650 kDa for condensin) and contain long anti-parallel coiled-coils. They are thus difficult subjects for conventional crystallographic and electron cryomicroscopic studies. Here, we have used amino acid-selective cross-linking and mass spectrometry combined with structure prediction to develop a full-length molecular draft three-dimensional structure of the SMC2/SMC4 dimeric backbone of chicken condensin. We assembled homology-based molecular models of the globular heads and hinges with the lengthy coiled-coils modelled in fragments, using numerous high-confidence cross-links and accounting for potential irregularities. Our experiments reveal that isolated condensin complexes can exist with their coiled-coil segments closely apposed to one another along their lengths and define the relative spatial alignment of the two anti-parallel coils. The centres of the coiled-coils can also approach one another closely in situ in mitotic chromosomes. In addition to revealing structural information, our cross-linking data suggest that both H2A and H4 may have roles in condensin interactions with chromatin.


Asunto(s)
Adenosina Trifosfatasas/química , Proteínas de Unión al ADN/química , Modelos Moleculares , Complejos Multiproteicos/química , Proteínas Nucleares/química , Animales , Línea Celular , Pollos , Cromosomas , Ligamiento Genético , Histonas/metabolismo , Mitosis , Conformación Proteica , Dominios y Motivos de Interacción de Proteínas , Proteínas Recombinantes de Fusión
15.
FEBS Lett ; 578(1-2): 163-8, 2004 Dec 03.
Artículo en Inglés | MEDLINE | ID: mdl-15581635

RESUMEN

The alternative NADH:ubiquinone oxidoreductase (NDH-2) from Escherichia coli is a membrane protein playing a prominent role in respiration by linking the reduction of NADH to the quinone pool. Remote sequence similarity reveals an evolutionary relation between alternative NADH:quinone oxidoreductases and the SCOP-family "FAD/NAD-linked reductases". We have created a structural model for NDH-2 from E. coli through comparative modelling onto a template from this family. Combined analysis of our model and sequence conservation allowed us to include the cofactor FAD and the substrate NADH in atomic detail. Furthermore, we propose the most plausible orientation of NDH-2 relative to the membrane and specify a region of the protein potentially involved in ubiquinone binding.


Asunto(s)
Complejo I de Transporte de Electrón/química , Complejo I de Transporte de Electrón/metabolismo , Proteínas de Escherichia coli/química , Proteínas de Escherichia coli/metabolismo , Escherichia coli/enzimología , Secuencia de Aminoácidos , Sitios de Unión , Respiración de la Célula/fisiología , Cobre/metabolismo , Complejo I de Transporte de Electrón/genética , Escherichia coli/metabolismo , Proteínas de Escherichia coli/genética , Flavina-Adenina Dinucleótido/química , Flavina-Adenina Dinucleótido/metabolismo , Modelos Moleculares , Datos de Secuencia Molecular , NAD/química , NAD/metabolismo , Unión Proteica , Estructura Terciaria de Proteína , Ubiquinona/metabolismo
16.
mBio ; 5(1): e00744-13, 2013 Dec 31.
Artículo en Inglés | MEDLINE | ID: mdl-24381303

RESUMEN

UNLABELLED: More than 20% of all protein domains are currently annotated as "domains of unknown function" (DUFs). About 2,700 DUFs are found in bacteria compared with just over 1,500 in eukaryotes. Over 800 DUFs are shared between bacteria and eukaryotes, and about 300 of these are also present in archaea. A total of 2,786 bacterial Pfam domains even occur in animals, including 320 DUFs. Evolutionary conservation suggests that many of these DUFs are important. Here we show that 355 essential proteins in 16 model bacterial species contain 238 DUFs, most of which represent single-domain proteins, clearly establishing the biological essentiality of DUFs. We suggest that experimental research should focus on conserved and essential DUFs (eDUFs) for functional analysis given their important function and wide taxonomic distribution, including bacterial pathogens. IMPORTANCE: The functional units of proteins are domains. Typically, each domain has a distinct structure and function. Genomes encode thousands of domains, and many of the domains have no known function (domains of unknown function [DUFs]). They are often ignored as of little relevance, given that many of them are found in only a few genomes. Here we show that many DUFs are essential DUFs (eDUFs) based on their presence in essential proteins. We also show that eDUFs are often essential even if they are found in relatively few genomes. However, in general, more common DUFs are more often essential than rare DUFs.


Asunto(s)
Bacterias/genética , Proteínas Bacterianas/química , Proteínas Bacterianas/genética , Estructura Terciaria de Proteína , Secuencia Conservada
18.
FEBS Lett ; 586(17): 2711-6, 2012 Aug 14.
Artículo en Inglés | MEDLINE | ID: mdl-22584059

RESUMEN

BRCT domains are versatile protein modular domains found as single units or as multiple copies in more than 20 different proteins in the human genome. Interestingly, most BRCT-containing proteins function in the same biological process, the DNA damage response network, but show specificity in their molecular interactions. BRCT domains have been found to bind a wide array of ligands from proteins, phosphorylated linear motifs, and DNA. Here we discuss the biology of BRCT domains and how a domain-centric analysis can aid in the understanding of signal transduction events in the DNA damage response network.


Asunto(s)
Proteína BRCA1/química , Estructura Terciaria de Proteína , Proteínas/química , Secuencias de Aminoácidos , Animales , Simulación por Computador , Daño del ADN , Genoma Humano , Humanos , Ligandos , Modelos Moleculares , Conformación Molecular , Fosforilación , Filogenia , Unión Proteica , Conformación Proteica , Transducción de Señal , Electricidad Estática
19.
Protein Sci ; 21(6): 769-85, 2012 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-22528593

RESUMEN

Abstract The interface of protein structural biology, protein biophysics, molecular evolution, and molecular population genetics forms the foundations for a mechanistic understanding of many aspects of protein biochemistry. Current efforts in interdisciplinary protein modeling are in their infancy and the state-of-the art of such models is described. Beyond the relationship between amino acid substitution and static protein structure, protein function, and corresponding organismal fitness, other considerations are also discussed. More complex mutational processes such as insertion and deletion and domain rearrangements and even circular permutations should be evaluated. The role of intrinsically disordered proteins is still controversial, but may be increasingly important to consider. Protein geometry and protein dynamics as a deviation from static considerations of protein structure are also important. Protein expression level is known to be a major determinant of evolutionary rate and several considerations including selection at the mRNA level and the role of interaction specificity are discussed. Lastly, the relationship between modeling and needed high-throughput experimental data as well as experimental examination of protein evolution using ancestral sequence resurrection and in vitro biochemistry are presented, towards an aim of ultimately generating better models for biological inference and prediction.


Asunto(s)
Evolución Molecular , Proteínas/química , Proteínas/genética , Secuencia de Aminoácidos , Animales , Humanos , Modelos Moleculares , Datos de Secuencia Molecular , Conformación Proteica , Pliegue de Proteína , ARN Mensajero/genética , Alineación de Secuencia
20.
PLoS One ; 4(9): e7148, 2009 Sep 24.
Artículo en Inglés | MEDLINE | ID: mdl-19777061

RESUMEN

BACKGROUND: HEAT and ARM repeats occur in a large number of eukaryotic proteins. As these repeats are often highly diverged, the prediction of HEAT or ARM domains can be challenging. Except for the most clear-cut cases, identification at the individual repeat level is indispensable, in particular for determining domain boundaries. However, methods using single sequence queries do not have the sensitivity required to deal with more divergent repeats and, when applied to proteins with known structures, in some cases failed to detect a single repeat. METHODOLOGY AND PRINCIPAL FINDINGS: Testing algorithms which use multiple sequence alignments as queries, we found two of them, HHpred and COACH, to detect HEAT and ARM repeats with greatly enhanced sensitivity. Calibration against experimentally determined structures suggests the use of three score classes with increasing confidence in the prediction, and prediction thresholds for each method. When we applied a new protocol using both HHpred and COACH to these structures, it detected 82% of HEAT repeats and 90% of ARM repeats, with the minimum for a given protein of 57% for HEAT repeats and 60% for ARM repeats. Application to bona fide HEAT and ARM proteins or domains indicated that similar numbers can be expected for the full complement of HEAT/ARM proteins. A systematic screen of the Protein Data Bank for false positive hits revealed their number to be low, in particular for ARM repeats. Double false positive hits for a given protein were rare for HEAT and not at all observed for ARM repeats. In combination with fold prediction and consistency checking (multiple sequence alignments, secondary structure prediction, and position analysis), repeat prediction with the new HHpred/COACH protocol dramatically improves prediction in the twilight zone of fold prediction methods, as well as the delineation of HEAT/ARM domain boundaries. SIGNIFICANCE: A protocol is presented for the identification of individual HEAT or ARM repeats which is straightforward to implement. It provides high sensitivity at a low false positive rate and will therefore greatly enhance the accuracy of predictions of HEAT and ARM domains.


Asunto(s)
Biología Computacional/métodos , Alineación de Secuencia/instrumentación , Alineación de Secuencia/métodos , Análisis de Secuencia de Proteína/instrumentación , Análisis de Secuencia de Proteína/métodos , Algoritmos , Secuencia de Aminoácidos , Animales , Automatización , Calibración , Humanos , Datos de Secuencia Molecular , Conformación Proteica , Pliegue de Proteína , Estructura Terciaria de Proteína , Proteínas/química , Programas Informáticos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA