RESUMEN
When template DNA is saturated with a single-stranded DNA binding protein (SSB), strings of three or four contiguous hexanucleotides (hexamers) can cooperate through base-stacking interactions to prime DNA synthesis specifically from the 3' end of the string. Under the same conditions, priming by individual hexamers is suppressed. Strings of three of four hexamers representing more than 200 of the 4096 possible hexamers primed easily readable sequence ladders at more than 75 different sites in single-stranded or denatured double-stranded templates 6.4 kilobases to 40 kilobase pairs long, with a success rate of 60 to 90 percent. A synthesis of 1 micromole of hexamer supplies enough material for thousands of primings, so multiple libraries of all 4096 hexamers could be distributed at a reasonable cost. Such libraries would allow rapid and economical sequencing. Automating this strategy could increase the speed and efficiency of large-scale DNA sequencing by at least an order of magnitude.
Asunto(s)
Secuencia de Bases , ADN/genética , Técnicas Genéticas , Sitios de Unión , ADN/metabolismo , ADN Viral/genética , Proteínas de Unión al ADN/metabolismo , Escherichia coli/metabolismo , Datos de Secuencia Molecular , Desnaturalización de Ácido Nucleico , Oligodesoxirribonucleótidos , Radioisótopos de Azufre , Moldes GenéticosRESUMEN
The etiological agent of Lyme disease is the tick-borne spirochete, Borrelia burgdorferi. A major antigen of B. burgdorferi is a 31 kDa lipoprotein called outer surface protein A (OspA). Recently, a truncated form of OspA (lacking 17 amino acids at the N-terminus) was cloned, expressed and purified in large quantities (Dunn, J.J., Lade, B.A. and Barbour, A.G. (1990) Protein Expression and Purification 1, 159-168). The truncated protein (OspA-257) is water-soluble, retains the ability to bind antibodies from the sera of Lyme disease patients and may prove useful in development of a vaccine against Lyme disease. We have used far UV circular dichroism (CD) and fluorescence spectroscopy to characterize the secondary structure of and to study conformational changes in OspA-257. CD spectra from 260 to 178 nm predict five classes of secondary structure: alpha-helix (11%), anti-parallel beta-sheet (32%), parallel beta-sheet (10%), beta-turns (18%) and aperiodic structures (including 'random coil') (30%). Analysis of the primary sequence of OspA yielded the most likely sites for alpha-helical regions (residues 100-107, 121-134, 253-273) and for antigenic determinants (Lys-46, Asp-82, Lys-231). CD spectra of the native protein show little change from pH 3 to 11. Thermal denaturation curves, indicate that 'salt bridges' play a role in stabilizing the native protein. Both thermal and chemical denaturations that eliminate all secondary structure as judged by CD or fluorescence are reversible. Denaturation by guanidine-HCl (gdn-HCl) appears to be a cooperative, two-state transition, as indicated by a sudden change in the CD spectrum at approximately 0.75 M gdn-HCl, and an isodichroic point at 208 nm in all CD spectra measured from 0.0-1.75 M gdn-HCl.
Asunto(s)
Antígenos Bacterianos/química , Antígenos de Superficie/química , Proteínas de la Membrana Bacteriana Externa , Proteínas Bacterianas/química , Grupo Borrelia Burgdorferi/química , Borrelia burgdorferi , Lipoproteínas , Proteínas de la Membrana/química , Secuencia de Aminoácidos , Anticuerpos Antibacterianos/inmunología , Antígenos Bacterianos/inmunología , Antígenos de Superficie/inmunología , Proteínas Bacterianas/inmunología , Vacunas Bacterianas , Grupo Borrelia Burgdorferi/inmunología , Dicroismo Circular , Enfermedad de Lyme/inmunología , Proteínas de la Membrana/inmunología , Datos de Secuencia Molecular , Conformación Proteica , Desnaturalización Proteica , Espectrometría de FluorescenciaRESUMEN
Outer surface protein A (OspA) is a major antigen of Borrelia burgdorferi, the etiological agent of Lyme disease. A recombinant form of OspA (OspA-257) from B. burgdorferi, strain B31, contains 257 amino acids and a single tryptophan residue at position 216 (Trp-216). Mapping studies indicate that Trp-216 is involved in the epitope for the agglutinating monoclonal antibody 105.5. However, the fluorescence emission maximum of the native protein is 330 nm, indicating that Trp-216 is not solvent-exposed. Primary structure analysis suggests an alpha-helical conformation for residues approx. 204-217, which, if located on the protein surface, would allow Trp-216 to be buried, while leaving hydrophilic residues on the opposite side of the helix exposed. This helix would place Lys-212 within approx. 6 A of Trp-216; the presence of such a positively-charged residue can, in principle, be ascertained from fluorescence quenching studies. Stern-Volmer plots confirm that Trp-216 is indeed buried in the native protein, but is readily accessible to the small polar quencher, acrylamide. Furthermore, the dominant component of the fluorescence emission shows only weak dynamic quenching by the positively-charged quencher, Cs+, while the minor component undergoes static quenching by I-, indicating the proximity of a positively-charged residue. These data are consistent with the existence of an alpha-helix from residues 204-217 in the predicted orientation at the protein surface, hence indicating the structure of the antigenic determinant.
Asunto(s)
Antígenos de Superficie/química , Proteínas de la Membrana Bacteriana Externa/química , Grupo Borrelia Burgdorferi/química , Borrelia burgdorferi , Epítopos/química , Lipoproteínas , Secuencia de Aminoácidos , Vacunas Bacterianas , Lisina/química , Datos de Secuencia Molecular , Estructura Molecular , Espectrometría de Fluorescencia/métodos , Triptófano/químicaRESUMEN
We have determined the genome sequences of two closely related lytic bacteriophages, SP6 and K1-5, which infect Salmonella typhimurium LT2 and Escherichia coli serotypes K1 and K5, respectively. The genome organization of these phages is almost identical with the notable exception of the tail fiber genes that confer the different host specificities. The two phages have diverged extensively at the nucleotide level but they are still more closely related to each other than either is to any other phage currently characterized. The SP6 and K1-5 genomes contain, respectively, 43,769 bp and 44,385 bp, with 174 bp and 234 bp direct terminal repeats. About half of the 105 putative open reading frames in the two genomes combined show no significant similarity to database proteins with a known or predicted function that is obviously beneficial for growth of a bacteriophage. The overall genome organization of SP6 and K1-5 is comparable to that of the T7 group of phages, although the specific order of genes coding for DNA metabolism functions has not been conserved. Low levels of nucleotide similarity between genomes in the T7 and SP6 groups suggest that they diverged a long time ago but, on the basis of this conservation of genome organization, they are expected to have retained similar developmental strategies.
Asunto(s)
Bacteriófago T7/genética , Escherichia coli/virología , Exones/genética , Intrones/genética , Fagos de Salmonella/genética , Proteínas de la Cola de los Virus/genética , Secuencia de Bases , Escherichia coli/genética , Escherichia coli/metabolismo , Regulación de la Expresión Génica , Genes Reguladores/genética , Genoma , Datos de Secuencia Molecular , Podoviridae/genética , Regiones Promotoras Genéticas/genética , Salmonella typhimurium , Homología de Secuencia de Ácido Nucleico , Proteínas de la Cola de los Virus/química , Virión/genéticaRESUMEN
Analysis of polyA-selected RNAs and cDNA clones from several human cell lines revealed the presence of a 93-bp exon in the PI kinase homology region of DNA-PKcs that was not present in the cDNA sequence derived from HeLa cell cDNA clones. RT-PCR showed that most DNA-PKcs mRNAs in human cells have this exon. Thus, the nascent DNA-PKcs polypeptide is composed of 4127 aa and has a predicted mol. wt. of 469021 (470 kDa).
Asunto(s)
Empalme Alternativo/genética , Proteínas de Unión al ADN , Exones/genética , Proteínas Serina-Treonina Quinasas/genética , ARN Mensajero/genética , Secuencia de Aminoácidos , Secuencia de Bases , Proteína Quinasa Activada por ADN , Células HeLa , Humanos , Datos de Secuencia Molecular , Proteínas Nucleares , Sistemas de Lectura Abierta/genéticaRESUMEN
The use of strings of contiguous short primers rather than a single long primer has the potential to greatly diminish the cost, time and management efforts associated with sequencing by primer walking. For maximum impact, this chemistry must be adaptable to current fluorescent automated sequencers, which, while allowing automated acquisition of data, have reduced sensitivity when compared with radioactive-based sequencing. The ability to use hexamer strings on Applied Biosystems DNA sequencers has now been demonstrated for single-stranded templates. Procedures are described that allow sequence to be obtained up to 400 bases from the priming site. Signal strength is sufficient in this region to allow single base resolution and highly accurate automatic base calling to be performed by the sequencer. While these conditions can no doubt be further optimized, these results show the feasibility of inexpensive primer walking using hexamer string primers on currently available commercial DNA sequencers. This should have a wide range of applications from genome sequencing projects to the sequencing of cDNA clones without the necessity of creating nested deletions or the necessity of spending inordinate amounts of time and money on oligonucleotide synthesis.
Asunto(s)
Análisis de Secuencia de ADN/métodos , Secuencia de Bases , Fluorescencia , Datos de Secuencia MolecularRESUMEN
Two strategies for DNA sequencing by primer walking using short oligonucleotide primer libraries have been successfully employed along with capillary electrophoresis using replaceable polymer solutions of linear polyacrylamide and fluorescence detection. A 3.5-kb stretch of the single-stranded M13mp18 template was sequenced with T7 PRISM dye-terminator/Sequenase chemistry. An in-house base-calling program offered read lengths of roughly 450 bases with an average of 97.8% accuracy.
Asunto(s)
Electroforesis Capilar/métodos , Análisis de Secuencia de ADN/métodos , Secuencia de Bases , Cartilla de ADN/genética , Datos de Secuencia Molecular , Moldes GenéticosRESUMEN
Thymidine kinase type II is an important part of the pyrimidine salvage pathway. The thymidine kinase gene from the thermophilic eubacterium Rhodothermus marinus was cloned, sequenced and overexpressed. The gene is 639 bp and encodes a protein of 213 amino acids with a calculated molecular mass of 23.6 kDa. It shows homology to other thymidine kinase proteins from eukaryotic and prokaryotic organisms. The recombinant protein is inhibited by dNTPs but not by dNDPs. It is a tetramer in its native state. Its optimum temperature of activity is 65 degrees C and it has a half life of 15 min at 90 degrees C. This is the first thymidine kinase to be described from a thermophilic bacterium.
Asunto(s)
Bacterias Gramnegativas/genética , Timidina Quinasa/genética , Secuencia de Aminoácidos , Clonación Molecular , Genes Bacterianos , Bacterias Gramnegativas/enzimología , Datos de Secuencia Molecular , Peso Molecular , Proteínas Recombinantes/biosíntesis , Timidina Quinasa/química , Timidina Quinasa/metabolismoRESUMEN
Isolating high-priority segments of genomes greatly enhances the efficiency of next-generation sequencing (NGS) by allowing researchers to focus on their regions of interest. For the 2010-11 DNA Sequencing Research Group (DSRG) study, we compared outcomes from two leading companies, Agilent Technologies (Santa Clara, CA, USA) and Roche NimbleGen (Madison, WI, USA), which offer custom-targeted genomic enrichment methods. Both companies were provided with the same genomic sample and challenged to capture identical genomic locations for DNA NGS. The target region totaled 3.5 Mb and included 31 individual genes and a 2-Mb contiguous interval. Each company was asked to design its best assay, perform the capture in replicates, and return the captured material to the DSRG-participating laboratories. Sequencing was performed in two different laboratories on Genome Analyzer IIx systems (Illumina, San Diego, CA, USA). Sequencing data were analyzed for sensitivity, specificity, and coverage of the desired regions. The success of the enrichment was highly dependent on the design of the capture probes. Overall, coverage variability was higher for the Agilent samples. As variant discovery is the ultimate goal for a typical targeted sequencing project, we compared samples for their ability to sequence single-nucleotide polymorphisms (SNPs) as a test of the ability to capture both chromosomes from the sample. In the targeted regions, we detected 2546 SNPs with the NimbleGen samples and 2071 with Agilent's. When limited to the regions that both companies included as baits, the number of SNPs was â¼1000 for each, with Agilent and NimbleGen finding a small number of unique SNPs not found by the other.
Asunto(s)
ADN/genética , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Polimorfismo de Nucleótido Simple/genética , Cromosomas/genética , Genoma Humano , Genotipo , HumanosRESUMEN
The enzymatic characteristics of a protein kinase purified from thylakoids are further described. ATP (KM approximately 30 microM) and Mg2+ ion (greater than 1.0 mM) were required for activity, while ADP was a competitive inhibitor (Ki = 100 microM). Activity was 55% inhibited by the sulfhydryl inhibitor p-chloromercuribenzoate (1 mM) and was less sensitive to substituted maleimides. Lysine-rich histones (H1) were utilized as an exogenous phosphorylation substrate both by thylakoid-bound kinase and by isolated enzyme; threonine was predominantly phosphorylated by the in situ enzyme, whereas the isolated enzyme phosphorylated closely related serine residues as determined by peptide mapping. Detergents that proved useful in extracting the kinase from thylakoids markedly inhibited activity of the isolated enzyme, whereas Triton X-100 and 3-[(3-cholamidopropyl)dimethylammonio]-1-propane-sulfonic acid had little effect. The enzyme could be freed from detergent and behaved as an active monomer on size-exclusion chromatography. The phosphate contents of the light-harvesting chlorophyll a/b protein complex of photosystem II isolated from maximally phosphorylated thylakoid membranes of spinach and pea were equivalent to approximately 6% and approximately 19% phosphorylation, respectively. Corresponding values for nonphosphorylated membranes were approximately 3% and approximately 14.5%.
Asunto(s)
Plantas/enzimología , Proteínas Quinasas/metabolismo , Adenosina Difosfato/metabolismo , Adenosina Trifosfato/metabolismo , Histonas/farmacología , Magnesio/metabolismo , Fosfatos/análisis , Especificidad por SustratoRESUMEN
Endogenous dephosphorylation of the light-harvesting chlorophyll-protein complex of photosystem II in pea (Pisum sativum, L. cv Progress 9) thylakoids drives the state 2 to state 1 transition; the responsible enzyme is a thylakoid-bound, fluoride-sensitive phosphatase with a pH optimum of 8.0 (Bennett J [1980] Eur J Biochem 104: 85-89). An enzyme with these characteristics was isolated from well-washed thylakoids. Its molecular mass was estimated at 51.5 kD, and this monomer was catalytically active, although the activity was labile. The active site could be labeled with orthophosphate at pH 5.0. High levels of alkaline phosphatase activity were obtained with the assay substrate, 4-methylumbelliferyl phosphate (350 micromoles per minute per milligram purified enzyme). The isolated enzyme functioned as a phosphoprotein phosphatase toward phosphorylated histone III-S and phosphorylated, photosystem II-enriched particles from pea, with typical activities in the range of 200 to 600 picomoles per minute per milligram enzyme. These activities all had a pH optimum of 8.0 and were fluoride sensitive. The enzyme required magnesium ion for maximal activity but was not dependent on this ion. Evidence supporting a putative function for this phosphatase in dephosphorylation of thylakoid proteins came from the inhibition of this process by a polyclonal antibody preparation raised against the partially purified enzyme.
RESUMEN
Steady-state and time-resolved fluorescence techniques were used to monitor pH-induced conformational changes in spinach ferredoxin. An increase was seen in the wave-length maximum of tryptophan-73 (Trp-73) emission, from 325 nm below pH 6.0 to 342 nm above pH 7.0, indicating significantly diminished hydrophobicity, at pH 7.0, in the environment of the indole ring. Raising the solution pH from 6.0 to 7.6 also decreased the binding of the detergent Brij-96, showing that the ferredoxin molecule as a whole became more hydrophilic at higher pH. Nonionic (acrylamide) and ionic (I- and Cs+) quenchers were used to probe the tryptophan environment. Trp-73 is partially shielded from I-, presumably by negatively charged residues, as predicted from the amino acid sequence and three-dimensional structure of plant-type ferredoxins. Ionic strength and pH effects on tryptophan fluorescence lifetimes follow a pattern common to single-tryptophan proteins: the emission decays can be fit to a biexponential model in which the lifetime of the excited state increases with increasing pH. The indication of a pH-induced conformational change in the range pH 6.0 to 7.6 is discussed with reference to the physiological association of ferredoxin with ferredoxin:NADP+ oxidoreductase and the rise in chloroplast stromal pH in the light.
Asunto(s)
Ferredoxinas/ultraestructura , Detergentes/química , Ferredoxinas/química , Concentración de Iones de Hidrógeno , Liposomas/química , Concentración Osmolar , Plantas , Conformación Proteica , Espectrometría de Fluorescencia , Triptófano/químicaRESUMEN
A 30-kb genomic segment containing the promoter and first 9 exons of PRKDC, the gene encoding the catalytic subunit (DNA-PKcs) of the human DNA-activated protein kinase, DNA-PK, was isolated and partially sequenced. Sequence comparison with the NCBI nonredundant database revealed the locations of the first 13 exons of the upstream gene, MCM4. MCM4 is an essential component of a protein complex that prevents DNA from being replicated more than once per cell cycle. The MCM4 and DNA-PKcs promoters are in CpG islands separated by approximately 700 bp, and transcription from each initiates at multiple, closely spaced sites. Both promoters lack TATA boxes, and the MCM4 promoter also lacks an initiator (Inr) element but has an inverted CCAAT box. The DNA-PKcs promoter has an Inr-like sequence as well as a downstream MED-1 element. The two promoters appear to function independently, as sequences required for core promoter activity do not overlap, and sequences extending into the 5' region of each gene had little or no effect on transcription of the other gene, as shown in transient transfection assays. The arrangement of the PRKDC/MCM4 gene pair is similar to that of the ATM/E14(NPAT) gene pair. ATM, the product of the gene mutated in ataxia telangiectasia, and DNA-PKcs function in pathways that detect or repair DNA damage and are members of a family of large, serine/threonine kinases that are closely related to phosphatidylinositol 3 kinases.
Asunto(s)
Proteínas de Ciclo Celular/genética , Cromosomas Humanos Par 8 , ADN Helicasas , Proteínas de Unión al ADN , Regiones Promotoras Genéticas , Proteínas Serina-Treonina Quinasas/genética , Secuencia de Aminoácidos , Animales , Secuencia de Bases , Proteínas de Ciclo Celular/metabolismo , Proteína Quinasa Activada por ADN , Humanos , Ratones , Componente 4 del Complejo de Mantenimiento de Minicromosoma , Datos de Secuencia Molecular , Proteínas Nucleares , Proteínas Serina-Treonina Quinasas/metabolismo , Proteínas Recombinantes/genética , Proteínas Recombinantes/metabolismo , Transcripción GenéticaRESUMEN
Penetration of membranes by diphtheria toxin in vivo is at least partially triggered by a low pH-induced conformational change occurring within the lumen of an acidic organelle. In order to gain insight into the nature of this change the behavior of the toxin at high pH was characterized and compared to that previously determined at low pH. We find that near pH 10.5 a major conformational change occurs. This change is accompanied by a marked decrease in fluorescence intensity, a red shift in fluorescence emission maximum, and increased susceptibility of protein fluorescence to acrylamide quenching. Differential scanning calorimetry shows that the high pH conformational change involves a cooperative endothermic unfolding transition. These changes at high pH are very similar to those induced by low pH, supporting the conclusion that the changes at low pH also involve a denaturation-like process. In addition, at high pH the toxin gains the ability to bind to model membranes, again similar to its behavior at low pH. On the basis of these studies we conclude that exposure of hydrophobic sequences due to partial unfolding is one dominating component in inducing hydrophobic behavior at both high and low pH, but that at low pH Asp/Glu protonation also contributes to hydrophobicity.
Asunto(s)
Toxina Diftérica/química , Membranas Artificiales , Rastreo Diferencial de Calorimetría , Concentración de Iones de Hidrógeno , Micelas , Permeabilidad , Conformación Proteica , Desnaturalización Proteica , Espectrometría de FluorescenciaRESUMEN
The genesis of small repeats involved in infrequent recombinations in plant mitochondrial genomes remains unclear. We propose that at least some of the small repeats are generated in a similar way to the large, highly recombinogenic, plant mitochondrial repeats. A 314-bp sequence was detected as a small, rarely recombining mitochondrial repeat in the genus Phaseolus. Two of the recombinational forms were predominant, while two others were found in substoichiometric amounts in the species P. vulgaris, P. polyanthus and P. coccineus. However, the pairs of predominant and substoichiometric forms were distinct in each genome, indicating that a mechanism other than recombination is responsible for their maintenance in high or low copy number. In P. lunatus, which is phylogenetically quite remote from the other species examined, only one form of the 314-bp repeat was predominant, while the other forms were present in substoichiometric amounts. In this genome, we also identified sequences containing the terminal 11 or 7 bp of the 314-bp repeat. These configurations could serve as intermediates during generation of the repeat. We presume that two site-specific recombinations between the intermediates and the predominant form found in P. lunatus resulted in creation of the two new forms of the repeat. The fourth form of the repeat appeared after a further recombination that occurred at the substoichiometric level. The nature of this recombination, whether site-specific or homologous, is discussed. Beyond the evident similarities between the model presented in this work and the three-recombination models previously proposed to explain the formation of large, frequently recombining repeats, we did not detect any specific deletion associated with generation of the repeat.
Asunto(s)
ADN Mitocondrial/genética , Fabaceae/genética , Genoma de Planta , Plantas Medicinales , Secuencias Repetitivas de Ácidos Nucleicos , Secuencia de Bases , Fabaceae/ultraestructura , Datos de Secuencia Molecular , Recombinación GenéticaRESUMEN
A gene encoding a DNA polymerase I from the thermophilic eubacterium Rhodothermus marinus was identified. The gene was cloned, sequenced and expressed in Escherichia coli. The gene is 2772 bp long and encodes a protein of 924 amino acids with a calculated molecular mass of 104.8 kDa. Sequence analysis showed that a generally conserved Phe residue in the O-helix is substituted by a Tyr (position 756) in the R. marinus enzyme. A Tyr in this position decreases the discrimination against dideoxynucleotides which is a major advantage in DNA sequencing. The protein was purified, characterized and showed to contain specific DNA-polymerization activity of 3100 units/mg of protein, 5'-->3' exonuclease activity and a 3'-->5' proofreading activity. Its optimum activity was at 55 degrees C and it had a half-life of 2 min at 90 degrees C. A truncated form of the enzyme lacking the 5'-->3' exonuclease domain was also expressed in E. coli. It had a specific DNA-polymerization activity of 5000 units/mg of protein and lacked the 5'-->3' exonuclease activity. Its optimum activity was at 65 degrees C and it had a half-life of 11 min at 90 degrees C. It was usable for DNA sequencing. This is the first thermostable DNA polymerase described with the O-helix Phe-->Tyr substitution.