Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 6 de 6
Filter
Add more filters








Database
Language
Publication year range
1.
Proc Natl Acad Sci U S A ; 103(42): 15582-7, 2006 Oct 17.
Article in English | MEDLINE | ID: mdl-17030794

ABSTRACT

Rhodococcus sp. RHA1 (RHA1) is a potent polychlorinated biphenyl-degrading soil actinomycete that catabolizes a wide range of compounds and represents a genus of considerable industrial interest. RHA1 has one of the largest bacterial genomes sequenced to date, comprising 9,702,737 bp (67% G+C) arranged in a linear chromosome and three linear plasmids. A targeted insertion methodology was developed to determine the telomeric sequences. RHA1's 9,145 predicted protein-encoding genes are exceptionally rich in oxygenases (203) and ligases (192). Many of the oxygenases occur in the numerous pathways predicted to degrade aromatic compounds (30) or steroids (4). RHA1 also contains 24 nonribosomal peptide synthase genes, six of which exceed 25 kbp, and seven polyketide synthase genes, providing evidence that rhodococci harbor an extensive secondary metabolism. Among sequenced genomes, RHA1 is most similar to those of nocardial and mycobacterial strains. The genome contains few recent gene duplications. Moreover, three different analyses indicate that RHA1 has acquired fewer genes by recent horizontal transfer than most bacteria characterized to date and far fewer than Burkholderia xenovorans LB400, whose genome size and catabolic versatility rival those of RHA1. RHA1 and LB400 thus appear to demonstrate that ecologically similar bacteria can evolve large genomes by different means. Overall, RHA1 appears to have evolved to simultaneously catabolize a diverse range of plant-derived compounds in an O(2)-rich environment. In addition to establishing RHA1 as an important model for studying actinomycete physiology, this study provides critical insights that facilitate the exploitation of these industrially important microorganisms.


Subject(s)
Bacterial Proteins , Genome, Bacterial , Metabolism , Rhodococcus , Bacterial Proteins/classification , Bacterial Proteins/genetics , Biological Evolution , Chromosome Mapping , Molecular Sequence Data , Phylogeny , Rhodococcus/genetics , Rhodococcus/metabolism
2.
Genome Res ; 15(5): 603-15, 2005 May.
Article in English | MEDLINE | ID: mdl-15837805

ABSTRACT

We have identified longevity-associated genes in a long-lived Caenorhabditis elegans daf-2 (insulin/IGF receptor) mutant using serial analysis of gene expression (SAGE), a method that efficiently quantifies large numbers of mRNA transcripts by sequencing short tags. Reduction of daf-2 signaling in these mutant worms leads to a doubling in mean lifespan. We prepared C. elegans SAGE libraries from 1, 6, and 10-d-old adult daf-2 and from 1 and 6-d-old control adults. Differences in gene expression between daf-2 libraries representing different ages and between daf-2 versus control libraries identified not only single genes, but whole gene families that were differentially regulated. These gene families are part of major metabolic pathways including lipid, protein, and energy metabolism, stress response, and cell structure. Similar expression patterns of closely related family members emphasize the importance of these genes in aging-related processes. Global analysis of metabolism-associated genes showed hypometabolic features in mid-life daf-2 mutants that diminish with advanced age. Comparison of our results to recent microarray studies highlights sets of overlapping genes that are highly conserved throughout evolution and thus represent strong candidate genes that control aging and longevity.


Subject(s)
Caenorhabditis elegans Proteins/genetics , Caenorhabditis elegans/genetics , Gene Expression Profiling , Gene Expression Regulation , Longevity/genetics , Mutation/genetics , Receptor, Insulin/genetics , Age Factors , Animals , Caenorhabditis elegans/physiology , Energy Metabolism/genetics , Oligonucleotide Array Sequence Analysis , Signal Transduction/genetics
3.
J Bacteriol ; 186(22): 7783-95, 2004 Nov.
Article in English | MEDLINE | ID: mdl-15516593

ABSTRACT

Rhodococcus sp. strain RHA1, a potent polychlorinated-biphenyl (PCB)-degrading strain, contains three linear plasmids ranging in size from 330 to 1,100 kb. As part of a genome sequencing project, we report here the complete sequence and characterization of the smallest and least-well-characterized of the RHA1 plasmids, pRHL3. The plasmid is an actinomycete invertron, containing large terminal inverted repeats with a tightly associated protein and a predicted open reading frame (ORF) that is similar to that of a mycobacterial rep gene. The pRHL3 plasmid has 300 putative genes, almost 21% of which are predicted to have a catabolic function. Most of these are organized into three clusters. One of the catabolic clusters was predicted to include limonene degradation genes. Consistent with this prediction, RHA1 grew on limonene, carveol, or carvone as the sole carbon source. The plasmid carries three cytochrome P450-encoding (CYP) genes, a finding consistent with the high number of CYP genes found in other actinomycetes. Two of the CYP genes appear to belong to novel families; the third belongs to CYP family 116 but appears to belong to a novel class based on the predicted domain structure of its reductase. Analyses indicate that pRHL3 also contains four putative "genomic islands" (likely to have been acquired by horizontal transfer), insertion sequence elements, 19 transposase genes, and a duplication that spans two ORFs. One of the genomic islands appears to encode resistance to heavy metals. The plasmid does not appear to contain any housekeeping genes. However, each of the three catabolic clusters contains related genes that appear to be involved in glucose metabolism.


Subject(s)
Plasmids/genetics , Polychlorinated Biphenyls/metabolism , Rhodococcus/genetics , Amino Acid Sequence , Bacterial Proteins/genetics , Base Sequence , Biodegradation, Environmental , Cytochrome P-450 Enzyme System/genetics , Gene Transfer, Horizontal , Molecular Sequence Data , Open Reading Frames/genetics , Phylogeny , Replication Origin , Rhodococcus/metabolism , Sequence Analysis, DNA , Telomere
4.
Genome Res ; 14(3): 478-90, 2004 Mar.
Article in English | MEDLINE | ID: mdl-14962987

ABSTRACT

We report 80,388 ESTs from 23 Atlantic salmon (Salmo salar) cDNA libraries (61,819 ESTs), 6 rainbow trout (Oncorhynchus mykiss) cDNA libraries (14,544 ESTs), 2 chinook salmon (Oncorhynchus tshawytscha) cDNA libraries (1317 ESTs), 2 sockeye salmon (Oncorhynchus nerka) cDNA libraries (1243 ESTs), and 2 lake whitefish (Coregonus clupeaformis) cDNA libraries (1465 ESTs). The majority of these are 3' sequences, allowing discrimination between paralogs arising from a recent genome duplication in the salmonid lineage. Sequence assembly reveals 28,710 different S. salar, 8981 O. mykiss, 1085 O. tshawytscha, 520 O. nerka, and 1176 C. clupeaformis putative transcripts. We annotate the submitted portion of our EST database by molecular function. Higher- and lower-molecular-weight fractions of libraries are shown to contain distinct gene sets, and higher rates of gene discovery are associated with higher-molecular weight libraries. Pyloric caecum library group annotations indicate this organ may function in redox control and as a barrier against systemic uptake of xenobiotics. A microarray is described, containing 7356 salmonid elements representing 3557 different cDNAs. Analyses of cross-species hybridizations to this cDNA microarray indicate that this resource may be used for studies involving all salmonids.


Subject(s)
Computational Biology/methods , Databases, Genetic , Expressed Sequence Tags , Hybridization, Genetic/genetics , Oligonucleotide Array Sequence Analysis/methods , Salmonidae/genetics , Animals , Gene Library , Genes/genetics , Genes/physiology , Genes, Duplicate/genetics , Molecular Sequence Data , Oncorhynchus/genetics , Oncorhynchus mykiss/genetics , Organ Specificity/genetics , Organ Specificity/physiology , Salmo salar/genetics , Species Specificity
5.
Science ; 300(5624): 1399-404, 2003 May 30.
Article in English | MEDLINE | ID: mdl-12730501

ABSTRACT

We sequenced the 29,751-base genome of the severe acute respiratory syndrome (SARS)-associated coronavirus known as the Tor2 isolate. The genome sequence reveals that this coronavirus is only moderately related to other known coronaviruses, including two human coronaviruses, HCoV-OC43 and HCoV-229E. Phylogenetic analysis of the predicted viral proteins indicates that the virus does not closely resemble any of the three previously known groups of coronaviruses. The genome sequence will aid in the diagnosis of SARS virus infection in humans and potential animal hosts (using polymerase chain reaction and immunological tests), in the development of antivirals (including neutralizing antibodies), and in the identification of putative epitopes for vaccine development.


Subject(s)
Genome, Viral , RNA, Viral/genetics , Severe acute respiratory syndrome-related coronavirus/genetics , Viral Proteins/genetics , 3' Untranslated Regions , 5' Untranslated Regions , Animals , Base Sequence , Conserved Sequence , Coronavirus/classification , Coronavirus/genetics , Coronavirus M Proteins , Coronavirus Nucleocapsid Proteins , DNA, Complementary , Frameshifting, Ribosomal , Humans , Membrane Glycoproteins/chemistry , Membrane Glycoproteins/genetics , Nucleocapsid Proteins/chemistry , Nucleocapsid Proteins/genetics , Open Reading Frames , Phylogeny , RNA, Viral/isolation & purification , RNA-Dependent RNA Polymerase/chemistry , RNA-Dependent RNA Polymerase/genetics , Regulatory Sequences, Nucleic Acid , Severe acute respiratory syndrome-related coronavirus/classification , Severe acute respiratory syndrome-related coronavirus/isolation & purification , Sequence Analysis, DNA , Severe Acute Respiratory Syndrome/virology , Spike Glycoprotein, Coronavirus , Viral Envelope Proteins/chemistry , Viral Envelope Proteins/genetics , Viral Matrix Proteins/chemistry , Viral Matrix Proteins/genetics , Viral Proteins/chemistry
6.
Nucleic Acids Res ; 30(11): 2460-8, 2002 Jun 01.
Article in English | MEDLINE | ID: mdl-12034834

ABSTRACT

We describe an efficient high-throughput method for accurate DNA sequencing of entire cDNA clones. Developed as part of our involvement in the Mammalian Gene Collection full-length cDNA sequencing initiative, the method has been used and refined in our laboratory since September 2000. Amenable to large scale projects, we have used the method to generate >7 Mb of accurate sequence from 3695 candidate full-length cDNAs. Sequencing is accomplished through the insertion of Mu transposon into cDNAs, followed by sequencing reactions primed with Mu-specific sequencing primers. Transposon insertion reactions are not performed with individual cDNAs but rather on pools of up to 96 clones. This pooling strategy reduces the number of transposon insertion sequencing libraries that would otherwise be required, reducing the costs and enhancing the efficiency of the transposon library construction procedure. Sequences generated using transposon-specific sequencing primers are assembled to yield the full-length cDNA sequence, with sequence editing and other sequence finishing activities performed as required to resolve sequence ambiguities. Although analysis of the many thousands (22 785) of sequenced Mu transposon insertion events revealed a weak sequence preference for Mu insertion, we observed insertion of the Mu transposon into 1015 of the possible 1024 5mer candidate insertion sites.


Subject(s)
Bacteriophage mu/genetics , DNA Transposable Elements/genetics , DNA, Complementary/genetics , Mutagenesis, Insertional/genetics , Recombination, Genetic/genetics , Sequence Analysis, DNA/methods , Base Composition , Cloning, Molecular , DNA Primers/genetics , Gene Library , Genetic Vectors/genetics , Monte Carlo Method , Physical Chromosome Mapping/methods , Sensitivity and Specificity , Sequence Analysis, DNA/economics , Substrate Specificity , Time Factors
SELECTION OF CITATIONS
SEARCH DETAIL