Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 16 de 16
Filtrar
1.
Nucleic Acids Res ; 40(Database issue): D632-40, 2012 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-22102570

RESUMEN

ProPortal (http://proportal.mit.edu/) is a database containing genomic, metagenomic, transcriptomic and field data for the marine cyanobacterium Prochlorococcus. Our goal is to provide a source of cross-referenced data across multiple scales of biological organization--from the genome to the ecosystem--embracing the full diversity of ecotypic variation within this microbial taxon, its sister group, Synechococcus and phage that infect them. The site currently contains the genomes of 13 Prochlorococcus strains, 11 Synechococcus strains and 28 cyanophage strains that infect one or both groups. Cyanobacterial and cyanophage genes are clustered into orthologous groups that can be accessed by keyword search or through a genome browser. Users can also identify orthologous gene clusters shared by cyanobacterial and cyanophage genomes. Gene expression data for Prochlorococcus ecotypes MED4 and MIT9313 allow users to identify genes that are up or downregulated in response to environmental stressors. In addition, the transcriptome in synchronized cells grown on a 24-h light-dark cycle reveals the choreography of gene expression in cells in a 'natural' state. Metagenomic sequences from the Global Ocean Survey from Prochlorococcus, Synechococcus and phage genomes are archived so users can examine the differences between populations from diverse habitats. Finally, an example of cyanobacterial population data from the field is included.


Asunto(s)
Bacteriófagos/genética , Bases de Datos Genéticas , Prochlorococcus/genética , Genoma Bacteriano , Genoma Viral , Metagenómica , Familia de Multigenes , Prochlorococcus/virología , Estrés Fisiológico/genética , Synechococcus/genética , Biología de Sistemas , Integración de Sistemas , Transcripción Genética
2.
Proc Natl Acad Sci U S A ; 108(39): E757-64, 2011 Sep 27.
Artículo en Inglés | MEDLINE | ID: mdl-21844365

RESUMEN

Cyanophages infecting the marine cyanobacteria Prochlorococcus and Synechococcus encode and express genes for the photosynthetic light reactions. Sequenced cyanophage genomes lack Calvin cycle genes, however, suggesting that photosynthetic energy harvested via phage proteins is not used for carbon fixation. We report here that cyanophages carry and express a Calvin cycle inhibitor, CP12, whose host homologue directs carbon flux from the Calvin cycle to the pentose phosphate pathway (PPP). Phage CP12 was coexpressed with phage genes involved in the light reactions, deoxynucleotide biosynthesis, and the PPP, including a transaldolase gene that is the most prevalent PPP gene in cyanophages. Phage transaldolase was purified to homogeneity from several strains and shown to be functional in vitro, suggesting that it might facilitate increased flux through this key reaction in the host PPP, augmenting production of NADPH and ribose 5-phosphate. Kinetic measurements of phage and host transaldolases revealed that the phage enzymes have k(cat)/K(m) values only approximately one third of the corresponding host enzymes. The lower efficiency of phage transaldolase may be a tradeoff for other selective advantages such as reduced gene size: we show that more than half of host-like cyanophage genes are significantly shorter than their host homologues. Consistent with decreased Calvin cycle activity and increased PPP and light reaction activity under infection, the host NADPH/NADP ratio increased two-fold in infected cells. We propose that phage-augmented NADPH production fuels deoxynucleotide biosynthesis for phage replication, and that the selection pressures molding phage genomes involve fitness advantages conferred through mobilization of host energy stores.


Asunto(s)
Bacteriófagos/genética , Carbono/metabolismo , Cianobacterias/metabolismo , Genes Virales , Bacteriófagos/enzimología , Cianobacterias/enzimología , Cianobacterias/virología , Cinética , Datos de Secuencia Molecular , Transaldolasa/metabolismo , Transcripción Genética
3.
Nucleic Acids Res ; 38(Database issue): D396-400, 2010 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-19906701

RESUMEN

Since 2003, MicrobesOnline (http://www.microbesonline.org) has been providing a community resource for comparative and functional genome analysis. The portal includes over 1000 complete genomes of bacteria, archaea and fungi and thousands of expression microarrays from diverse organisms ranging from model organisms such as Escherichia coli and Saccharomyces cerevisiae to environmental microbes such as Desulfovibrio vulgaris and Shewanella oneidensis. To assist in annotating genes and in reconstructing their evolutionary history, MicrobesOnline includes a comparative genome browser based on phylogenetic trees for every gene family as well as a species tree. To identify co-regulated genes, MicrobesOnline can search for genes based on their expression profile, and provides tools for identifying regulatory motifs and seeing if they are conserved. MicrobesOnline also includes fast phylogenetic profile searches, comparative views of metabolic pathways, operon predictions, a workbench for sequence analysis and integration with RegTransBase and other microbial genome resources. The next update of MicrobesOnline will contain significant new functionality, including comparative analysis of metagenomic sequence data. Programmatic access to the database, along with source code and documentation, is available at http://microbesonline.org/programmers.html.


Asunto(s)
Bacterias/genética , Biología Computacional/métodos , Bases de Datos Genéticas , Bases de Datos de Ácidos Nucleicos , Algoritmos , Biología Computacional/tendencias , Bases de Datos de Proteínas , Perfilación de la Expresión Génica , Genoma Bacteriano , Almacenamiento y Recuperación de la Información/métodos , Internet , Análisis de Secuencia por Matrices de Oligonucleótidos , Estructura Terciaria de Proteína , Programas Informáticos
4.
Sci Total Environ ; 805: 150121, 2022 Jan 20.
Artículo en Inglés | MEDLINE | ID: mdl-34534872

RESUMEN

Current estimates of COVID-19 prevalence are largely based on symptomatic, clinically diagnosed cases. The existence of a large number of undiagnosed infections hampers population-wide investigation of viral circulation. Here, we quantify the SARS-CoV-2 concentration and track its dynamics in wastewater at a major urban wastewater treatment facility in Massachusetts, between early January and May 2020. SARS-CoV-2 was first detected in wastewater on March 3. SARS-CoV-2 RNA concentrations in wastewater correlated with clinically diagnosed new COVID-19 cases, with the trends appearing 4-10 days earlier in wastewater than in clinical data. We inferred viral shedding dynamics by modeling wastewater viral load as a convolution of back-dated new clinical cases with the average population-level viral shedding function. The inferred viral shedding function showed an early peak, likely before symptom onset and clinical diagnosis, consistent with emerging clinical and experimental evidence. This finding suggests that SARS-CoV-2 concentrations in wastewater may be primarily driven by viral shedding early in infection. This work shows that longitudinal wastewater analysis can be used to identify trends in disease transmission in advance of clinical case reporting, and infer early viral shedding dynamics for newly infected individuals, which are difficult to capture in clinical investigations.


Asunto(s)
COVID-19 , SARS-CoV-2 , Humanos , ARN Viral , Esparcimiento de Virus , Aguas Residuales
5.
BMC Med Genomics ; 14(1): 238, 2021 10 01.
Artículo en Inglés | MEDLINE | ID: mdl-34598685

RESUMEN

BACKGROUND: Polygenic scores-which quantify inherited risk by integrating information from many common sites of DNA variation-may enable a tailored approach to clinical medicine. However, alongside considerable enthusiasm, we and others have highlighted a lack of standardized approaches for score disclosure. Here, we review the landscape of polygenic score reporting and describe a generalizable approach for development of a polygenic score disclosure tool for coronary artery disease. METHODS: We assembled a working group of clinicians, geneticists, data visualization specialists, and software developers. The group reviewed existing polygenic score reports and then designed a two-page mock report for coronary artery disease. We then conducted a qualitative user-experience study with this report using an interview guide focused on comprehension, experience, and attitudes. Interviews were transcribed and analyzed for themes identification to inform report revision. RESULTS: Review of nine existing polygenic score reports from commercial and academic groups demonstrated significant heterogeneity, reinforcing the need for additional efforts to study and standardize score disclosure. Using a newly developed mock score report, we conducted interviews with ten adult individuals (50% females, 70% without prior genetic testing experience, age range 20-70 years) recruited via an online platform. We identified three themes from interviews: (1) visual elements, such as color and simple graphics, enable participants to interpret, relate to, and contextualize their polygenic score, (2) word-based descriptions of risk and polygenic scores presented as percentiles were the best recognized and understood, (3) participants had varying levels of interest in understanding complex genomic information and therefore would benefit from additional resources that can adapt to their individual needs in real time. In response to user feedback, colors used for communicating risk were modified to minimize unintended color associations and odds ratios were removed. All 10 participants expressed interest in receiving a polygenic score report based on their personal genomic information. CONCLUSIONS: Our findings describe a generalizable approach to develop a polygenic score report understandable by potential patients. Although additional studies are needed across a wider spectrum of patient populations, these results are likely to inform ongoing efforts related to polygenic score disclosure within clinical practice.


Asunto(s)
Enfermedad de la Arteria Coronaria/genética , ADN/genética , Herencia Multifactorial , Adulto , Anciano , Femenino , Humanos , Masculino , Persona de Mediana Edad , Estudios Prospectivos , Investigación Cualitativa , Adulto Joven
6.
medRxiv ; 2021 Mar 14.
Artículo en Inglés | MEDLINE | ID: mdl-33758888

RESUMEN

Wastewater-based disease surveillance is a promising approach for monitoring community outbreaks. Here we describe a nationwide campaign to monitor SARS-CoV-2 in the wastewater of 159 counties in 40 U.S. states, covering 13% of the U.S. population from February 18 to June 2, 2020. Out of 1,751 total samples analyzed, 846 samples were positive for SARS-CoV-2 RNA, with overall viral concentrations declining from April to May. Wastewater viral titers were consistent with, and appeared to precede, clinical COVID-19 surveillance indicators, including daily new cases. Wastewater surveillance had a high detection rate (>80%) of SARS-CoV-2 when the daily incidence exceeded 13 per 100,000 people. Detection rates were positively associated with wastewater treatment plant catchment size. To our knowledge, this work represents the largest-scale wastewater-based SARS-CoV-2 monitoring campaign to date, encompassing a wide diversity of wastewater treatment facilities and geographic locations. Our findings demonstrate that a national wastewater-based approach to disease surveillance may be feasible and effective.

7.
Water Res ; 202: 117400, 2021 Sep 01.
Artículo en Inglés | MEDLINE | ID: mdl-34274898

RESUMEN

Wastewater-based disease surveillance is a promising approach for monitoring community outbreaks. Here we describe a nationwide campaign to monitor SARS-CoV-2 in the wastewater of 159 counties in 40 U.S. states, covering 13% of the U.S. population from February 18 to June 2, 2020. Out of 1,751 total samples analyzed, 846 samples were positive for SARS-CoV-2 RNA, with overall viral concentrations declining from April to May. Wastewater viral titers were consistent with, and appeared to precede, clinical COVID-19 surveillance indicators, including daily new cases. Wastewater surveillance had a high detection rate (>80%) of SARS-CoV-2 when the daily incidence exceeded 13 per 100,000 people. Detection rates were positively associated with wastewater treatment plant catchment size. To our knowledge, this work represents the largest-scale wastewater-based SARS-CoV-2 monitoring campaign to date, encompassing a wide diversity of wastewater treatment facilities and geographic locations. Our findings demonstrate that a national wastewater-based approach to disease surveillance may be feasible and effective.


Asunto(s)
COVID-19 , SARS-CoV-2 , Brotes de Enfermedades , Humanos , ARN Viral , Aguas Residuales
8.
Environ Microbiol ; 12(11): 3035-56, 2010 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-20662890

RESUMEN

T4-like myoviruses are ubiquitous, and their genes are among the most abundant documented in ocean systems. Here we compare 26 T4-like genomes, including 10 from non-cyanobacterial myoviruses, and 16 from marine cyanobacterial myoviruses (cyanophages) isolated on diverse Prochlorococcus or Synechococcus hosts. A core genome of 38 virion construction and DNA replication genes was observed in all 26 genomes, with 32 and 25 additional genes shared among the non-cyanophage and cyanophage subsets, respectively. These hierarchical cores are highly syntenic across the genomes, and sampled to saturation. The 25 cyanophage core genes include six previously described genes with putative functions (psbA, mazG, phoH, hsp20, hli03, cobS), a hypothetical protein with a potential phytanoyl-CoA dioxygenase domain, two virion structural genes, and 16 hypothetical genes. Beyond previously described cyanophage-encoded photosynthesis and phosphate stress genes, we observed core genes that may play a role in nitrogen metabolism during infection through modulation of 2-oxoglutarate. Patterns among non-core genes that may drive niche diversification revealed that phosphorus-related gene content reflects source waters rather than host strain used for isolation, and that carbon metabolism genes appear associated with putative mobile elements. As well, phages isolated on Synechococcus had higher genome-wide %G+C and often contained different gene subsets (e.g. petE, zwf, gnd, prnA, cpeT) than those isolated on Prochlorococcus. However, no clear diagnostic genes emerged to distinguish these phage groups, suggesting blurred boundaries possibly due to cross-infection. Finally, genome-wide comparisons of both diverse and closely related, co-isolated genomes provide a locus-to-locus variability metric that will prove valuable for interpreting metagenomic data sets.


Asunto(s)
Bacteriófago T4/genética , Cianobacterias/virología , Ácidos Cetoglutáricos/metabolismo , Myoviridae/genética , Compuestos de Amonio Cuaternario/metabolismo , Agua de Mar/virología , Bacteriófago T4/clasificación , Composición de Base , Evolución Molecular , Variación Genética , Genoma Viral , Metagenómica , Datos de Secuencia Molecular , Myoviridae/clasificación , Nitrógeno/metabolismo , Océanos y Mares , Prochlorococcus/virología , Agua de Mar/microbiología , Análisis de Secuencia de ADN , Synechococcus/virología , Proteínas del Núcleo Viral/genética , Proteínas de la Cola de los Virus/genética , Microbiología del Agua
9.
medRxiv ; 2020 Jul 06.
Artículo en Inglés | MEDLINE | ID: mdl-32607521

RESUMEN

Current estimates of COVID-19 prevalence are largely based on symptomatic, clinically diagnosed cases. The existence of a large number of undiagnosed infections hampers population-wide investigation of viral circulation. Here, we use longitudinal wastewater analysis to track SARS-CoV-2 dynamics in wastewater at a major urban wastewater treatment facility in Massachusetts, between early January and May 2020. SARS-CoV-2 was first detected in wastewater on March 3. Viral titers in wastewater increased exponentially from mid-March to mid-April, after which they began to decline. Viral titers in wastewater correlated with clinically diagnosed new COVID-19 cases, with the trends appearing 4-10 days earlier in wastewater than in clinical data. We inferred viral shedding dynamics by modeling wastewater viral titers as a convolution of back-dated new clinical cases with the viral shedding function of an individual. The inferred viral shedding function showed an early peak, likely before symptom onset and clinical diagnosis, consistent with emerging clinical and experimental evidence. Finally, we found that wastewater viral titers at the neighborhood level correlate better with demographic variables than with population size. This work suggests that longitudinal wastewater analysis can be used to identify trends in disease transmission in advance of clinical case reporting, and may shed light on infection characteristics that are difficult to capture in clinical investigations, such as early viral shedding dynamics.

10.
Nucleic Acids Res ; 33(3): 880-92, 2005.
Artículo en Inglés | MEDLINE | ID: mdl-15701760

RESUMEN

We combine comparative genomic measures and the distance separating adjacent genes to predict operons in 124 completely sequenced prokaryotic genomes. Our method automatically tailors itself to each genome using sequence information alone, and thus can be applied to any prokaryote. For Escherichia coli K12 and Bacillus subtilis, our method is 85 and 83% accurate, respectively, which is similar to the accuracy of methods that use the same features but are trained on experimentally characterized transcripts. In Halobacterium NRC-1 and in Helicobacter pylori, our method correctly infers that genes in operons are separated by shorter distances than they are in E.coli, and its predictions using distance alone are more accurate than distance-only predictions trained on a database of E.coli transcripts. We use microarray data from six phylogenetically diverse prokaryotes to show that combining intergenic distance with comparative genomic measures further improves accuracy and that our method is broadly effective. Finally, we survey operon structure across 124 genomes, and find several surprises: H.pylori has many operons, contrary to previous reports; Bacillus anthracis has an unusual number of pseudogenes within conserved operons; and Synechocystis PCC 6803 has many operons even though it has unusually wide spacings between conserved adjacent genes.


Asunto(s)
Biología Computacional/métodos , Genómica/métodos , Operón , Secuencia de Bases , Epsilonproteobacteria/genética , Perfilación de la Expresión Génica , Genoma Bacteriano , Modelos Estadísticos , Análisis de Secuencia por Matrices de Oligonucleótidos , Células Procariotas/metabolismo , Seudogenes , ARN Mensajero/análisis , Reproducibilidad de los Resultados , Synechocystis/genética
11.
ISME J ; 7(9): 1827-41, 2013 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-23657361

RESUMEN

Viruses that infect marine cyanobacteria-cyanophages-often carry genes with orthologs in their cyanobacterial hosts, and the frequency of these genes can vary with habitat. To explore habitat-influenced genomic diversity more deeply, we used the genomes of 28 cultured cyanomyoviruses as references to identify phage genes in three ocean habitats. Only about 6-11% of genes were consistently observed in the wild, revealing high gene-content variability in these populations. Numerous shared phage/host genes differed in relative frequency between environments, including genes related to phosphorous acquisition, photorespiration, photosynthesis and the pentose phosphate pathway, possibly reflecting environmental selection for these genes in cyanomyovirus genomes. The strongest emergent signal was related to phosphorous availability; a higher fraction of genomes from relatively low-phosphorus environments-the Sargasso and Mediterranean Sea-contained host-like phosphorus assimilation genes compared with those from the N. Pacific Gyre. These genes are known to be upregulated when the host is phosphorous starved, a response mediated by pho box motifs in phage genomes that bind a host regulatory protein. Eleven cyanomyoviruses have predicted pho boxes upstream of the phosphate-acquisition genes pstS and phoA; eight of these have a conserved cyanophage-specific gene (PhCOG173) between the pho box and pstS. PhCOG173 is also found upstream of other shared phage/host genes, suggesting a unique regulatory role. Pho boxes are found upstream of high light-inducible (hli) genes in cyanomyoviruses, suggesting that this motif may have a broader role than regulating phosphorous-stress responses in infected hosts or that these hlis are involved in the phosphorous-stress response.


Asunto(s)
Bacteriófagos/clasificación , Ecosistema , Variación Genética , Fósforo/metabolismo , Prochlorococcus/virología , Agua de Mar/virología , Estrés Fisiológico/genética , Bacteriófagos/genética , Frecuencia de los Genes , Genes Virales/genética , Mar Mediterráneo , Metagenoma , Filogenia , Agua de Mar/microbiología
12.
ISME J ; 7(1): 184-98, 2013 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-22895163

RESUMEN

Prochlorococcus is the numerically dominant photosynthetic organism throughout much of the world's oceans, yet little is known about the ecology and genetic diversity of populations inhabiting tropical waters. To help close this gap, we examined natural Prochlorococcus communities in the tropical Pacific Ocean using a single-cell whole-genome amplification and sequencing. Analysis of the gene content of just 10 single cells from these waters added 394 new genes to the Prochlorococcus pan-genome--that is, genes never before seen in a Prochlorococcus cell. Analysis of marker genes, including the ribosomal internal transcribed sequence, from dozens of individual cells revealed several representatives from two uncultivated clades of Prochlorococcus previously identified as HNLC1 and HNLC2. While the HNLC clades can dominate Prochlorococcus communities under certain conditions, their overall geographic distribution was highly restricted compared with other clades of Prochlorococcus. In the Atlantic and Pacific oceans, these clades were only found in warm waters with low Fe and high inorganic P levels. Genomic analysis suggests that at least one of these clades thrives in low Fe environments by scavenging organic-bound Fe, a process previously unknown in Prochlorococcus. Furthermore, the capacity to utilize organic-bound Fe appears to have been acquired horizontally and may be exchanged among other clades of Prochlorococcus. Finally, one of the single Prochlorococcus cells sequenced contained a partial genome of what appears to be a prophage integrated into the genome.


Asunto(s)
Genómica/métodos , Filogenia , Prochlorococcus/clasificación , Agua de Mar/microbiología , Análisis de la Célula Individual/métodos , Bacteriófagos/genética , Variación Genética , Hierro/metabolismo , Metagenómica/métodos , Océano Pacífico , Prochlorococcus/genética , Prochlorococcus/metabolismo , Prochlorococcus/virología , Agua de Mar/química , Sideróforos/metabolismo
13.
Appl Environ Microbiol ; 73(17): 5389-400, 2007 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-17630305

RESUMEN

Previous experiments examining the transcriptional profile of the anaerobe Desulfovibrio vulgaris demonstrated up-regulation of the Fur regulon in response to various environmental stressors. To test the involvement of Fur in the growth response and transcriptional regulation of D. vulgaris, a targeted mutagenesis procedure was used for deleting the fur gene. Growth of the resulting Deltafur mutant (JW707) was not affected by iron availability, but the mutant did exhibit increased sensitivity to nitrite and osmotic stresses compared to the wild type. Transcriptional profiling of JW707 indicated that iron-bound Fur acts as a traditional repressor for ferrous iron uptake genes (feoAB) and other genes containing a predicted Fur binding site within their promoter. Despite the apparent lack of siderophore biosynthesis genes within the D. vulgaris genome, a large 12-gene operon encoding orthologs to TonB and TolQR also appeared to be repressed by iron-bound Fur. While other genes predicted to be involved in iron homeostasis were unaffected by the presence or absence of Fur, alternative expression patterns that could be interpreted as repression or activation by iron-free Fur were observed. Both the physiological and transcriptional data implicate a global regulatory role for Fur in the sulfate-reducing bacterium D. vulgaris.


Asunto(s)
Proteínas Bacterianas/metabolismo , Desulfovibrio/metabolismo , Perfilación de la Expresión Génica , Regulación Bacteriana de la Expresión Génica , Mutación , Proteínas Represoras/genética , Proteínas Bacterianas/genética , Desulfovibrio/genética , Desulfovibrio/crecimiento & desarrollo , Desulfovibrio/fisiología , Hierro/metabolismo , Análisis de Secuencia por Matrices de Oligonucleótidos , Oxidación-Reducción , Estrés Oxidativo , Regiones Promotoras Genéticas , Sulfatos/metabolismo
14.
Appl Environ Microbiol ; 72(6): 4370-81, 2006 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-16751553

RESUMEN

Many of the proteins that are candidates for bioenergetic pathways involved with sulfate respiration in Desulfovibrio spp. have been studied, but complete pathways and overall cell physiology remain to be resolved for many environmentally relevant conditions. In order to understand the metabolism of these microorganisms under adverse environmental conditions for improved bioremediation efforts, Desulfovibrio vulgaris Hildenborough was used as a model organism to study stress response to nitrite, an important intermediate in the nitrogen cycle. Previous physiological studies demonstrated that growth was inhibited by nitrite and that nitrite reduction was observed to be the primary mechanism of detoxification. Global transcriptional profiling with whole-genome microarrays revealed coordinated cascades of responses to nitrite in pathways of energy metabolism, nitrogen metabolism, oxidative stress response, and iron homeostasis. In agreement with previous observations, nitrite-stressed cells showed a decrease in the expression of genes encoding sulfate reduction functions in addition to respiratory oxidative phosphorylation and ATP synthase activity. Consequently, the stressed cells had decreased expression of the genes encoding ATP-dependent amino acid transporters and proteins involved in translation. Other genes up-regulated in response to nitrite include the genes in the Fur regulon, which is suggested to be involved in iron homeostasis, and genes in the Per regulon, which is predicted to be responsible for oxidative stress response.


Asunto(s)
Desulfovibrio vulgaris/metabolismo , Nitritos/metabolismo , Transcripción Genética , Desulfovibrio vulgaris/efectos de los fármacos , Desulfovibrio vulgaris/genética , Desulfovibrio vulgaris/crecimiento & desarrollo , Metabolismo Energético , Cinética , Nitritos/farmacología , Oxidación-Reducción , Reproducibilidad de los Resultados , Sulfatos/metabolismo
15.
Genome Res ; 15(6): 809-19, 2005 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-15930492

RESUMEN

The organization of bacterial genes into operons was originally ascribed to the benefits of co-regulation. More recently, the "selfish operon" model, in which operons are formed by repeated gain and loss of genes, was proposed. Indeed, operons are often subject to horizontal gene transfer (HGT). On the other hand, non-HGT genes are particularly likely to be in operons. To clarify whether HGT is involved in operon formation, we identified recently formed operons in Escherichia coli K12. We show that genes that have homologs in distantly related bacteria but not in close relatives of E. coli--indicating HGT--form new operons at about the same rates as native genes. Furthermore, genes in new operons are no more likely than other genes to have phylogenetic trees that are inconsistent with the species tree. In contrast, essential genes and ubiquitous genes without paralogs--genes believed to undergo HGT rarely--often form new operons. We conclude that HGT is not a cause of operon formation but instead promotes the prevalence of pre-existing operons. To explain operon formation, we propose that new operons reduce the amount of regulatory information required to specify optimal expression patterns and infer that operons should be more likely to evolve than independent promoters when regulation is complex. Consistent with this hypothesis, operons have greater amounts of conserved regulatory sequences than do individually transcribed genes.


Asunto(s)
Escherichia coli K12/genética , Proteínas de Escherichia coli/genética , Evolución Molecular , Transferencia de Gen Horizontal , Modelos Genéticos , Operón/genética , Regulación Bacteriana de la Expresión Génica , Regiones Promotoras Genéticas
16.
Genome Res ; 15(7): 1015-22, 2005 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-15998914

RESUMEN

At present, hundreds of microbial genomes have been sequenced, and hundreds more are currently in the pipeline. The Virtual Institute for Microbial Stress and Survival has developed a publicly available suite of Web-based comparative genomic tools (http://www.microbesonline.org) designed to facilitate multispecies comparison among prokaryotes. Highlights of the MicrobesOnline Web site include operon and regulon predictions, a multispecies genome browser, a multispecies Gene Ontology browser, a comparative KEGG metabolic pathway viewer, a Bioinformatics Workbench for in-depth sequence analysis, and Gene Carts that allow users to save genes of interest for further study while they browse. In addition, we provide an interface for genome annotation, which like all of the tools reported here, is freely available to the scientific community.


Asunto(s)
Genoma Bacteriano , Genómica , Animales , Biología Computacional , Gráficos por Computador , Bases de Datos Genéticas , Bases de Datos de Proteínas , Estructura Terciaria de Proteína/genética , Programas Informáticos , Interfaz Usuario-Computador
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA