Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 22
Filtrar
1.
Nucleic Acids Res ; 52(10): 5478-5495, 2024 Jun 10.
Artículo en Inglés | MEDLINE | ID: mdl-38686794

RESUMEN

Genome mining is revolutionizing natural products discovery efforts. The rapid increase in available genomes demands comprehensive computational platforms to effectively extract biosynthetic knowledge encoded across bacterial pangenomes. Here, we present BGCFlow, a novel systematic workflow integrating analytics for large-scale genome mining of bacterial pangenomes. BGCFlow incorporates several genome analytics and mining tools grouped into five common stages of analysis such as: (i) data selection, (ii) functional annotation, (iii) phylogenetic analysis, (iv) genome mining, and (v) comparative analysis. Furthermore, BGCFlow provides easy configuration of different projects, parallel distribution, scheduled job monitoring, an interactive database to visualize tables, exploratory Jupyter Notebooks, and customized reports. Here, we demonstrate the application of BGCFlow by investigating the phylogenetic distribution of various biosynthetic gene clusters detected across 42 genomes of the Saccharopolyspora genus, known to produce industrially important secondary/specialized metabolites. The BGCFlow-guided analysis predicted more accurate dereplication of BGCs and guided the targeted comparative analysis of selected RiPPs. The scalable, interoperable, adaptable, re-entrant, and reproducible nature of the BGCFlow will provide an effective novel way to extract the biosynthetic knowledge from the ever-growing genomic datasets of biotechnologically relevant bacterial species.


Asunto(s)
Vías Biosintéticas , Genómica , Familia de Multigenes , Flujo de Trabajo , Vías Biosintéticas/genética , Minería de Datos , Bases de Datos Genéticas , Genoma Bacteriano , Genómica/métodos , Filogenia , Programas Informáticos
2.
Proc Natl Acad Sci U S A ; 120(15): e2218835120, 2023 04 11.
Artículo en Inglés | MEDLINE | ID: mdl-37011218

RESUMEN

The genomic diversity across strains of a species forms the genetic basis for differences in their behavior. A large-scale assessment of sequence variation has been made possible by the growing availability of strain-specific whole-genome sequences (WGS) and with the advent of large-scale databases of laboratory-acquired mutations. We define the Escherichia coli "alleleome" through a genome-scale assessment of amino acid (AA) sequence diversity in open reading frames across 2,661 WGS from wild-type strains. We observe a highly conserved alleleome enriched in mutations unlikely to affect protein function. In contrast, 33,000 mutations acquired in laboratory evolution experiments result in more severe AA substitutions that are rarely achieved by natural selection. Large-scale assessment of the alleleome establishes a method for the quantification of bacterial allelic diversity, reveals opportunities for synthetic biology to explore novel sequence space, and offers insights into the constraints governing evolution.


Asunto(s)
Escherichia coli , Variación Genética , Mutación , Escherichia coli/genética , Genoma Bacteriano/genética , Secuencia de Aminoácidos
3.
Metab Eng ; 76: 179-192, 2023 03.
Artículo en Inglés | MEDLINE | ID: mdl-36738854

RESUMEN

Although strain tolerance to high product concentrations is a barrier to the economically viable biomanufacturing of industrial chemicals, chemical tolerance mechanisms are often unknown. To reveal tolerance mechanisms, an automated platform was utilized to evolve Escherichia coli to grow optimally in the presence of 11 industrial chemicals (1,2-propanediol, 2,3-butanediol, glutarate, adipate, putrescine, hexamethylenediamine, butanol, isobutyrate, coumarate, octanoate, hexanoate), reaching tolerance at concentrations 60%-400% higher than initial toxic levels. Sequencing genomes of 223 isolates from 89 populations, reverse engineering, and cross-compound tolerance profiling were employed to uncover tolerance mechanisms. We show that: 1) cells are tolerized via frequent mutation of membrane transporters or cell wall-associated proteins (e.g., ProV, KgtP, SapB, NagA, NagC, MreB), transcription and translation machineries (e.g., RpoA, RpoB, RpoC, RpsA, RpsG, NusA, Rho), stress signaling proteins (e.g., RelA, SspA, SpoT, YobF), and for certain chemicals, regulators and enzymes in metabolism (e.g., MetJ, NadR, GudD, PurT); 2) osmotic stress plays a significant role in tolerance when chemical concentrations exceed a general threshold and mutated genes frequently overlap with those enabling chemical tolerance in membrane transporters and cell wall-associated proteins; 3) tolerization to a specific chemical generally improves tolerance to structurally similar compounds whereas a tradeoff can occur on dissimilar chemicals, and 4) using pre-tolerized starting isolates can hugely enhance the subsequent production of chemicals when a production pathway is inserted in many, but not all, evolved tolerized host strains, underpinning the need for evolving multiple parallel populations. Taken as a whole, this study provides a comprehensive genotype-phenotype map based on identified mutations and growth phenotypes for 223 chemical tolerant isolates.


Asunto(s)
Proteínas de Escherichia coli , Escherichia coli , Escherichia coli/genética , Escherichia coli/metabolismo , Proteínas de Escherichia coli/genética , Proteínas de Escherichia coli/metabolismo , Mutación , 1-Butanol/metabolismo , Proteínas de Transporte de Membrana/genética , Proteínas Represoras/genética , Factores de Elongación Transcripcional/genética , Factores de Elongación Transcripcional/metabolismo
4.
Nucleic Acids Res ; 49(D1): D112-D120, 2021 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-33045728

RESUMEN

Independent component analysis (ICA) of bacterial transcriptomes has emerged as a powerful tool for obtaining co-regulated, independently-modulated gene sets (iModulons), inferring their activities across a range of conditions, and enabling their association to known genetic regulators. By grouping and analyzing genes based on observations from big data alone, iModulons can provide a novel perspective into how the composition of the transcriptome adapts to environmental conditions. Here, we present iModulonDB (imodulondb.org), a knowledgebase of prokaryotic transcriptional regulation computed from high-quality transcriptomic datasets using ICA. Users select an organism from the home page and then search or browse the curated iModulons that make up its transcriptome. Each iModulon and gene has its own interactive dashboard, featuring plots and tables with clickable, hoverable, and downloadable features. This site enhances research by presenting scientists of all backgrounds with co-expressed gene sets and their activity levels, which lead to improved understanding of regulator-gene relationships, discovery of transcription factors, and the elucidation of unexpected relationships between conditions and genetic regulatory activity. The current release of iModulonDB covers three organisms (Escherichia coli, Staphylococcus aureus and Bacillus subtilis) with 204 iModulons, and can be expanded to cover many additional organisms.


Asunto(s)
Proteínas Bacterianas/genética , Bases de Datos Genéticas , Regulación Bacteriana de la Expresión Génica , Bases del Conocimiento , Aprendizaje Automático , Transcriptoma , Bacillus subtilis/genética , Bacillus subtilis/metabolismo , Proteínas Bacterianas/metabolismo , Escherichia coli/genética , Escherichia coli/metabolismo , Perfilación de la Expresión Génica , Redes Reguladoras de Genes , Internet , Programas Informáticos , Staphylococcus aureus/genética , Staphylococcus aureus/metabolismo , Factores de Transcripción/genética , Factores de Transcripción/metabolismo , Transcripción Genética
5.
Proc Natl Acad Sci U S A ; 117(37): 23182-23190, 2020 09 15.
Artículo en Inglés | MEDLINE | ID: mdl-32873645

RESUMEN

Enzyme turnover numbers (kcats) are essential for a quantitative understanding of cells. Because kcats are traditionally measured in low-throughput assays, they can be inconsistent, labor-intensive to obtain, and can miss in vivo effects. We use a data-driven approach to estimate in vivo kcats using metabolic specialist Escherichia coli strains that resulted from gene knockouts in central metabolism followed by metabolic optimization via laboratory evolution. By combining absolute proteomics with fluxomics data, we find that in vivo kcats are robust against genetic perturbations, suggesting that metabolic adaptation to gene loss is mostly achieved through other mechanisms, like gene-regulatory changes. Combining machine learning and genome-scale metabolic models, we show that the obtained in vivo kcats predict unseen proteomics data with much higher precision than in vitro kcats. The results demonstrate that in vivo kcats can solve the problem of inconsistent and low-coverage parameterizations of genome-scale cellular models.


Asunto(s)
Escherichia coli/metabolismo , Escherichia coli/genética , Proteínas de Escherichia coli/genética , Proteínas de Escherichia coli/metabolismo , Técnicas de Inactivación de Genes/métodos , Genoma/genética , Cinética , Aprendizaje Automático , Modelos Biológicos , Proteómica/métodos
6.
Nucleic Acids Res ; 48(18): 10157-10163, 2020 10 09.
Artículo en Inglés | MEDLINE | ID: mdl-32976587

RESUMEN

A genome contains the information underlying an organism's form and function. Yet, we lack formal framework to represent and study this information. Here, we introduce the Bitome, a matrix composed of binary digits (bits) representing the genomic positions of genomic features. We form a Bitome for the genome of Escherichia coli K-12 MG1655. We find that: (i) genomic features are encoded unevenly, both spatially and categorically; (ii) coding and intergenic features are recapitulated at high resolution; (iii) adaptive mutations are skewed towards genomic positions with fewer features; and (iv) the Bitome enhances prediction of adaptively mutated and essential genes. The Bitome is a formal representation of a genome and may be used to study its fundamental organizational properties.


Asunto(s)
Escherichia coli K12/genética , Genoma Bacteriano , Genómica
7.
Proc Natl Acad Sci U S A ; 116(50): 25287-25292, 2019 12 10.
Artículo en Inglés | MEDLINE | ID: mdl-31767748

RESUMEN

Evolution fine-tunes biological pathways to achieve a robust cellular physiology. Two and a half billion years ago, rapidly rising levels of oxygen as a byproduct of blooming cyanobacterial photosynthesis resulted in a redox upshift in microbial energetics. The appearance of higher-redox-potential respiratory quinone, ubiquinone (UQ), is believed to be an adaptive response to this environmental transition. However, the majority of bacterial species are still dependent on the ancient respiratory quinone, naphthoquinone (NQ). Gammaproteobacteria can biosynthesize both of these respiratory quinones, where UQ has been associated with aerobic lifestyle and NQ with anaerobic lifestyle. We engineered an obligate NQ-dependent γ-proteobacterium, Escherichia coli ΔubiC, and performed adaptive laboratory evolution to understand the selection against the use of NQ in an oxic environment and also the adaptation required to support the NQ-driven aerobic electron transport chain. A comparative systems-level analysis of pre- and postevolved NQ-dependent strains revealed a clear shift from fermentative to oxidative metabolism enabled by higher periplasmic superoxide defense. This metabolic shift was driven by the concerted activity of 3 transcriptional regulators (PdhR, RpoS, and Fur). Analysis of these findings using a genome-scale model suggested that resource allocation to reactive oxygen species (ROS) mitigation results in lower growth rates. These results provide a direct elucidation of a resource allocation tradeoff between growth rate and ROS mitigation costs associated with NQ usage under oxygen-replete condition.


Asunto(s)
Escherichia coli/crecimiento & desarrollo , Escherichia coli/metabolismo , Naftoquinonas/metabolismo , Estrés Oxidativo , Oxígeno/metabolismo , Aerobiosis , Evolución Biológica , Transporte de Electrón , Escherichia coli/genética , Oxo-Ácido-Liasas/genética , Oxo-Ácido-Liasas/metabolismo , Especies Reactivas de Oxígeno/metabolismo
8.
Proc Natl Acad Sci U S A ; 116(28): 14368-14373, 2019 07 09.
Artículo en Inglés | MEDLINE | ID: mdl-31270234

RESUMEN

Catalysis using iron-sulfur clusters and transition metals can be traced back to the last universal common ancestor. The damage to metalloproteins caused by reactive oxygen species (ROS) can prevent cell growth and survival when unmanaged, thus eliciting an essential stress response that is universal and fundamental in biology. Here we develop a computable multiscale description of the ROS stress response in Escherichia coli, called OxidizeME. We use OxidizeME to explain four key responses to oxidative stress: 1) ROS-induced auxotrophy for branched-chain, aromatic, and sulfurous amino acids; 2) nutrient-dependent sensitivity of growth rate to ROS; 3) ROS-specific differential gene expression separate from global growth-associated differential expression; and 4) coordinated expression of iron-sulfur cluster (ISC) and sulfur assimilation (SUF) systems for iron-sulfur cluster biosynthesis. These results show that we can now develop fundamental and quantitative genotype-phenotype relationships for stress responses on a genome-wide basis.


Asunto(s)
Proteínas Hierro-Azufre/genética , Hierro/metabolismo , Metaloproteínas/genética , Especies Reactivas de Oxígeno/metabolismo , Catálisis , Proliferación Celular/genética , Escherichia coli/genética , Escherichia coli/metabolismo , Regulación de la Expresión Génica/genética , Peróxido de Hidrógeno/metabolismo , Operón/genética , Estrés Oxidativo/genética , Azufre/metabolismo
9.
Nucleic Acids Res ; 47(D1): D1164-D1171, 2019 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-30357390

RESUMEN

Adaptive Laboratory Evolution (ALE) has emerged as an experimental approach to discover causal mutations that confer desired phenotypic functions. ALE not only represents a controllable experimental approach to systematically discover genotype-phenotype relationships, but also allows for the revelation of the series of genetic alterations required to acquire the new phenotype. Numerous ALE studies have been published, providing a strong impetus for developing databases to warehouse experimental evolution information and make it retrievable for large-scale analysis. Here, the first step towards establishing this resource is presented: ALEdb (http://aledb.org). This initial release contains over 11 000 mutations that have been discovered from eleven ALE publications. ALEdb (i) is a web-based platform that comprehensively reports on ALE acquired mutations and their conditions, (ii) reports key mutations using previously established trends, (iii) enables a search-driven workflow to enhance user mutation functional analysis through mutation cross-reference, (iv) allows exporting of mutation query results for custom analysis, (v) includes a bibliome describing the databased experiment publications and (vi) contains experimental evolution mutations from multiple model organisms. Thus, ALEdb is an informative platform which will become increasingly revealing as the number of reported ALE experiments and identified mutations continue to expand.


Asunto(s)
Adaptación Fisiológica/genética , Biología Computacional/métodos , Bases de Datos Genéticas , Evolución Molecular Dirigida/métodos , Escherichia coli/genética , Mutación , Técnicas Bacteriológicas/métodos , Escherichia coli/crecimiento & desarrollo , Proteínas de Escherichia coli/genética , Genes Bacterianos/genética , Internet , Polimorfismo de Nucleótido Simple , Reproducibilidad de los Resultados
10.
BMC Genomics ; 21(1): 514, 2020 Jul 25.
Artículo en Inglés | MEDLINE | ID: mdl-32711472

RESUMEN

BACKGROUND: Adaptive Laboratory Evolution (ALE) has emerged as an experimental approach to discover mutations that confer phenotypic functions of interest. However, the task of finding and understanding all beneficial mutations of an ALE experiment remains an open challenge for the field. To provide for better results than traditional methods of ALE mutation analysis, this work applied enrichment methods to mutations described by a multiscale annotation framework and a consolidated set of ALE experiment conditions. A total of 25,321 unique genome annotations from various sources were leveraged to describe multiple scales of mutated features in a set of 35 Escherichia coli based ALE experiments. These experiments totalled 208 independent evolutions and 2641 mutations. Additionally, mutated features were statistically associated across a total of 43 unique experimental conditions to aid in deconvoluting mutation selection pressures. RESULTS: Identifying potentially beneficial, or key, mutations was enhanced by seeking coding and non-coding genome features significantly enriched by mutations across multiple ALE replicates and scales of genome annotations. The median proportion of ALE experiment key mutations increased from 62%, with only small coding and non-coding features, to 71% with larger aggregate features. Understanding key mutations was enhanced by considering the functions of broader annotation types and the significantly associated conditions for key mutated features. The approaches developed here were used to find and characterize novel key mutations in two ALE experiments: one previously unpublished with Escherichia coli grown on glycerol as a carbon source and one previously published with Escherichia coli tolerized to high concentrations of L-serine. CONCLUSIONS: The emergent adaptive strategies represented by sets of ALE mutations became more clear upon observing the aggregation of mutated features across small to large scale genome annotations. The clarification of mutation selection pressures among the many experimental conditions also helped bring these strategies to light. This work demonstrates how multiscale genome annotation frameworks and data-driven methods can help better characterize ALE mutations, and thus help elucidate the genotype-to-phenotype relationship of the studied organism.


Asunto(s)
Proteínas de Escherichia coli , Laboratorios , Escherichia coli/genética , Proteínas de Escherichia coli/genética , Genoma , Mutación
11.
Microbiology (Reading) ; 166(2): 141-148, 2020 02.
Artículo en Inglés | MEDLINE | ID: mdl-31625833

RESUMEN

The ability of Escherichia coli to tolerate acid stress is important for its survival and colonization in the human digestive tract. Here, we performed adaptive laboratory evolution of the laboratory strain E. coli K-12 MG1655 at pH 5.5 in glucose minimal medium. After 800 generations, six independent populations under evolution had reached 18.0 % higher growth rates than their starting strain at pH 5.5, while maintaining comparable growth rates to the starting strain at pH 7. We characterized the evolved strains and found that: (1) whole genome sequencing of isolated clones from each evolved population revealed mutations in rpoC appearing in five of six sequenced clones; and (2) gene expression profiles revealed different strategies to mitigate acid stress, which are related to amino acid metabolism and energy production and conversion. Thus, a combination of adaptive laboratory evolution, genome resequencing and expression profiling revealed, on a genome scale, the strategies that E. coli uses to mitigate acid stress.


Asunto(s)
Ácidos/metabolismo , Adaptación Fisiológica/fisiología , Escherichia coli/fisiología , Adaptación Fisiológica/genética , Evolución Biológica , Medios de Cultivo/química , Medios de Cultivo/metabolismo , Escherichia coli/genética , Escherichia coli/crecimiento & desarrollo , Escherichia coli/metabolismo , Proteínas de Escherichia coli/genética , Perfilación de la Expresión Génica , Regulación Bacteriana de la Expresión Génica , Genoma Bacteriano/genética , Glucosa/metabolismo , Redes y Vías Metabólicas/genética , Mutación
12.
PLoS Comput Biol ; 15(3): e1006213, 2019 03.
Artículo en Inglés | MEDLINE | ID: mdl-30822347

RESUMEN

Understanding the fundamental characteristics of microbial communities could have far reaching implications for human health and applied biotechnology. Despite this, much is still unknown regarding the genetic basis and evolutionary strategies underlying the formation of viable synthetic communities. By pairing auxotrophic mutants in co-culture, it has been demonstrated that viable nascent E. coli communities can be established where the mutant strains are metabolically coupled. A novel algorithm, OptAux, was constructed to design 61 unique multi-knockout E. coli auxotrophic strains that require significant metabolite uptake to grow. These predicted knockouts included a diverse set of novel non-specific auxotrophs that result from inhibition of major biosynthetic subsystems. Three OptAux predicted non-specific auxotrophic strains-with diverse metabolic deficiencies-were co-cultured with an L-histidine auxotroph and optimized via adaptive laboratory evolution (ALE). Time-course sequencing revealed the genetic changes employed by each strain to achieve higher community growth rates and provided insight into mechanisms for adapting to the syntrophic niche. A community model of metabolism and gene expression was utilized to predict the relative community composition and fundamental characteristics of the evolved communities. This work presents new insight into the genetic strategies underlying viable nascent community formation and a cutting-edge computational method to elucidate metabolic changes that empower the creation of cooperative communities.


Asunto(s)
Adaptación Fisiológica , Escherichia coli/fisiología , Modelos Biológicos , Algoritmos , Evolución Biológica , Técnicas de Cocultivo , Escherichia coli/genética , Genes Bacterianos , Mutación
13.
Metab Eng ; 39: 141-150, 2017 01.
Artículo en Inglés | MEDLINE | ID: mdl-27908688

RESUMEN

L-serine is a promising building block biochemical with a high theoretical production yield from glucose. Toxicity of L-serine is however prohibitive for high-titer production in E. coli. Here, E. coli lacking L-serine degradation pathways was evolved for improved tolerance by gradually increasing L-serine concentration from 3 to 100g/L using adaptive laboratory evolution (ALE). Genome sequencing of isolated clones revealed multiplication of genetic regions, as well as mutations in thrA, thereby showing a potential mechanism of serine inhibition. Additional mutations were evaluated by MAGE combined with amplicon sequencing, revealing role of rho, lrp, pykF, eno, and rpoB on tolerance and fitness in minimal medium. Production using the tolerant strains resulted in 37g/L of L-serine with a 24% mass yield. The resulting titer is similar to the highest production reported for any organism thereby highlighting the potential of ALE for industrial biotechnology.


Asunto(s)
Evolución Molecular Dirigida/métodos , Escherichia coli/fisiología , Glucosa/metabolismo , Ingeniería Metabólica/métodos , Serina/biosíntesis , Serina/genética , Regulación hacia Arriba/genética , Vías Biosintéticas/genética , Regulación Bacteriana de la Expresión Génica/genética , Mejoramiento Genético/métodos , Redes y Vías Metabólicas/genética , Serina/aislamiento & purificación
14.
NAR Genom Bioinform ; 6(2): lqae041, 2024 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-38774514

RESUMEN

Microbial genome sequences are rapidly accumulating, enabling large-scale studies of sequence variation. Existing studies primarily focus on coding regions to study amino acid substitution patterns in proteins. However, non-coding regulatory regions also play a distinct role in determining physiologic responses. To investigate intergenic sequence variation on a large-scale, we identified non-coding regulatory region alleles across 2350 Escherichia coli strains. This 'alleleome' consists of 117 781 unique alleles for 1169 reference regulatory regions (transcribing 1975 genes) at single base-pair resolution. We find that 64% of nucleotide positions are invariant, and variant positions vary in a median of just 0.6% of strains. Additionally, non-coding alleles are sufficient to recover E. coli phylogroups. We find that core promoter elements and transcription factor binding sites are significantly conserved, especially those located upstream of essential or highly-expressed genes. However, variability in conservation of transcription factor binding sites is significant both within and across regulons. Finally, we contrast mutations acquired during adaptive laboratory evolution with wild-type variation, finding that the former preferentially alter positions that the latter conserves. Overall, this analysis elucidates the wealth of information found in E. coli non-coding sequence variation and expands pangenomic studies to non-coding regulatory regions at single-nucleotide resolution.

15.
mSystems ; 9(3): e0125723, 2024 Mar 19.
Artículo en Inglés | MEDLINE | ID: mdl-38349131

RESUMEN

Limosilactobacillus reuteri, a probiotic microbe instrumental to human health and sustainable food production, adapts to diverse environmental shifts via dynamic gene expression. We applied the independent component analysis (ICA) to 117 RNA-seq data sets to decode its transcriptional regulatory network (TRN), identifying 35 distinct signals that modulate specific gene sets. Our findings indicate that the ICA provides a qualitative advancement and captures nuanced relationships within gene clusters that other methods may miss. This study uncovers the fundamental properties of L. reuteri's TRN and deepens our understanding of its arginine metabolism and the co-regulation of riboflavin metabolism and fatty acid conversion. It also sheds light on conditions that regulate genes within a specific biosynthetic gene cluster and allows for the speculation of the potential role of isoprenoid biosynthesis in L. reuteri's adaptive response to environmental changes. By integrating transcriptomics and machine learning, we provide a system-level understanding of L. reuteri's response mechanism to environmental fluctuations, thus setting the stage for modeling the probiotic transcriptome for applications in microbial food production. IMPORTANCE: We have studied Limosilactobacillus reuteri, a beneficial probiotic microbe that plays a significant role in our health and production of sustainable foods, a type of foods that are nutritionally dense and healthier and have low-carbon emissions compared to traditional foods. Similar to how humans adapt their lifestyles to different environments, this microbe adjusts its behavior by modulating the expression of genes. We applied machine learning to analyze large-scale data sets on how these genes behave across diverse conditions. From this, we identified 35 unique patterns demonstrating how L. reuteri adjusts its genes based on 50 unique environmental conditions (such as various sugars, salts, microbial cocultures, human milk, and fruit juice). This research helps us understand better how L. reuteri functions, especially in processes like breaking down certain nutrients and adapting to stressful changes. More importantly, with our findings, we become closer to using this knowledge to improve how we produce more sustainable and healthier foods with the help of microbes.


Asunto(s)
Limosilactobacillus reuteri , Probióticos , Humanos , Limosilactobacillus reuteri/genética , Perfilación de la Expresión Génica , Transcriptoma/genética , Aprendizaje Automático
16.
Cell Rep ; 42(9): 113105, 2023 09 26.
Artículo en Inglés | MEDLINE | ID: mdl-37713311

RESUMEN

Relationships between the genome, transcriptome, and metabolome underlie all evolved phenotypes. However, it has proved difficult to elucidate these relationships because of the high number of variables measured. A recently developed data analytic method for characterizing the transcriptome can simplify interpretation by grouping genes into independently modulated sets (iModulons). Here, we demonstrate how iModulons reveal deep understanding of the effects of causal mutations and metabolic rewiring. We use adaptive laboratory evolution to generate E. coli strains that tolerate high levels of the redox cycling compound paraquat, which produces reactive oxygen species (ROS). We combine resequencing, iModulons, and metabolic models to elucidate six interacting stress-tolerance mechanisms: (1) modification of transport, (2) activation of ROS stress responses, (3) use of ROS-sensitive iron regulation, (4) motility, (5) broad transcriptional reallocation toward growth, and (6) metabolic rewiring to decrease NADH production. This work thus demonstrates the power of iModulon knowledge mapping for evolution analysis.


Asunto(s)
Escherichia coli , Paraquat , Paraquat/farmacología , Especies Reactivas de Oxígeno/metabolismo , Escherichia coli/metabolismo , Transcriptoma/genética , Perfilación de la Expresión Génica
17.
Nat Commun ; 13(1): 3682, 2022 06 27.
Artículo en Inglés | MEDLINE | ID: mdl-35760776

RESUMEN

The bacterial respiratory electron transport system (ETS) is branched to allow condition-specific modulation of energy metabolism. There is a detailed understanding of the structural and biochemical features of respiratory enzymes; however, a holistic examination of the system and its plasticity is lacking. Here we generate four strains of Escherichia coli harboring unbranched ETS that pump 1, 2, 3, or 4 proton(s) per electron and characterized them using a combination of synergistic methods (adaptive laboratory evolution, multi-omic analyses, and computation of proteome allocation). We report that: (a) all four ETS variants evolve to a similar optimized growth rate, and (b) the laboratory evolutions generate specific rewiring of major energy-generating pathways, coupled to the ETS, to optimize ATP production capability. We thus define an Aero-Type System (ATS), which is a generalization of the aerobic bioenergetics and is a metabolic systems biology description of respiration and its inherent plasticity.


Asunto(s)
Escherichia coli , Biología de Sistemas , Transporte de Electrón/genética , Escherichia coli/metabolismo , Proteoma/metabolismo , Sistema Respiratorio
18.
ACS Synth Biol ; 10(12): 3379-3395, 2021 12 17.
Artículo en Inglés | MEDLINE | ID: mdl-34762392

RESUMEN

Microbes are being engineered for an increasingly large and diverse set of applications. However, the designing of microbial genomes remains challenging due to the general complexity of biological systems. Adaptive Laboratory Evolution (ALE) leverages nature's problem-solving processes to generate optimized genotypes currently inaccessible to rational methods. The large amount of public ALE data now represents a new opportunity for data-driven strain design. This study describes how novel strain designs, or genome sequences not yet observed in ALE experiments or published designs, can be extracted from aggregated ALE data and demonstrates this by designing, building, and testing three novel Escherichia coli strains with fitnesses comparable to ALE mutants. These designs were achieved through a meta-analysis of aggregated ALE mutations data (63 Escherichia coli K-12 MG1655 based ALE experiments, described by 93 unique environmental conditions, 357 independent evolutions, and 13 957 observed mutations), which additionally revealed global ALE mutation trends that inform on ALE-derived strain design principles. Such informative trends anticipate ALE-derived strain designs as largely gene-centric, as opposed to noncoding, and composed of a relatively small number of beneficial variants (approximately 6). These results demonstrate how strain design efforts can be enhanced by the meta-analysis of aggregated ALE data.


Asunto(s)
Escherichia coli K12 , Proteínas de Escherichia coli , Escherichia coli/genética , Escherichia coli K12/genética , Proteínas de Escherichia coli/genética , Laboratorios , Mutación/genética
19.
Nat Ecol Evol ; 4(10): 1402-1409, 2020 10.
Artículo en Inglés | MEDLINE | ID: mdl-32778753

RESUMEN

The ability of DNA to produce a functional protein even after transfer to a foreign host is of fundamental importance in both evolutionary biology and biotechnology, enabling horizontal gene transfer in the wild and heterologous expression in the lab. However, the influence of genetic particulars on DNA functionality in a new host is poorly understood, as are the evolutionary mechanisms of assimilation and refinement. Here, we describe an automation-enabled large-scale experiment wherein Escherichia coli strains were evolved in parallel after replacement of the genes pgi or tpiA with orthologous DNA from donor species spanning all domains of life, from humans to hyperthermophilic archaea. Via analysis of hundreds of clones evolved for 50,000+ cumulative generations across dozens of independent lineages, we show that orthogene-upregulating mutations can completely mitigate fitness defects that result from initial non-functionality, with coding sequence changes unnecessary. Gene target, donor species and genomic location of the swap all influenced outcomes-both the nature of adaptive mutations (often synonymous) and the frequency with which strains successfully evolved to assimilate the foreign DNA. Additionally, time series DNA sequencing and replay evolution experiments revealed transient copy number expansions, the contingency of lineage outcome on first-step mutations and the ability for strains to escape from suboptimal local fitness maxima. Overall, this study establishes the influence of various DNA and protein features on cross-species genetic interchangeability and evolutionary outcomes, with implications for both horizontal gene transfer and rational strain design.


Asunto(s)
Transferencia de Gen Horizontal , Genoma , Escherichia coli/genética , Genómica , Humanos , Análisis de Secuencia de ADN
20.
Nat Microbiol ; 4(3): 386-389, 2019 03.
Artículo en Inglés | MEDLINE | ID: mdl-30692668

RESUMEN

Pseudogenes represent open reading frames that have been damaged by mutations, rendering the gene product non-functional. Pseudogenes are found in many genomes and are not always eliminated, even if they are potentially 'wasteful'. This raises a fundamental question about their prevalence. Here we report pseudogene efeU repair that restores the iron uptake system of Escherichia coli under a designed selection pressure during adaptive laboratory evolution.


Asunto(s)
Reparación de la Incompatibilidad de ADN , Evolución Molecular Dirigida , Seudogenes , Selección Genética , Proteínas de Transporte de Catión/genética , Escherichia coli/genética , Proteínas de Escherichia coli/genética , Evolución Molecular , Hierro/metabolismo , Sistemas de Lectura Abierta , Filogenia
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA