Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 14 de 14
Filtrar
1.
Annu Rev Genet ; 51: 143-170, 2017 11 27.
Artigo em Inglês | MEDLINE | ID: mdl-29178818

RESUMO

Archaea are major contributors to biogeochemical cycles, possess unique metabolic capabilities, and resist extreme stress. To regulate the expression of genes encoding these unique programs, archaeal cells use gene regulatory networks (GRNs) composed of transcription factor proteins and their target genes. Recent developments in genetics, genomics, and computational methods used with archaeal model organisms have enabled the mapping and prediction of global GRN structures. Experimental tests of these predictions have revealed the dynamical function of GRNs in response to environmental variation. Here, we review recent progress made in this area, from investigating the mechanisms of transcriptional regulation of individual genes to small-scale subnetworks and genome-wide global networks. At each level, archaeal GRNs consist of a hybrid of bacterial, eukaryotic, and uniquely archaeal mechanisms. We discuss this theme from the perspective of the role of individual transcription factors in genome-wide regulation, how these proteins interact to compile GRN topological structures, and how these topologies lead to emergent, high-level GRN functions. We conclude by discussing how systems biology approaches are a fruitful avenue for addressing remaining challenges, such as discovering gene function and the evolution of GRNs.


Assuntos
Archaea/genética , Proteínas Arqueais/genética , Regulação da Expressão Gênica em Archaea , Redes Reguladoras de Genes , Genoma Arqueal , Fatores de Transcrição/genética , Transcrição Gênica , Adaptação Biológica/genética , Archaea/metabolismo , Proteínas Arqueais/metabolismo , Mapeamento Cromossômico , Interação Gene-Ambiente , Redes e Vias Metabólicas/genética , Estresse Fisiológico/genética , Biologia de Sistemas/métodos , Fatores de Transcrição/metabolismo
2.
Proc Natl Acad Sci U S A ; 119(26): e2114021119, 2022 06 28.
Artigo em Inglês | MEDLINE | ID: mdl-35733251

RESUMO

Large-scale measurements linking genetic background to biological function have driven a need for models that can incorporate these data for reliable predictions and insight into the underlying biophysical system. Recent modeling efforts, however, prioritize predictive accuracy at the expense of model interpretability. Here, we present LANTERN (landscape interpretable nonparametric model, https://github.com/usnistgov/lantern), a hierarchical Bayesian model that distills genotype-phenotype landscape (GPL) measurements into a low-dimensional feature space that represents the fundamental biological mechanisms of the system while also enabling straightforward, explainable predictions. Across a benchmark of large-scale datasets, LANTERN equals or outperforms all alternative approaches, including deep neural networks. LANTERN furthermore extracts useful insights of the landscape, including its inherent dimensionality, a latent space of additive mutational effects, and metrics of landscape structure. LANTERN facilitates straightforward discovery of fundamental mechanisms in GPLs, while also reliably extrapolating to unexplored regions of genotypic space.


Assuntos
Interação Gene-Ambiente , Genótipo , Redes Neurais de Computação , Fenótipo , Teorema de Bayes , Mutação
3.
Mol Syst Biol ; 17(3): e10179, 2021 03.
Artigo em Inglês | MEDLINE | ID: mdl-33784029

RESUMO

Allostery is a fundamental biophysical mechanism that underlies cellular sensing, signaling, and metabolism. Yet a quantitative understanding of allosteric genotype-phenotype relationships remains elusive. Here, we report the large-scale measurement of the genotype-phenotype landscape for an allosteric protein: the lac repressor from Escherichia coli, LacI. Using a method that combines long-read and short-read DNA sequencing, we quantitatively measure the dose-response curves for nearly 105 variants of the LacI genetic sensor. The resulting data provide a quantitative map of the effect of amino acid substitutions on LacI allostery and reveal systematic sequence-structure-function relationships. We find that in many cases, allosteric phenotypes can be quantitatively predicted with additive or neural-network models, but unpredictable changes also occur. For example, we were surprised to discover a new band-stop phenotype that challenges conventional models of allostery and that emerges from combinations of nearly silent amino acid substitutions.


Assuntos
Genótipo , Repressores Lac/metabolismo , Fenótipo , Regulação Alostérica , Substituição de Aminoácidos , Escherichia coli/genética , Variação Genética
4.
PLoS Comput Biol ; 16(10): e1008366, 2020 10.
Artigo em Inglês | MEDLINE | ID: mdl-33104703

RESUMO

Substantive changes in gene expression, metabolism, and the proteome are manifested in overall changes in microbial population growth. Quantifying how microbes grow is therefore fundamental to areas such as genetics, bioengineering, and food safety. Traditional parametric growth curve models capture the population growth behavior through a set of summarizing parameters. However, estimation of these parameters from data is confounded by random effects such as experimental variability, batch effects or differences in experimental material. A systematic statistical method to identify and correct for such confounding effects in population growth data is not currently available. Further, our previous work has demonstrated that parametric models are insufficient to explain and predict microbial response under non-standard growth conditions. Here we develop a hierarchical Bayesian non-parametric model of population growth that identifies the latent growth behavior and response to perturbation, while simultaneously correcting for random effects in the data. This model enables more accurate estimates of the biological effect of interest, while better accounting for the uncertainty due to technical variation. Additionally, modeling hierarchical variation provides estimates of the relative impact of various confounding effects on measured population growth.


Assuntos
Bactérias/crescimento & desenvolvimento , Modelos Biológicos , Biologia de Sistemas/métodos , Bactérias/metabolismo , Teorema de Bayes , Estatísticas não Paramétricas
5.
Genome Res ; 27(2): 320-333, 2017 02.
Artigo em Inglês | MEDLINE | ID: mdl-27864351

RESUMO

Microbial growth curves are used to study differential effects of media, genetics, and stress on microbial population growth. Consequently, many modeling frameworks exist to capture microbial population growth measurements. However, current models are designed to quantify growth under conditions for which growth has a specific functional form. Extensions to these models are required to quantify the effects of perturbations, which often exhibit nonstandard growth curves. Rather than assume specific functional forms for experimental perturbations, we developed a general and robust model of microbial population growth curves using Gaussian process (GP) regression. GP regression modeling of high-resolution time-series growth data enables accurate quantification of population growth and allows explicit control of effects from other covariates such as genetic background. This framework substantially outperforms commonly used microbial population growth models, particularly when modeling growth data from environmentally stressed populations. We apply the GP growth model and develop statistical tests to quantify the differential effects of environmental perturbations on microbial growth across a large compendium of genotypes in archaea and yeast. This method accurately identifies known transcriptional regulators and implicates novel regulators of growth under standard and stress conditions in the model archaeal organism Halobacterium salinarum For yeast, our method correctly identifies known phenotypes for a diversity of genetic backgrounds under cyclohexamide stress and also detects previously unidentified oxidative stress sensitivity across a subset of strains. Together, these results demonstrate that the GP models are interpretable, recapitulating biological knowledge of growth response while providing new insights into the relevant parameters affecting microbial population growth.


Assuntos
Halobacterium salinarum/crescimento & desenvolvimento , Modelos Biológicos , Leveduras/crescimento & desenvolvimento , Halobacterium salinarum/genética , Distribuição Normal , Fenótipo , Leveduras/genética
6.
Nucleic Acids Res ; 45(17): 9990-10001, 2017 Sep 29.
Artigo em Inglês | MEDLINE | ID: mdl-28973467

RESUMO

Iron is required for key metabolic processes but is toxic in excess. This circumstance forces organisms across the tree of life to tightly regulate iron homeostasis. In hypersaline lakes dominated by archaeal species, iron levels are extremely low and subject to environmental change; however, mechanisms regulating iron homeostasis in archaea remain unclear. In previous work, we demonstrated that two transcription factors (TFs), Idr1 and Idr2, collaboratively regulate aspects of iron homeostasis in the model species Halobacterium salinarum. Here we show that Idr1 and Idr2 are part of an extended regulatory network of four TFs of the bacterial DtxR family that maintains intracellular iron balance. We demonstrate that each TF directly regulates at least one of the other DtxR TFs at the level of transcription. Dynamical modeling revealed interlocking positive feedback loop architecture, which exhibits bistable or oscillatory network dynamics depending on iron availability. TF knockout mutant phenotypes are consistent with model predictions. Together, our results support that this network regulates iron homeostasis despite variation in extracellular iron levels, consistent with dynamical properties of interlocking feedback architecture in eukaryotes. These results suggest that archaea use bacterial-type TFs in a eukaryotic regulatory network topology to adapt to harsh environments.


Assuntos
Proteínas Arqueais/genética , Retroalimentação Fisiológica , Regulação da Expressão Gênica em Archaea , Redes Reguladoras de Genes , Halobacterium salinarum/genética , Ferro/metabolismo , Proteínas Arqueais/metabolismo , Proteínas de Ligação a DNA/genética , Proteínas de Ligação a DNA/metabolismo , Halobacterium salinarum/metabolismo , Homeostase/genética , Mutação , Proteínas Repressoras/genética , Proteínas Repressoras/metabolismo , Transcrição Gênica
7.
PLoS Genet ; 11(1): e1004912, 2015 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-25569531

RESUMO

Networks of interacting transcription factors are central to the regulation of cellular responses to abiotic stress. Although the architecture of many such networks has been mapped, their dynamic function remains unclear. Here we address this challenge in archaea, microorganisms possessing transcription factors that resemble those of both eukaryotes and bacteria. Using genome-wide DNA binding location analysis integrated with gene expression and cell physiological data, we demonstrate that a bacterial-type transcription factor (TF), called RosR, and five TFIIB proteins, homologs of eukaryotic TFs, combinatorially regulate over 100 target genes important for the response to extremely high levels of peroxide. These genes include 20 other transcription factors and oxidative damage repair genes. RosR promoter occupancy is surprisingly dynamic, with the pattern of target gene expression during the transition from rapid growth to stress correlating strongly with the pattern of dynamic binding. We conclude that a hierarchical regulatory network orchestrated by TFs of hybrid lineage enables dynamic response and survival under extreme stress in archaea. This raises questions regarding the evolutionary trajectory of gene networks in response to stress.


Assuntos
Proteínas de Ligação a DNA/genética , Redes Reguladoras de Genes , Estresse Oxidativo/genética , Fator de Transcrição TFIIB/genética , Archaea/genética , Archaea/fisiologia , Regulação Bacteriana da Expressão Gênica , Motivos de Nucleotídeos/genética
9.
PLoS One ; 18(3): e0283548, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-36989327

RESUMO

As synthetic biology expands and accelerates into real-world applications, methods for quantitatively and precisely engineering biological function become increasingly relevant. This is particularly true for applications that require programmed sensing to dynamically regulate gene expression in response to stimuli. However, few methods have been described that can engineer biological sensing with any level of quantitative precision. Here, we present two complementary methods for precision engineering of genetic sensors: in silico selection and machine-learning-enabled forward engineering. Both methods use a large-scale genotype-phenotype dataset to identify DNA sequences that encode sensors with quantitatively specified dose response. First, we show that in silico selection can be used to engineer sensors with a wide range of dose-response curves. To demonstrate in silico selection for precise, multi-objective engineering, we simultaneously tune a genetic sensor's sensitivity (EC50) and saturating output to meet quantitative specifications. In addition, we engineer sensors with inverted dose-response and specified EC50. Second, we demonstrate a machine-learning-enabled approach to predictively engineer genetic sensors with mutation combinations that are not present in the large-scale dataset. We show that the interpretable machine learning results can be combined with a biophysical model to engineer sensors with improved inverted dose-response curves.


Assuntos
Aprendizado de Máquina , Biologia Sintética , Biologia Sintética/métodos
10.
Synth Biol (Oxf) ; 7(1): ysac013, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-36101862

RESUMO

Microbial cell culture is one of the most commonly performed protocols for synthetic biology, and laboratories are increasingly using 96-well plates and laboratory automation systems for cell culture. Here, we describe a method for reproducible microbial culture using laboratory automation systems, including automated liquid handling, automated plate sealing and de-sealing, automated incubation and measurement of growing cultures. We discuss the key considerations that, in our experience, are important for reproducibility and present statistical analyses of data from 150 automated microbial growth experiments performed over 27 months using our automated method.

11.
BMC Genom Data ; 23(1): 25, 2022 04 04.
Artigo em Inglês | MEDLINE | ID: mdl-35379171

RESUMO

BACKGROUND: The coronavirus nonstructural protein 5 (Nsp5) is a cysteine protease required for processing the viral polyprotein and is therefore crucial for viral replication. Nsp5 from several coronaviruses have also been found to cleave host proteins, disrupting molecular pathways involved in innate immunity. Nsp5 from the recently emerged SARS-CoV-2 virus interacts with and can cleave human proteins, which may be relevant to the pathogenesis of COVID-19. Based on the continuing global pandemic, and emerging understanding of coronavirus Nsp5-human protein interactions, we set out to predict what human proteins are cleaved by the coronavirus Nsp5 protease using a bioinformatics approach. RESULTS: Using a previously developed neural network trained on coronavirus Nsp5 cleavage sites (NetCorona), we made predictions of Nsp5 cleavage sites in all human proteins. Structures of human proteins in the Protein Data Bank containing a predicted Nsp5 cleavage site were then examined, generating a list of 92 human proteins with a highly predicted and accessible cleavage site. Of those, 48 are expected to be found in the same cellular compartment as Nsp5. Analysis of this targeted list of proteins revealed molecular pathways susceptible to Nsp5 cleavage and therefore relevant to coronavirus infection, including pathways involved in mRNA processing, cytokine response, cytoskeleton organization, and apoptosis. CONCLUSIONS: This study combines predictions of Nsp5 cleavage sites in human proteins with protein structure information and protein network analysis. We predicted cleavage sites in proteins recently shown to be cleaved in vitro by SARS-CoV-2 Nsp5, and we discuss how other potentially cleaved proteins may be relevant to coronavirus mediated immune dysregulation. The data presented here will assist in the design of more targeted experiments, to determine the role of coronavirus Nsp5 cleavage of host proteins, which is relevant to understanding the molecular pathology of coronavirus infection.


Assuntos
Proteases 3C de Coronavírus , Proteoma , SARS-CoV-2 , COVID-19 , Proteases 3C de Coronavírus/sangue , Humanos , SARS-CoV-2/enzimologia
12.
Curr Opin Syst Biol ; 23: 32-37, 2020 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-34611570

RESUMO

Precise engineering of biological systems requires quantitative, high-throughput measurements, exemplified by progress in directed evolution. New approaches allow high-throughput measurements of phenotypes and their corresponding genotypes. When integrated into directed evolution, these quantitative approaches enable the precise engineering of biological function. At the same time, the increasingly routine availability of large, high-quality data sets supports the integration of machine learning with directed evolution. Together, these advances herald striking capabilities for engineering biology.

13.
Front Microbiol ; 9: 3196, 2018.
Artigo em Inglês | MEDLINE | ID: mdl-30671033

RESUMO

Different weak organic acids have significant potential as topical treatments for wounds infected by opportunistic pathogens that are recalcitrant to standard treatments. These acids have long been used as bacteriostatic compounds in the food industry, and in some cases are already being used in the clinic. The effects of different organic acids vary with pH, concentration, and the specific organic acid used, but no studies to date on any opportunistic pathogens have examined the detailed interactions between these key variables in a controlled and systematic way. We have therefore comprehensively evaluated the effects of several different weak organic acids on growth of the opportunistic pathogen Pseudomonas aeruginosa. We used a semi-automated plate reader to generate growth profiles for two different strains (model laboratory strain PAO1 and clinical isolate PA1054 from a hospital burns unit) in a range of organic acids at different concentrations and pH, with a high level of replication for a total of 162,960 data points. We then compared two different modeling approaches for the interpretation of this time-resolved dataset: parametric logistic regression (with or without a component to include lag phase) vs. non-parametric Gaussian process (GP) regression. Because GP makes no prior assumptions about the nature of the growth, this method proved to be superior in cases where growth did not follow a standard sigmoid functional form, as is common when bacteria grow under stress. Acetic, propionic and butyric acids were all more detrimental to growth than the other acids tested, and although PA1054 grew better than PAO1 under non-stress conditions, this difference largely disappeared as the levels of stress increased. As expected from knowledge of how organic acids behave, their effect was significantly enhanced in combination with low pH, with this interaction being greatest in the case of propionic acid. Our approach lends itself to the characterization of combinatorial interactions between stressors, especially in cases where their impacts on growth render logistic growth models unsuitable.

14.
mSystems ; 2(5)2017.
Artigo em Inglês | MEDLINE | ID: mdl-28951888

RESUMO

Gene regulatory networks (GRNs) are critical for dynamic transcriptional responses to environmental stress. However, the mechanisms by which GRN regulation adjusts physiology to enable stress survival remain unclear. Here we investigate the functions of transcription factors (TFs) within the global GRN of the stress-tolerant archaeal microorganism Halobacterium salinarum. We measured growth phenotypes of a panel of TF deletion mutants in high temporal resolution under heat shock, oxidative stress, and low-salinity conditions. To quantitate the noncanonical functional forms of the growth trajectories observed for these mutants, we developed a novel modeling framework based on Gaussian process regression and functional analysis of variance (FANOVA). We employ unique statistical tests to determine the significance of differential growth relative to the growth of the control strain. This analysis recapitulated known TF functions, revealed novel functions, and identified surprising secondary functions for characterized TFs. Strikingly, we observed that the majority of the TFs studied were required for growth under multiple stress conditions, pinpointing regulatory connections between the conditions tested. Correlations between quantitative phenotype trajectories of mutants are predictive of TF-TF connections within the GRN. These phenotypes are strongly concordant with predictions from statistical GRN models inferred from gene expression data alone. With genome-wide and targeted data sets, we provide detailed functional validation of novel TFs required for extreme oxidative stress and heat shock survival. Together, results presented in this study suggest that many TFs function under multiple conditions, thereby revealing high interconnectivity within the GRN and identifying the specific TFs required for communication between networks responding to disparate stressors. IMPORTANCE To ensure survival in the face of stress, microorganisms employ inducible damage repair pathways regulated by extensive and complex gene networks. Many archaea, microorganisms of the third domain of life, persist under extremes of temperature, salinity, and pH and under other conditions. In order to understand the cause-effect relationships between the dynamic function of the stress network and ultimate physiological consequences, this study characterized the physiological role of nearly one-third of all regulatory proteins known as transcription factors (TFs) in an archaeal organism. Using a unique quantitative phenotyping approach, we discovered functions for many novel TFs and revealed important secondary functions for known TFs. Surprisingly, many TFs are required for resisting multiple stressors, suggesting cross-regulation of stress responses. Through extensive validation experiments, we map the physiological roles of these novel TFs in stress response back to their position in the regulatory network wiring. This study advances understanding of the mechanisms underlying how microorganisms resist extreme stress. Given the generality of the methods employed, we expect that this study will enable future studies on how regulatory networks adjust cellular physiology in a diversity of organisms.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA