RESUMEN
MOTIVATION: Mathematical models of biological processes altered in cancer are built using the knowledge of complex networks of signaling pathways, detailing the molecular regulations inside different cell types, such as tumor cells, immune and other stromal cells. If these models mainly focus on intracellular information, they often omit a description of the spatial organization among cells and their interactions, and with the tumoral microenvironment. RESULTS: We present here a model of tumor cell invasion simulated with PhysiBoSS, a multiscale framework, which combines agent-based modeling and continuous time Markov processes applied on Boolean network models. With this model, we aim to study the different modes of cell migration and to predict means to block it by considering not only spatial information obtained from the agent-based simulation but also intracellular regulation obtained from the Boolean model.Our multiscale model integrates the impact of gene mutations with the perturbation of the environmental conditions and allows the visualization of the results with 2D and 3D representations. The model successfully reproduces single and collective migration processes and is validated on published experiments on cell invasion. In silico experiments are suggested to search for possible targets that can block the more invasive tumoral phenotypes. AVAILABILITY AND IMPLEMENTATION: https://github.com/sysbio-curie/Invasion_model_PhysiBoSS.
Asunto(s)
Modelos Biológicos , Modelos Teóricos , Humanos , Simulación por Computador , Transducción de Señal/genética , Invasividad Neoplásica , Microambiente TumoralRESUMEN
We need to effectively combine the knowledge from surging literature with complex datasets to propose mechanistic models of SARS-CoV-2 infection, improving data interpretation and predicting key targets of intervention. Here, we describe a large-scale community effort to build an open access, interoperable and computable repository of COVID-19 molecular mechanisms. The COVID-19 Disease Map (C19DMap) is a graphical, interactive representation of disease-relevant molecular mechanisms linking many knowledge sources. Notably, it is a computational resource for graph-based analyses and disease modelling. To this end, we established a framework of tools, platforms and guidelines necessary for a multifaceted community of biocurators, domain experts, bioinformaticians and computational biologists. The diagrams of the C19DMap, curated from the literature, are integrated with relevant interaction and text mining databases. We demonstrate the application of network analysis and modelling approaches by concrete examples to highlight new testable hypotheses. This framework helps to find signatures of SARS-CoV-2 predisposition, treatment response or prioritisation of drug candidates. Such an approach may help deal with new waves of COVID-19 or similar pandemics in the long-term perspective.
Asunto(s)
COVID-19/inmunología , Biología Computacional/métodos , Bases de Datos Factuales , SARS-CoV-2/inmunología , Programas Informáticos , Antivirales/uso terapéutico , COVID-19/genética , COVID-19/virología , Gráficos por Computador , Citocinas/genética , Citocinas/inmunología , Minería de Datos/estadística & datos numéricos , Regulación de la Expresión Génica , Interacciones Microbiota-Huesped/genética , Interacciones Microbiota-Huesped/inmunología , Humanos , Inmunidad Celular/efectos de los fármacos , Inmunidad Humoral/efectos de los fármacos , Inmunidad Innata/efectos de los fármacos , Linfocitos/efectos de los fármacos , Linfocitos/inmunología , Linfocitos/virología , Redes y Vías Metabólicas/genética , Redes y Vías Metabólicas/inmunología , Células Mieloides/efectos de los fármacos , Células Mieloides/inmunología , Células Mieloides/virología , Mapeo de Interacción de Proteínas , SARS-CoV-2/efectos de los fármacos , SARS-CoV-2/genética , SARS-CoV-2/patogenicidad , Transducción de Señal , Factores de Transcripción/genética , Factores de Transcripción/inmunología , Proteínas Virales/genética , Proteínas Virales/inmunología , Tratamiento Farmacológico de COVID-19RESUMEN
Mathematical models can serve as a tool to formalize biological knowledge from diverse sources, to investigate biological questions in a formal way, to test experimental hypotheses, to predict the effect of perturbations and to identify underlying mechanisms. We present a pipeline of computational tools that performs a series of analyses to explore a logical model's properties. A logical model of initiation of the metastatic process in cancer is used as a transversal example. We start by analysing the structure of the interaction network constructed from the literature or existing databases. Next, we show how to translate this network into a mathematical object, specifically a logical model, and how robustness analyses can be applied to it. We explore the visualization of the stable states, defined as specific attractors of the model, and match them to cellular fates or biological read-outs. With the different tools we present here, we explain how to assign to each solution of the model a probability and how to identify genetic interactions using mutant phenotype probabilities. Finally, we connect the model to relevant experimental data: we present how some data analyses can direct the construction of the network, and how the solutions of a mathematical model can also be compared with experimental data, with a particular focus on high-throughput data in cancer biology. A step-by-step tutorial is provided as a Supplementary Material and all models, tools and scripts are provided on an accompanying website: https://github.com/sysbio-curie/Logical_modelling_pipeline.
Asunto(s)
Modelos Biológicos , Transducción de Señal , Biología Computacional/métodos , Simulación por Computador , Bases de Datos Factuales , Enfermedad , Epistasis Genética , Redes Reguladoras de Genes , Humanos , Modelos Logísticos , Conceptos Matemáticos , Redes y Vías Metabólicas , Mutación , Metástasis de la Neoplasia/genética , Metástasis de la Neoplasia/patología , Metástasis de la Neoplasia/fisiopatología , Programas Informáticos , Biología de Sistemas/estadística & datos numéricosRESUMEN
Artificial intelligence (AI) technologies can play a key role in preventing, detecting, and monitoring epidemics. In this paper, we provide an overview of the recently published literature on the COVID-19 pandemic in four strategic areas: (1) triage, diagnosis, and risk prediction; (2) drug repurposing and development; (3) pharmacogenomics and vaccines; and (4) mining of the medical literature. We highlight how AI-powered health care can enable public health systems to efficiently handle future outbreaks and improve patient outcomes.
Asunto(s)
Inteligencia Artificial , COVID-19/terapia , Medicina de Precisión/métodos , Humanos , Pandemias , Investigación , SARS-CoV-2/aislamiento & purificaciónRESUMEN
MOTIVATION: Due to the complexity and heterogeneity of multicellular biological systems, mathematical models that take into account cell signalling, cell population behaviour and the extracellular environment are particularly helpful. We present PhysiBoSS, an open source software which combines intracellular signalling using Boolean modelling (MaBoSS) and multicellular behaviour using agent-based modelling (PhysiCell). RESULTS: PhysiBoSS provides a flexible and computationally efficient framework to explore the effect of environmental and genetic alterations of individual cells at the population level, bridging the critical gap from single-cell genotype to single-cell phenotype and emergent multicellular behaviour. PhysiBoSS thus becomes very useful when studying heterogeneous population response to treatment, mutation effects, different modes of invasion or isomorphic morphogenesis events. To concretely illustrate a potential use of PhysiBoSS, we studied heterogeneous cell fate decisions in response to TNF treatment. We explored the effect of different treatments and the behaviour of several resistant mutants. We highlighted the importance of spatial information on the population dynamics by considering the effect of competition for resources like oxygen. AVAILABILITY AND IMPLEMENTATION: PhysiBoSS is freely available on GitHub (https://github.com/sysbio-curie/PhysiBoSS), with a Docker image (https://hub.docker.com/r/gletort/physiboss/). It is distributed as open source under the BSD 3-clause license. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Asunto(s)
Modelos Genéticos , Transducción de Señal , Programas Informáticos , Genotipo , Humanos , Transducción de Señal/genética , Análisis de SistemasRESUMEN
In the present economy, difficulties to access energy sources are real drawbacks to maintain our current lifestyle. In fact, increasing interests have been gathered around efficient strategies to use energy sources that do not generate high CO2 titers. Thus, science-funding agencies have invested more resources into research on hydrogen among other biofuels as interesting energy vectors. This article reviews present energy challenges and frames it into the present fuel usage landscape. Different strategies for hydrogen production are explained and evaluated. Focus is on biological hydrogen production; fermentation and photon-fuelled hydrogen production are compared. Mathematical models in biology can be used to assess, explore and design production strategies for industrially relevant metabolites, such as biofuels. We assess the diverse construction and uses of genome-scale metabolic models of cyanobacterium Synechocystis sp. PCC6803 to efficiently obtain biofuels. This organism has been studied as a potential photon-fuelled production platform for its ability to grow from carbon dioxide, water and photons, on simple culture media. Finally, we review studies that propose production strategies to weigh this organism's viability as a biofuel production platform. Overall, the work presented in this review unveils the industrial capabilities of cyanobacterium Synechocystis sp. PCC6803 to evolve interesting metabolites as a clean biofuel production platform.
Asunto(s)
Biocombustibles , Biotecnología/métodos , Hidrógeno/metabolismo , Modelos Biológicos , Synechocystis/metabolismo , Biología de SistemasRESUMEN
In systems biology, mathematical models and simulations play a crucial role in understanding complex biological systems. Different modelling frameworks are employed depending on the nature and scales of the system under study. For instance, signalling and regulatory networks can be simulated using Boolean modelling, whereas multicellular systems can be studied using agent-based modelling. Herein, we present PhysiBoSS 2.0, a hybrid agent-based modelling framework that allows simulating signalling and regulatory networks within individual cell agents. PhysiBoSS 2.0 is a redesign and reimplementation of PhysiBoSS 1.0 and was conceived as an add-on that expands the PhysiCell functionalities by enabling the simulation of intracellular cell signalling using MaBoSS while keeping a decoupled, maintainable and model-agnostic design. PhysiBoSS 2.0 also expands the set of functionalities offered to the users, including custom models and cell specifications, mechanistic submodels of substrate internalisation and detailed control over simulation parameters. Together with PhysiBoSS 2.0, we introduce PCTK, a Python package developed for handling and processing simulation outputs, and generating summary plots and 3D renders. PhysiBoSS 2.0 allows studying the interplay between the microenvironment, the signalling pathways that control cellular processes and population dynamics, suitable for modelling cancer. We show different approaches for integrating Boolean networks into multi-scale simulations using strategies to study the drug effects and synergies in models of cancer cell lines and validate them using experimental data. PhysiBoSS 2.0 is open-source and publicly available on GitHub with several repositories of accompanying interoperable tools.
Asunto(s)
Modelos Biológicos , Neoplasias , Humanos , Simulación por Computador , Transducción de Señal , Modelos Teóricos , Análisis de Sistemas , Microambiente TumoralRESUMEN
Introduction: The COVID-19 Disease Map project is a large-scale community effort uniting 277 scientists from 130 Institutions around the globe. We use high-quality, mechanistic content describing SARS-CoV-2-host interactions and develop interoperable bioinformatic pipelines for novel target identification and drug repurposing. Methods: Extensive community work allowed an impressive step forward in building interfaces between Systems Biology tools and platforms. Our framework can link biomolecules from omics data analysis and computational modelling to dysregulated pathways in a cell-, tissue- or patient-specific manner. Drug repurposing using text mining and AI-assisted analysis identified potential drugs, chemicals and microRNAs that could target the identified key factors. Results: Results revealed drugs already tested for anti-COVID-19 efficacy, providing a mechanistic context for their mode of action, and drugs already in clinical trials for treating other diseases, never tested against COVID-19. Discussion: The key advance is that the proposed framework is versatile and expandable, offering a significant upgrade in the arsenal for virus-host interactions and other complex pathologies.
Asunto(s)
COVID-19 , Humanos , SARS-CoV-2 , Reposicionamiento de Medicamentos , Biología de Sistemas , Simulación por ComputadorRESUMEN
Cyanobacteria are photosynthetic prokaryotes that are promising 'low-cost' microbial cell factories due to their simple nutritional requirements and metabolic plasticity, and the availability of tools for their genetic manipulation. The unicellular non-nitrogen-fixing Synechocystis sp. PCC 6803 is the best studied cyanobacterial strain and its genome was the first to be sequenced. The vast amount of physiological and molecular data available, together with a relatively small genome, makes Synechocystis suitable for computational metabolic modelling and to be used as a photoautotrophic chassis in synthetic biology applications. To prepare it for the introduction of a synthetic hydrogen producing device, a Synechocystis sp. PCC 6803 deletion mutant lacking an active bidirectional hydrogenase (ΔhoxYH) was produced and characterized at different levels: physiological, proteomic and transcriptional. The results showed that, under conditions favouring hydrogenase activity, 17 of the 210 identified proteins had significant differential fold changes in comparisons of the mutant with the wild-type. Most of these proteins are related to the redox and energy state of the cell. Transcriptional studies revealed that only six genes encoding those proteins exhibited significant differences in transcript levels. Moreover, the mutant exhibits similar growth behaviour compared with the wild-type, reflecting Synechocystis plasticity and metabolic adaptability. Overall, this study reveals that the Synechocystis ΔhoxYH mutant is robust and can be used as a photoautotrophic chassis for the integration of synthetic constructs, i.e. molecular constructs assembled from well characterized biological and/or synthetic parts (e.g. promoters, regulators, coding regions, terminators) designed for a specific purpose.
Asunto(s)
Proteínas Bacterianas/metabolismo , Hidrógeno/metabolismo , Hidrogenasas/metabolismo , Synechocystis/enzimología , Synechocystis/genética , Proteínas Bacterianas/genética , Regulación Bacteriana de la Expresión Génica , Hidrogenasas/genética , Mutación , Synechocystis/metabolismoRESUMEN
The emergence of cell resistance in cancer treatment is a complex phenomenon that emerges from the interplay of processes that occur at different scales. For instance, molecular mechanisms and population-level dynamics such as competition and cell-cell variability have been described as playing a key role in the emergence and evolution of cell resistances. Multi-scale models are a useful tool for studying biology at very different times and spatial scales, as they can integrate different processes occurring at the molecular, cellular, and intercellular levels. In the present work, we use an extended hybrid multi-scale model of 3T3 fibroblast spheroid to perform a deep exploration of the parameter space of effective treatment strategies based on TNF pulses. To explore the parameter space of effective treatments in different scenarios and conditions, we have developed an HPC-optimized model exploration workflow based on EMEWS. We first studied the effect of the cells' spatial distribution in the values of the treatment parameters by optimizing the supply strategies in 2D monolayers and 3D spheroids of different sizes. We later study the robustness of the effective treatments when heterogeneous populations of cells are considered. We found that our model exploration workflow can find effective treatments in all the studied conditions. Our results show that cells' spatial geometry and population variability should be considered when optimizing treatment strategies in order to find robust parameter sets.
RESUMEN
Prostate cancer is the second most occurring cancer in men worldwide. To better understand the mechanisms of tumorigenesis and possible treatment responses, we developed a mathematical model of prostate cancer which considers the major signalling pathways known to be deregulated. We personalised this Boolean model to molecular data to reflect the heterogeneity and specific response to perturbations of cancer patients. A total of 488 prostate samples were used to build patient-specific models and compared to available clinical data. Additionally, eight prostate cell line-specific models were built to validate our approach with dose-response data of several drugs. The effects of single and combined drugs were tested in these models under different growth conditions. We identified 15 actionable points of interventions in one cell line-specific model whose inactivation hinders tumorigenesis. To validate these results, we tested nine small molecule inhibitors of five of those putative targets and found a dose-dependent effect on four of them, notably those targeting HSP90 and PI3K. These results highlight the predictive power of our personalised Boolean models and illustrate how they can be used for precision oncology.
Asunto(s)
Medicina de Precisión , Neoplasias de la Próstata , Carcinogénesis , Proteínas HSP90 de Choque Térmico , Humanos , Masculino , Medicina de Precisión/métodos , Neoplasias de la Próstata/tratamiento farmacológico , Neoplasias de la Próstata/genética , Transducción de SeñalRESUMEN
Dendrograms are a way to represent relationships between organisms. Nowadays, these are inferred based on the comparison of genes or protein sequences by taking into account their differences and similarities. The genetic material of choice for the sequence alignments (all the genes or sets of genes) results in distinct inferred dendrograms. In this work, we evaluate differences between dendrograms reconstructed with different methodologies and for different sets of organisms chosen at random from a much larger set. A statistical analysis is performed to estimate fluctuations between the results obtained from the different methodologies that allows us to validate a systematic approach, based on the comparison of the organisms' metabolic networks for inferring dendrograms. This has the advantage that it allows the comparison of organisms very far away in the evolutionary tree even if they have no known ortholog gene in common. Our results show that dendrograms built using information from metabolic networks are similar to the standard sequence-based dendrograms and can be a complement to them.
Asunto(s)
Algoritmos , Biología Computacional/métodos , Redes Reguladoras de Genes , Proteínas/clasificación , Alineación de Secuencia/métodos , Animales , Bases de Datos Factuales , Humanos , Modelos Moleculares , Filogenia , Proteínas/genética , Proteínas/metabolismo , Análisis de Secuencia de ProteínaRESUMEN
Logical models of cancer pathways are typically built by mining the literature for relevant experimental observations. They are usually generic as they apply for large cohorts of individuals. As a consequence, they generally do not capture the heterogeneity of patient tumors and their therapeutic responses. We present here a novel framework, referred to as PROFILE, to tailor logical models to a particular biological sample such as a patient tumor. This methodology permits to compare the model simulations to individual clinical data, i.e., survival time. Our approach focuses on integrating mutation data, copy number alterations (CNA), and expression data (transcriptomics or proteomics) to logical models. These data need first to be either binarized or set between 0 and 1, and can then be incorporated in the logical model by modifying the activity of the node, the initial conditions or the state transition rates. The use of MaBoSS, a tool based on Monte-Carlo kinetic algorithm to perform stochastic simulations on logical models results in model state probabilities, and allows for a semi-quantitative study of the model phenotypes and perturbations. As a proof of concept, we use a published generic model of cancer signaling pathways and molecular data from METABRIC breast cancer patients. For this example, we test several combinations of data incorporation and discuss that, with these data, the most comprehensive patient-specific cancer models are obtained by modifying the nodes' activity of the model with mutations, in combination or not with CNA data, and altering the transition rates with RNA expression. We conclude that these model simulations show good correlation with clinical data such as patients' Nottingham prognostic index (NPI) subgrouping and survival time. We observe that two highly relevant cancer phenotypes derived from personalized models, Proliferation and Apoptosis, are biologically consistent prognostic factors: patients with both high proliferation and low apoptosis have the worst survival rate, and conversely. Our approach aims to combine the mechanistic insights of logical modeling with multi-omics data integration to provide patient-relevant models. This work leads to the use of logical modeling for precision medicine and will eventually facilitate the choice of patient-specific drug treatments by physicians.
RESUMEN
The current consensus recognizes four main medulloblastoma subgroups (wingless, Sonic hedgehog, group 3 and group 4). While medulloblastoma subgroups have been characterized extensively at the (epi-)genomic and transcriptomic levels, the proteome and phosphoproteome landscape remain to be comprehensively elucidated. Using quantitative (phospho)-proteomics in primary human medulloblastomas, we unravel distinct posttranscriptional regulation leading to highly divergent oncogenic signaling and kinase activity profiles in groups 3 and 4 medulloblastomas. Specifically, proteomic and phosphoproteomic analyses identify aberrant ERBB4-SRC signaling in group 4. Hence, enforced expression of an activated SRC combined with p53 inactivation induces murine tumors that resemble group 4 medulloblastoma. Therefore, our integrative proteogenomics approach unveils an oncogenic pathway and potential therapeutic vulnerability in the most common medulloblastoma subgroup.
Asunto(s)
Neoplasias Cerebelosas/patología , Meduloblastoma/patología , Receptor ErbB-4/metabolismo , Familia-src Quinasas/metabolismo , Adolescente , Animales , Carcinogénesis/patología , Línea Celular Tumoral , Neoplasias Cerebelosas/genética , Cerebelo/patología , Niño , Preescolar , Femenino , Perfilación de la Expresión Génica , Regulación Neoplásica de la Expresión Génica , Humanos , Lactante , Masculino , Meduloblastoma/genética , Ratones , Ratones Transgénicos , Fosforilación , Proteoma/metabolismo , Proteómica/métodos , Transducción de Señal , Familia-src Quinasas/genéticaRESUMEN
The use of microorganisms as cell factories frequently requires extensive molecular manipulation. Therefore, the identification of genomic neutral sites for the stable integration of ectopic DNA is required to ensure a successful outcome. Here we describe the genome mapping and validation of five neutral sites in the chromosome of Synechocystis sp. PCC 6803, foreseeing the use of this cyanobacterium as a photoautotrophic chassis. To evaluate the neutrality of these loci, insertion/deletion mutants were produced, and to assess their functionality, a synthetic green fluorescent reporter module was introduced. The constructed integrative vectors include a BioBrick-compatible multiple cloning site insulated by transcription terminators, constituting robust cloning interfaces for synthetic biology approaches. Moreover, Synechocystis mutants (chassis) ready to receive purpose-built synthetic modules/circuits are also available. This work presents a systematic approach to map and validate chromosomal neutral sites in cyanobacteria, and that can be extended to other organisms.
Asunto(s)
Cianobacterias/genética , Synechocystis/genética , Procesos Autotróficos , Mapeo Cromosómico , Cianobacterias/crecimiento & desarrollo , Proteínas Fluorescentes Verdes , Mutación , Biología Sintética/métodosRESUMEN
The reconstruction of genome-scale metabolic models and their applications represent a great advantage of systems biology. Through their use as metabolic flux simulation models, production of industrially-interesting metabolites can be predicted. Due to the growing number of studies of metabolic models driven by the increasing genomic sequencing projects, it is important to conceptualize steps of reconstruction and analysis. We have focused our work in the cyanobacterium Synechococcus elongatus PCC7942, for which several analyses and insights are unveiled. A comprehensive approach has been used, which can be of interest to lead the process of manual curation and genome-scale metabolic analysis. The final model, iSyf715 includes 851 reactions and 838 metabolites. A biomass equation, which encompasses elementary building blocks to allow cell growth, is also included. The applicability of the model is finally demonstrated by simulating autotrophic growth conditions of Synechococcus elongatus PCC7942.
RESUMEN
BACKGROUND/AIMS: The influence of different parameters such as temperature, irradiance, nitrate concentration, pH, and an external carbon source on Synechocystis PCC 6803 growth was evaluated. METHODS: 4.5-ml cuvettes containing 2 ml of culture, a high-throughput system equivalent to batch cultures, were used with gas exchange ensured by the use of a Parafilm™ cover. The effect of the different variables on maximum growth was assessed by a multi-way statistical analysis. RESULTS: Temperature and pH were identified as the key factors. It was observed that Synechocystis cells have a strong influence on the external pH. The optimal growth temperature was 33°C while light-saturating conditions were reached at 40 µE·m⻲·s⻹. CONCLUSION: It was demonstrated that Synechocystis exhibits a marked difference in behavior between autotrophic and glucose-based mixotrophic conditions, and that nitrate concentrations did not have a significant influence, probably due to endogenous nitrogen reserves. Furthermore, a dynamic metabolic model of Synechocystis photosynthesis was developed to gain insights on the underlying mechanism enabling this cyanobacterium to control the levels of external pH. The model showed a coupled effect between the increase of the pH and ATP production which in turn allows a higher carbon fixation rate.
Asunto(s)
Procesos Autotróficos , Técnicas de Cultivo Celular por Lotes/métodos , Modelos Biológicos , Synechocystis/crecimiento & desarrollo , Técnicas Bacteriológicas/métodos , Técnicas Bacteriológicas/normas , Carbono/metabolismo , Procesos Heterotróficos , Concentración de Iones de Hidrógeno , Luz , Viabilidad Microbiana , Análisis Multivariante , Nitratos/metabolismo , Fotosíntesis , Synechocystis/metabolismo , TemperaturaRESUMEN
Synechocystis sp. PCC6803 is a model cyanobacterium capable of producing biofuels with CO(2) as carbon source and with its metabolism fueled by light, for which it stands as a potential production platform of socio-economic importance. Compilation and characterization of Synechocystis genome-scale metabolic model is a pre-requisite toward achieving a proficient photosynthetic cell factory. To this end, we report iSyn811, an upgraded genome-scale metabolic model of Synechocystis sp. PCC6803 consisting of 956 reactions and accounting for 811 genes. To gain insights into the interplay between flux activities and metabolic physiology, flux coupling analysis was performed for iSyn811 under four different growth conditions, viz., autotrophy, mixotrophy, heterotrophy, and light-activated heterotrophy (LH). Initial steps of carbon acquisition and catabolism formed the versatile center of the flux coupling networks, surrounded by a stable core of pathways leading to biomass building blocks. This analysis identified potential bottlenecks for hydrogen and ethanol production. Integration of transcriptomic data with the Synechocystis flux coupling networks lead to identification of reporter flux coupling pairs and reporter flux coupling groups - regulatory hot spots during metabolic shifts triggered by the availability of light. Overall, flux coupling analysis provided insight into the structural organization of Synechocystis sp. PCC6803 metabolic network toward designing of a photosynthesis-based production platform.
Asunto(s)
Ciclo del Carbono , Etanol/metabolismo , Synechocystis/genética , Synechocystis/metabolismo , Transcripción Genética , Hidrógeno/metabolismo , Redes y Vías Metabólicas/genética , Fotosíntesis/genéticaRESUMEN
BACKGROUND: Insects are associated with microorganisms that contribute to the digestion and processing of nutrients. The European Corn Borer (ECB) is a moth present world-wide, causing severe economical damage as a pest on corn and other crops. In the present work, we give a detailed view of the complexity of the microorganisms forming the ECB midgut microbiota with the objective of comparing the biodiversity of the midgut-associated microbiota and explore their potential as a source of genes and enzymes with biotechnological applications. METHODOLOGICAL/PRINCIPAL FINDINGS: A high-throughput sequencing approach has been used to identify bacterial species, genes and metabolic pathways, particularly those involved in plant-matter degradation, in two different ECB populations (field-collected vs. lab-reared population with artificial diet). Analysis of the resulting sequences revealed the massive presence of Staphylococcus warneri and Weissella paramesenteroides in the lab-reared sample. This enabled us to reconstruct both genomes almost completely. Despite the apparently low diversity, 208 different genera were detected in the sample, although most of them at very low frequency. By contrast, the natural population exhibited an even higher taxonomic diversity along with a wider array of cellulolytic enzyme families. However, in spite of the differences in relative abundance of major taxonomic groups, not only did both metagenomes share a similar functional profile but also a similar distribution of non-redundant genes in different functional categories. CONCLUSIONS/SIGNIFICANCE: Our results reveal a highly diverse pool of bacterial species in both O. nubilalis populations, with major differences: The lab-reared sample is rich in gram-positive species (two of which have almost fully sequenced genomes) while the field sample harbors mainly gram-negative species and has a larger set of cellulolytic enzymes. We have found a clear relationship between the diet and the midgut microbiota, which reveals the selection pressure of food on the community of intestinal bacteria.