RESUMO
Mass spectrometry (MS) is a valuable tool for plasma proteome profiling and disease biomarker discovery. However, wide-ranging plasma protein concentrations, along with technical and biological variabilities, present significant challenges for deep and reproducible protein quantitation. Here, we evaluated the qualitative and quantitative performance of timsTOF HT and timsTOF Pro 2 mass spectrometers for analysis of neat plasma samples (unfractionated) and plasma samples processed using the Proteograph Product Suite (Proteograph) that enables robust deep proteomics sampling prior to mass spectrometry. Samples were evaluated across a wide range of peptide loading masses and liquid chromatography (LC) gradients. We observed up to a 76% increase in total plasma peptide precursors identified and a >2-fold boost in quantifiable plasma peptide precursors (CV < 20%) with timsTOF HT compared to Pro 2. Additionally, approximately 4.5 fold more plasma peptide precursors were detected by both timsTOF HT and timsTOF Pro 2 in the Proteograph analyzed plasma vs neat plasma. In an exploratory analysis of 20 late-stage lung cancer and 20 control plasma samples with the Proteograph, which were expected to exhibit distinct proteomes, an approximate 50% increase in total and statistically significant plasma peptide precursors (q < 0.05) was observed with timsTOF HT compared to Pro 2. Our data demonstrate the superior performance of timsTOF HT for identifying and quantifying differences between biologically diverse samples, allowing for improved disease biomarker discovery in large cohort studies. Moreover, researchers can leverage data sets from this study to optimize their liquid chromatography-mass spectrometry (LC-MS) workflows for plasma protein profiling and biomarker discovery. (ProteomeXchange identifier: PXD047854 and PXD047839).
Assuntos
Proteínas Sanguíneas , Proteoma , Humanos , Reprodutibilidade dos Testes , Peptídeos , BiomarcadoresRESUMO
Climate change is globally affecting rainfall patterns, necessitating the improvement of drought tolerance in crops. Sorghum bicolor is a relatively drought-tolerant cereal. Functional stay-green sorghum genotypes can maintain green leaf area and efficient grain filling during terminal post-flowering water deprivation, a period of ~10 weeks. To obtain molecular insights into these characteristics, two drought-tolerant genotypes, BTx642 and RTx430, were grown in replicated control and terminal post-flowering drought field plots in California's Central Valley. Photosynthetic, photoprotective, and water dynamics traits were quantified and correlated with metabolomic data collected from leaves, stems, and roots at multiple timepoints during control and drought conditions. Physiological and metabolomic data were then compared to longitudinal RNA sequencing data collected from these two genotypes. The unique metabolic and transcriptomic response to post-flowering drought in sorghum supports a role for the metabolite galactinol in controlling photosynthetic activity through regulating stomatal closure in post-flowering drought. Additionally, in the functional stay-green genotype BTx642, photoprotective responses were specifically induced in post-flowering drought, supporting a role for photoprotection in the molecular response associated with the functional stay-green trait. From these insights, new pathways are identified that can be targeted to maximize yields under growth conditions with limited water.
RESUMO
Analysis of ion mobility spectrometry (IMS) data has been challenging and limited the full utility of these measurements. Unlike liquid chromatography-mass spectrometry, where a plethora of tools with well-established algorithms exist, the incorporation of the additional IMS dimension requires upgrading existing computational pipelines and developing new algorithms to fully exploit the advantages of the technology. We have recently reported MZA, a new and simple mass spectrometry data structure based on the broadly supported HDF5 format and created to facilitate software development. While this format is inherently supportive of application development, the availability of core libraries in popular programming languages with standard mass spectrometry utilities will facilitate fast software development and broader adoption of the format. To this end, we present a Python package, mzapy, for efficient extraction and processing of mass spectrometry data in the MZA format, especially for complex data containing ion mobility spectrometry dimension. In addition to raw data extraction, mzapy contains supporting utilities enabling tasks including calibration, signal processing, peak finding, and generating plots. Being implemented in pure Python and having minimal and largely standardized dependencies makes mzapy uniquely suited to application development in the multiomics domain. The mzapy package is free and open-source, includes comprehensive documentation, and is structured to support future extension to meet the evolving needs of the MS community. The software source code is freely available at https://github.com/PNNL-m-q/mzapy.
RESUMO
Lipids play essential roles in many biological processes and disease pathology, but unambiguous identification of lipids is complicated by the presence of multiple isomeric species differing by fatty acyl chain length, stereospecifically numbered (sn) position, and position/stereochemistry of double bonds. Conventional liquid chromatography-mass spectrometry (LC-MS/MS) analyses enable the determination of fatty acyl chain lengths (and in some cases sn position) and number of double bonds, but not carbon-carbon double bond positions. Ozone-induced dissociation (OzID) is a gas-phase oxidation reaction that produces characteristic fragments from lipids containing double bonds. OzID can be incorporated into ion mobility spectrometry (IMS)-MS instruments for the structural characterization of lipids, including additional isomer separation and confident assignment of double bond positions. The complexity and repetitive nature of OzID data analysis and lack of software tool support have limited the application of OzID for routine lipidomics studies. Here, we present an open-source Python tool, LipidOz, for the automated determination of lipid double bond positions from OzID-IMS-MS data, which employs a combination of traditional automation and deep learning approaches. Our results demonstrate the ability of LipidOz to robustly assign double bond positions for lipid standard mixtures and complex lipid extracts, enabling practical application of OzID for future lipidomics.
RESUMO
Modern mass spectrometry-based workflows employing hybrid instrumentation and orthogonal separations collect multidimensional data, potentially allowing deeper understanding in omics studies through adoption of artificial intelligence methods. However, the large volume of these rich spectra challenges existing data storage and access technologies, therefore precluding informatics advancements. We present MZA (pronounced m-za), the mass-to-charge (m/z) generic data storage and access tool designed to facilitate software development and artificial intelligence research in multidimensional mass spectrometry measurements. Composed of a data conversion tool and a simple file structure based on the HDF5 format, MZA provides easy, cross-platform and cross-programming language access to raw MS-data, enabling fast development of new tools in data science programming languages such as Python and R. The software executable, example MS-data and example Python and R scripts are freely available at https://github.com/PNNL-m-q/mza.
Assuntos
Inteligência Artificial , Software , Espectrometria de Massas/métodos , Linguagens de Programação , Armazenamento e Recuperação da InformaçãoRESUMO
Soil microorganisms provide key ecological functions that often rely on metabolic interactions between individual populations of the soil microbiome. To better understand these interactions and community processes, we used chitin, a major carbon and nitrogen source in soil, as a test substrate to investigate microbial interactions during its decomposition. Chitin was applied to a model soil consortium that we developed, "model soil consortium-2" (MSC-2), consisting of eight members of diverse phyla and including both chitin degraders and nondegraders. A multiomics approach revealed how MSC-2 community-level processes during chitin decomposition differ from monocultures of the constituent species. Emergent properties of both species and the community were found, including changes in the chitin degradation potential of Streptomyces species and organization of all species into distinct roles in the chitin degradation process. The members of MSC-2 were further evaluated via metatranscriptomics and community metabolomics. Intriguingly, the most abundant members of MSC-2 were not those that were able to metabolize chitin itself, but rather those that were able to take full advantage of interspecies interactions to grow on chitin decomposition products. Using a model soil consortium greatly increased our knowledge of how carbon is decomposed and metabolized in a community setting, showing that niche size, rather than species metabolic capacity, can drive success and that certain species become active carbon degraders only in the context of their surrounding community. These conclusions fill important knowledge gaps that are key to our understanding of community interactions that support carbon and nitrogen cycling in soil. IMPORTANCE The soil microbiome performs many functions that are key to ecology, agriculture, and nutrient cycling. However, because of the complexity of this ecosystem we do not know the molecular details of the interactions between microbial species that lead to these important functions. Here, we use a representative but simplified model community of bacteria to understand the details of these interactions. We show that certain species act as primary degraders of carbon sources and that the most successful species are likely those that can take the most advantage of breakdown products, not necessarily the primary degraders. We also show that a species phenotype, including whether it is a primary degrader or not, is driven in large part by the membership of the community it resides in. These conclusions are critical to a better understanding of the soil microbial interaction network and how these interactions drive central soil microbiome functions.
Assuntos
Quitina , Microbiota , Quitina/metabolismo , Solo/química , Microbiota/genética , Carbono , Nitrogênio/metabolismoRESUMO
Metaproteomics has been increasingly utilized for high-throughput characterization of proteins in complex environments and has been demonstrated to provide insights into microbial composition and functional roles. However, significant challenges remain in metaproteomic data analysis, including creation of a sample-specific protein sequence database. A well-matched database is a requirement for successful metaproteomics analysis, and the accuracy and sensitivity of PSM identification algorithms suffer when the database is incomplete or contains extraneous sequences. When matched DNA sequencing data of the sample is unavailable or incomplete, creating the proteome database that accurately represents the organisms in the sample is a challenge. Here, we leverage a de novo peptide sequencing approach to identify the sample composition directly from metaproteomic data. First, we created a deep learning model, Kaiko, to predict the peptide sequences from mass spectrometry data and trained it on 5 million peptide-spectrum matches from 55 phylogenetically diverse bacteria. After training, Kaiko successfully identified organisms from soil isolates and synthetic communities directly from proteomics data. Finally, we created a pipeline for metaproteome database generation using Kaiko. We tested the pipeline on native soils collected in Kansas, showing that the de novo sequencing model can be employed as an alternative and complementary method to construct the sample-specific protein database instead of relying on (un)matched metagenomes. Our pipeline identified all highly abundant taxa from 16S rRNA sequencing of the soil samples and uncovered several additional species which were strongly represented only in proteomic data.
Assuntos
Microbiota , Proteômica , Microbiota/genética , Peptídeos/análise , Peptídeos/genética , Proteoma/genética , Proteômica/métodos , RNA Ribossômico 16S/genética , SoloRESUMO
Ion trajectory simulation in mass spectrometry systems from injection to detection is technically challenging but very important for better understanding the ion dynamics in instrument development. Here, we present SimELIT (Simulator of Eulerian and Lagrangian Ion Trajectories), a novel ion trajectory simulation platform. SimELIT is built upon a suite of multiphysics solvers compiled into OpenFOAM (an open-source numerical solver library particularly used for computational mechanics), with a simple web-based graphical user interface (GUI) allowing users to define the details of OpenFOAM cases and run simulations. SimELIT is a modular program and can provide extensions of physics (e.g., gas flows, electrodynamic fields) and thus enable ion trajectory simulations from the ion source to detector. The current version (SimELIT) provides two numerical solvers for ion trajectory simulationsâ(1) a Lagrangian particle tracker in vacuum and (2) a Eulerian ion density solver in background gas in the presence of electric fields. Here, we describe the architecture of SimELIT, including its use of Docker and the React Framework, and demonstrate the computation of ion trajectories of multiple m/z values in a static/linear voltage drop in vacuum (across a 1 m long flight tube). Further, the drift motion of ions under 1 Torr pressure conditions in a static background (N2) gas through a 20 V/cm static electric field is shown. The results produced from SimELIT were compared with SIMION and theoretical estimates. In addition, we report the computation of ion trajectories in electrodynamic fields within a planar FAIMS device operating at atmospheric pressure.
RESUMO
A comprehensive understanding of host dependency factors for SARS-CoV-2 remains elusive. Here, we map alterations in host lipids following SARS-CoV-2 infection using nontargeted lipidomics. We find that SARS-CoV-2 rewires host lipid metabolism, significantly altering hundreds of lipid species to effectively establish infection. We correlate these changes with viral protein activity by transfecting human cells with each viral protein and performing lipidomics. We find that lipid droplet plasticity is a key feature of infection and that viral propagation can be blocked by small-molecule glycerolipid biosynthesis inhibitors. We find that this inhibition was effective against the main variants of concern (alpha, beta, gamma, and delta), indicating that glycerolipid biosynthesis is a conserved host dependency factor that supports this evolving virus.
Assuntos
COVID-19 , SARS-CoV-2 , Humanos , Lipídeos , Proteínas ViraisRESUMO
A comprehensive understanding of host dependency factors for SARS-CoV-2 remains elusive. We mapped alterations in host lipids following SARS-CoV-2 infection using nontargeted lipidomics. We found that SARS-CoV-2 rewires host lipid metabolism, altering 409 lipid species up to 64-fold relative to controls. We correlated these changes with viral protein activity by transfecting human cells with each viral protein and performing lipidomics. We found that lipid droplet plasticity is a key feature of infection and that viral propagation can be blocked by small-molecule glycerolipid biosynthesis inhibitors. We found that this inhibition was effective against the main variants of concern (alpha, beta, gamma, and delta), indicating that glycerolipid biosynthesis is a conserved host dependency factor that supports this evolving virus.
RESUMO
Current coronavirus disease 2019 (COVID-19) vaccines effectively reduce overall morbidity and mortality and are vitally important to controlling the pandemic. Individuals who previously recovered from COVID-19 have enhanced immune responses after vaccination (hybrid immunity) compared with their naïve-vaccinated peers; however, the effects of post-vaccination breakthrough infections on humoral immune response remain to be determined. Here, we measure neutralizing antibody responses from 104 vaccinated individuals, including those with breakthrough infections, hybrid immunity, and no infection history. We find that human immune sera after breakthrough infection and vaccination after natural infection broadly neutralize SARS-CoV-2 (severe acute respiratory coronavirus 2) variants to a similar degree. Although age negatively correlates with antibody response after vaccination alone, no correlation with age was found in breakthrough or hybrid immune groups. Together, our data suggest that the additional antigen exposure from natural infection substantially boosts the quantity, quality, and breadth of humoral immune response regardless of whether it occurs before or after vaccination.
Assuntos
Anticorpos Neutralizantes/biossíntese , Anticorpos Antivirais/biossíntese , Vacinas contra COVID-19/imunologia , COVID-19/prevenção & controle , SARS-CoV-2/imunologia , Vacinação , Adulto , Idoso , Animais , Anticorpos Neutralizantes/imunologia , Anticorpos Antivirais/imunologia , Antígenos Virais/imunologia , COVID-19/epidemiologia , COVID-19/imunologia , Chlorocebus aethiops , Ensaio de Imunoadsorção Enzimática , Humanos , Imunogenicidade da Vacina , Pessoa de Meia-Idade , Fagocitose , SARS-CoV-2/crescimento & desenvolvimento , SARS-CoV-2/isolamento & purificação , Glicoproteína da Espícula de Coronavírus/imunologia , Células THP-1 , Fatores de Tempo , Células Vero , Carga ViralRESUMO
Signal digitization is a commonly overlooked part of ion mobility-mass spectrometry (IMS-MS) workflows, yet it greatly affects signal-to-noise ratio and MS resolution measurements. Here, we report on the integration of a 2 GS/s, 14-bit ADC with structures for lossless ion manipulations (SLIM-IMS-MS) and compare the performance to a commonly used 8-bit ADC. The 14-bit ADC provided a reduction in the digitized noise by a factor of â¼6, owing largely to the use of smaller bit sizes. The low baseline allowed threshold voltage levels to be set very close to the MCP baseline voltage, allowing for as much signal to be acquired as possible without overloading or excessive digitization of MCP baseline noise. Analyses of Agilent tuning mixture ions and a mixture of heavy labeled phosphopeptides showed that the 14-bit ADC provided a â¼1.5-2× signal-to-noise (S/N) increase for high intensity ions, such as the Agilent tuning mixture ions and the 2+ and 3+ charge states of many phosphopeptide constituents. However, signal enhancements were as much as 10-fold for low intensity ions, and the 14-bit ADC enabled discernible signal intensities otherwise lost using an 8-bit digitizer. Additionally, the 14-bit ADC required â¼14-fold fewer mass spectra to be averaged to produce a mass spectrum with a similar S/N as the 8-bit ADC, demonstrating â¼10× higher measurement throughput. The high resolution, low baseline, and fast speed of the new 14-bit ADC enables high performance digitization of MS, IMS-MS, and SLIM-IMS-MS spectra and provides a much better picture of analyte profiles in complex mixtures.
RESUMO
Brown rot fungi release massive amounts of carbon from forest deadwood, particularly at high latitudes. These fungi degrade wood by generating small reactive oxygen species (ROS) to loosen lignocellulose, to then selectively remove carbohydrates. The ROS mechanism has long been considered the key adaptation defining brown rot wood decomposition, but recently, we found preliminary evidence that fungal glycoside hydrolases (GHs) implicated in early cell wall loosening might have been adapted to tolerate ROS stress and to synergize with ROS to loosen woody lignocellulose. In the current study, we found more specifically that side chain hemicellulases that help in the early deconstruction of the lignocellulosic complex are significantly more tolerant of ROS in the brown rot fungus Rhodonia placenta than in a white rot fungus (Trametes versicolor) and a soft rot fungus (Trichoderma reesei). Using proteomics to understand the extent of tolerance, we found that significant oxidation of secreted R. placenta proteins exposed to ROS was less than half of the oxidation observed for T. versicolor or T. reesei. The principal oxidative modifications observed in all cases were monooxidation and dioxidation/trioxidation (mainly in methionine and tryptophan residues), some of which were critical for enzyme activity. At the peptide level, we found that GHs in R. placenta were the least ROS affected among our tested fungi. These results confirm and describe underlying mechanisms of tolerance in early-secreted brown rot fungal hemicellulases. These enzymatic adaptations may have been as important as nonenzymatic ROS pathway adaptations in brown rot fungal evolution. IMPORTANCE Brown rot fungi play a critical role in carbon recycling and are of industrial interest. These fungi typically use reactive oxygen species (ROS) to indiscriminately "loosen" wood cell walls at the outset of decay. Brown rot fungi avoid oxidative stress associated with this ROS step by delaying the expression/secretion of many carbohydrate-active enzymes, but there are exceptions, notably some side chain hemicellulases, implicated in loosening lignocellulose. In this study, we provide enzyme activity and secretomic evidence that these enzymes in the brown rot model Rhodonia placenta are more ROS tolerant than the white and soft rot isolates tested. For R. placenta, and perhaps all brown rot lineages, these ROS tolerance adaptions may have played a long-overshadowed role in enabling brown rot.
Assuntos
Fungos/metabolismo , Secretoma , Estresse Fisiológico , Madeira/metabolismo , Madeira/microbiologia , Parede Celular/metabolismo , Proteínas Fúngicas/metabolismo , Glicosídeo Hidrolases/metabolismo , Hidrólise , Lignina/metabolismo , Oxirredução , Polyporales/metabolismoRESUMO
MOTIVATION: Ion mobility spectrometry (IMS) separations are increasingly used in conjunction with mass spectrometry (MS) for separation and characterization of ionized molecular species. Information obtained from IMS measurements includes the ion's collision cross section (CCS), which reflects its size and structure and constitutes a descriptor for distinguishing similar species in mixtures that cannot be separated using conventional approaches. Incorporating CCS into MS-based workflows can improve the specificity and confidence of molecular identification. At present, there is no automated, open-source pipeline for determining CCS of analyte ions in both targeted and untargeted fashion, and intensive user-assisted processing with vendor software and manual evaluation is often required. RESULTS: We present AutoCCS, an open-source software to rapidly determine CCS values from IMS-MS measurements. We conducted various IMS experiments in different formats to demonstrate the flexibility of AutoCCS for automated CCS calculation: (i) stepped-field methods for drift tube-based IMS (DTIMS), (ii) single-field methods for DTIMS (supporting two calibration methods: a standard and a new enhanced method) and (iii) linear calibration for Bruker timsTOF and non-linear calibration methods for traveling wave based-IMS in Waters Synapt and Structures for Lossless Ion Manipulations. We demonstrated that AutoCCS offers an accurate and reproducible determination of CCS for both standard and unknown analyte ions in various IMS-MS platforms, IMS-field methods, ionization modes and collision gases, without requiring manual processing. AVAILABILITY AND IMPLEMENTATION: https://github.com/PNNL-Comp-Mass-Spec/AutoCCS. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Demo datasets are publicly available at MassIVE (Dataset ID: MSV000085979).
Assuntos
Espectrometria de Mobilidade Iônica , Software , Espectrometria de Massas/métodos , ÍonsRESUMO
Predictive biogeochemical modeling requires data-model integration that enables explicit representation of the sophisticated roles of microbial processes that transform substrates. Data from high-resolution organic matter (OM) characterization are increasingly available and can serve as a critical resource for this purpose, but their incorporation into biogeochemical models is often prohibited due to an over-simplified description of reaction networks. To fill this gap, we proposed a new concept of biogeochemical modeling-termed substrate-explicit modeling-that enables parameterizing OM-specific oxidative degradation pathways and reaction rates based on the thermodynamic properties of OM pools. Based on previous developments in the literature, we characterized the resulting kinetic models by only two parameters regardless of the complexity of OM profiles, which can greatly facilitate the integration with reactive transport models for ecosystem simulations by alleviating the difficulty in parameter identification. The two parameters include maximal growth rate (µmax) and harvest volume (Vh) (i.e., the volume that a microbe can access for harvesting energy). For every detected organic molecule in a given sample, our approach provides a systematic way to formulate reaction kinetics from chemical formula, which enables the evaluation of the impact of OM character on biogeochemical processes across conditions. In a case study of two sites with distinct OM thermodynamics using ultra high-resolution metabolomics datasets derived from Fourier transform ion cyclotron resonance mass spectrometry analyses, our method predicted how oxidative degradation is primarily driven by thermodynamic efficiency of OM consistent with experimental rate measurements (as shown by correlation coefficients of up to 0.61), and how biogeochemical reactions can vary in response to carbon and/or oxygen limitations. Lastly, we showed that incorporation of enzymatic regulations into substrate-explicit models is critical for more reasonable predictions. This result led us to present integrative biogeochemical modeling as a unifying framework that can ideally describe the dynamic interplay among microbes, enzymes, and substrates to address advanced questions and hypotheses in future studies. Altogether, the new modeling concept we propose in this work provides a foundational platform for unprecedented predictions of biogeochemical and ecosystem dynamics through enhanced integration with diverse experimental data and extant modeling approaches.
RESUMO
Ion packets introduced from gates, ion funnel traps, and other conventional ion injection mechanisms produce ion pulse widths typically around a few microseconds or less for ion mobility spectrometry (IMS)-based separations on the order of 100 milliseconds. When such ion injection techniques are coupled with ultralong path length traveling wave (TW)-based IMS separations (i.e., on the order of seconds) using structures for lossless ion manipulations (SLIMs), typically very low ion utilization efficiency is achieved for continuous ion sources [e.g., electrospray ionization (ESI)]. Even with the ability to trap and accumulate much larger populations of ions than being conventionally feasible over longer time periods in SLIM devices, the subsequent long separations lead to overall low ion utilization. Here, we report the use of a highly flexible SLIM arrangement, enabling concurrent ion accumulation and separation and achieving near-complete ion utilization with ESI. We characterize the ion accumulation process in SLIM, demonstrate >98% ion utilization, and show both increased signal intensities and measurement throughput. This approach is envisioned to have broad utility to applications, for example, involving the fast detection of trace chemical species.
Assuntos
Espectrometria de Mobilidade Iônica/métodos , Razão Sinal-Ruído , Espectrometria de Massas por Ionização por ElectrosprayRESUMO
Zika virus (ZIKV), an arbovirus of global concern, remodels intracellular membranes to form replication sites. How ZIKV dysregulates lipid networks to allow this, and consequences for disease, is poorly understood. Here, we perform comprehensive lipidomics to create a lipid network map during ZIKV infection. We find that ZIKV significantly alters host lipid composition, with the most striking changes seen within subclasses of sphingolipids. Ectopic expression of ZIKV NS4B protein results in similar changes, demonstrating a role for NS4B in modulating sphingolipid pathways. Disruption of sphingolipid biosynthesis in various cell types, including human neural progenitor cells, blocks ZIKV infection. Additionally, the sphingolipid ceramide redistributes to ZIKV replication sites, and increasing ceramide levels by multiple pathways sensitizes cells to ZIKV infection. Thus, we identify a sphingolipid metabolic network with a critical role in ZIKV replication and show that ceramide flux is a key mediator of ZIKV infection.
Assuntos
Interações Hospedeiro-Patógeno , Esfingolipídeos/metabolismo , Proteínas não Estruturais Virais/metabolismo , Infecção por Zika virus/patologia , Zika virus/patogenicidade , Animais , Linhagem Celular Tumoral , Chlorocebus aethiops , Células HEK293 , Humanos , Lipidômica , Camundongos , Esfingolipídeos/análise , Células Vero , Replicação Viral , Zika virus/metabolismo , Infecção por Zika virus/virologiaRESUMO
Microbial communities organize into spatial patterns that are largely governed by interspecies interactions. This phenomenon is an important metric for understanding community functional dynamics, yet the use of spatial patterns for predicting microbial interactions is currently lacking. Here we propose supervised deep learning as a new tool for network inference. An agent-based model was used to simulate the spatiotemporal evolution of two interacting organisms under diverse growth and interaction scenarios, the data of which was subsequently used to train deep neural networks. For small-size domains (100 µm × 100 µm) over which interaction coefficients are assumed to be invariant, we obtained fairly accurate predictions, as indicated by an average R2 value of 0.84. In application to relatively larger domains (450 µm × 450 µm) where interaction coefficients are varying in space, deep learning models correctly predicted spatial distributions of interaction coefficients without any additional training. Lastly, we evaluated our model against real biological data obtained using Pseudomonas fluorescens and Escherichia coli co-cultures treated with polymeric chitin or N-acetylglucosamine, the hydrolysis product of chitin. While P. fluorescens can utilize both substrates for growth, E. coli lacked the ability to degrade chitin. Consistent with our expectations, our model predicted context-dependent interactions across two substrates, i.e., degrader-cheater relationship on chitin polymers and competition on monomers. The combined use of the agent-based model and machine learning algorithm successfully demonstrates how to infer microbial interactions from spatially distributed data, presenting itself as a useful tool for the analysis of more complex microbial community interactions.
RESUMO
The soil environment is constantly changing due to shifts in soil moisture, nutrient availability and other conditions. To contend with these changes, soil microorganisms have evolved a variety of ways to adapt to environmental perturbations, including regulation of gene expression. However, it is challenging to untangle the complex phenotypic response of the soil to environmental change, partly due to the absence of predictive modeling frameworks that can mechanistically link molecular-level changes in soil microorganisms to a community's functional phenotypes (or metaphenome). Towards filling this gap, we performed a combined analysis of metabolic and gene co-expression networks to explore how the soil microbiome responded to changes in soil moisture and nutrient conditions and to determine which genes were expressed under a given condition. Our integrated modeling approach revealed previously unknown, but critically important aspects of the soil microbiomes' response to environmental perturbations. Incorporation of metabolomic and transcriptomic data into metabolic reaction networks identified condition-specific signature genes that are uniquely associated with dry, wet, and glycine-amended conditions. A subsequent gene co-expression network analysis revealed that drought-associated genes occupied more central positions in a network model of the soil community, compared to the genes associated with wet, and glycine-amended conditions. These results indicate the occurrence of system-wide metabolic coordination when soil microbiomes cope with moisture or nutrient perturbations. Importantly, the approach that we demonstrate here to analyze large-scale multi-omics data from a natural soil environment is applicable to other microbiome systems for which multi-omics data are available.
Assuntos
Redes e Vias Metabólicas , Microbiota , Microbiologia do Solo , Proteínas de Bactérias/genética , Secas , Enzimas/genética , Regulação Bacteriana da Expressão Gênica , Redes Reguladoras de Genes , Glicina/farmacologia , Umidade , Kansas , Microbiota/genética , TranscriptomaRESUMO
An intriguing aspect in microbial communities is that pairwise interactions can be influenced by neighboring species. This creates context dependencies for microbial interactions that are based on the functional composition of the community. Context dependent interactions are ecologically important and clearly present in nature, yet firmly established theoretical methods are lacking from many modern computational investigations. Here, we propose a novel network inference method that enables predictions for interspecies interactions affected by shifts in community composition and species populations. Our approach first identifies interspecies interactions in binary communities, which is subsequently used as a basis to infer modulation in more complex multi-species communities based on the assumption that microbes minimize adjustments of pairwise interactions in response to neighbor species. We termed this rule-based inference minimal interspecies interaction adjustment (MIIA). Our critical assessment of MIIA has produced reliable predictions of shifting interspecies interactions that are dependent on the functional role of neighbor organisms. We also show how MIIA has been applied to a microbial community composed of competing soil bacteria to elucidate a new finding that - in many cases - adding fewer competitors could impose more significant impact on binary interactions. The ability to predict membership-dependent community behavior is expected to help deepen our understanding of how microbiomes are organized in nature and how they may be designed and/or controlled in the future.