1.
Proc Natl Acad Sci U S A ; 119(17): e2117814119, 2022 04 26.
Article in English | MEDLINE | ID: mdl-35446625

ABSTRACT

Building and changing a microbiome at will and maintaining it over hundreds of generations has so far proven challenging. Despite best efforts, complex microbiomes appear to be susceptible to large stochastic fluctuations. Current capabilities to assemble and control stable complex microbiomes are limited. Here, we propose a looped mass transfer design that stabilizes microbiomes over long periods of time. Five local microbiomes were continuously grown in parallel for over 114 generations and connected by a loop to a regional pool. Mass transfer rates were altered and microbiome dynamics were monitored using quantitative high-throughput flow cytometry and taxonomic sequencing of whole communities and sorted subcommunities. Increased mass transfer rates reduced local and temporal variation in microbiome assembly, did not affect functions, and overcame stochasticity, with all microbiomes exhibiting high constancy and increasing resistance. Mass transfer synchronized the structures of the five local microbiomes and nestedness of certain cell types was eminent. Mass transfer increased cell number and thus decreased net growth rates µ'. Subsets of cells that did not show net growth µ'SCx were rescued by the regional pool R and thus remained part of the microbiome. The loop in mass transfer ensured the survival of cells that would otherwise go extinct, even if they did not grow in all local microbiomes or grew more slowly than the actual dilution rate D would allow. The rescue effect, known from metacommunity theory, was the main stabilizing mechanism leading to synchrony and survival of subcommunities, despite differences in cell physiological properties, including growth rates.
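The rescue effect described above can be illustrated with a toy calculation. The sketch below is not the authors' model: it assumes a single subcommunity with constant net growth rate mu in a reactor with dilution rate D, and a constant inflow m * R from a regional pool of fixed density R. The function name and all parameter values are hypothetical.

```python
def simulate_subcommunity(mu, D, m, R, n0=1.0, dt=0.01, t_end=200.0):
    """Euler integration of dN/dt = (mu - D) * N + m * R.

    mu: net growth rate, D: dilution rate, m: transfer rate from the
    regional pool, R: pool density (held constant, an assumption).
    """
    n = n0
    for _ in range(int(t_end / dt)):
        n += dt * ((mu - D) * n + m * R)
    return n

# A subcommunity growing more slowly than it is diluted (mu < D).
isolated = simulate_subcommunity(mu=0.05, D=0.10, m=0.0, R=1.0)
looped = simulate_subcommunity(mu=0.05, D=0.10, m=0.02, R=1.0)

# Under these assumptions the looped case settles near m * R / (D - mu).
expected_steady_state = 0.02 * 1.0 / (0.10 - 0.05)
```

Without inflow the subcommunity decays towards zero, while with inflow it persists at a positive density, which is the qualitative behaviour the abstract attributes to the regional pool.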


Subjects
Microbiota , Biotechnology , Ecology
2.
Brief Bioinform ; 23(4)2022 07 18.
Article in English | MEDLINE | ID: mdl-35753697

ABSTRACT

Recent technological advances have led to an exponential expansion of biological sequence data and extraction of meaningful information through Machine Learning (ML) algorithms. This knowledge has improved the understanding of mechanisms related to several fatal diseases, e.g. cancer and coronavirus disease 2019, helping to develop innovative solutions, such as CRISPR-based gene editing, coronavirus vaccines and precision medicine. These advances benefit our society and economy, directly impacting people's lives in various areas, such as health care, drug discovery, forensic analysis and food processing. Nevertheless, ML-based approaches to biological data require representative, quantitative and informative features. Many ML algorithms can handle only numerical data, and therefore sequences need to be translated into a numerical feature vector. This process, known as feature extraction, is a fundamental step for developing high-quality ML-based models in bioinformatics, because it enables the feature engineering stage, with design and selection of suitable features. Feature engineering, ML algorithm selection and hyperparameter tuning are often manual and time-consuming processes, requiring extensive domain knowledge. To deal with this problem, we present a new package: BioAutoML. BioAutoML automatically runs an end-to-end ML pipeline, extracting numerical and informative features from biological sequence databases, using the MathFeature package, and automating the feature selection, ML algorithm(s) recommendation and tuning of the selected algorithm(s) hyperparameters, using Automated ML (AutoML). BioAutoML has two components, divided into four modules: (1) automated feature engineering (feature extraction and selection modules) and (2) Metalearning (algorithm recommendation and hyper-parameter tuning modules).
We experimentally evaluate BioAutoML in two different scenarios: (i) prediction of the three main classes of noncoding RNAs (ncRNAs) and (ii) prediction of the eight categories of ncRNAs in bacteria, including housekeeping and regulatory types. To assess BioAutoML predictive performance, it is experimentally compared with two other AutoML tools (RECIPE and TPOT). According to the experimental results, BioAutoML can accelerate new studies, reducing the cost of feature engineering processing and either keeping or improving predictive performance. BioAutoML is freely available at https://github.com/Bonidia/BioAutoML.
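As a minimal illustration of the feature extraction step described above (turning a sequence into a numerical feature vector), the sketch below computes relative k-mer frequencies. It does not use the BioAutoML or MathFeature APIs; the function name, the DNA alphabet, and the example sequence are assumptions made for illustration only.

```python
from collections import Counter
from itertools import product

def kmer_frequencies(sequence, k=2, alphabet="ACGT"):
    """Return relative k-mer frequencies in fixed lexicographic order."""
    seq = sequence.upper()
    # Enumerate every possible k-mer so the vector length is always len(alphabet)**k.
    kmers = ["".join(p) for p in product(alphabet, repeat=k)]
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    # Only k-mers made of alphabet symbols contribute to the total.
    total = sum(counts[km] for km in kmers)
    return [counts[km] / total if total else 0.0 for km in kmers]

# Hypothetical toy sequence; real inputs would come from FASTA records.
features = kmer_frequencies("ACGTACGT", k=2)
```

The resulting list has one entry per possible k-mer (16 for k=2) and can be passed to any ML algorithm that expects fixed-length numerical input.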


Subjects
COVID-19 Vaccines , COVID-19 , Algorithms , Bacteria/genetics , Humans , Machine Learning
3.
RNA Biol ; 21(1): 1-12, 2024 Jan.
Article in English | MEDLINE | ID: mdl-38528797

ABSTRACT

The accurate classification of non-coding RNA (ncRNA) sequences is pivotal for advanced non-coding genome annotation and analysis, a fundamental aspect of genomics that facilitates understanding of ncRNA functions and regulatory mechanisms in various biological processes. While traditional machine learning approaches have been employed for distinguishing ncRNA, these often necessitate extensive feature engineering. Recently, deep learning algorithms have provided advancements in ncRNA classification. This study presents BioDeepFuse, a hybrid deep learning framework integrating convolutional neural networks (CNN) or bidirectional long short-term memory (BiLSTM) networks with handcrafted features for enhanced accuracy. This framework employs a combination of k-mer one-hot, k-mer dictionary, and feature extraction techniques for input representation. Extracted features, when embedded into the deep network, enable optimal utilization of spatial and sequential nuances of ncRNA sequences. Using benchmark datasets and real-world RNA samples from bacterial organisms, we evaluated the performance of BioDeepFuse. Results exhibited high accuracy in ncRNA classification, underscoring the robustness of our tool in addressing complex ncRNA sequence data challenges. The effective melding of CNN or BiLSTM with external features heralds promising directions for future research, particularly in refining ncRNA classifiers and deepening insights into ncRNAs in cellular processes and disease manifestations. In addition to its original application in the context of bacterial organisms, the methodologies and techniques integrated into our framework can potentially render BioDeepFuse effective in various and broader domains.


Subjects
Deep Learning , RNA, Untranslated/genetics , Algorithms , RNA , Neural Networks, Computer
4.
Environ Sci Technol ; 58(5): 2346-2359, 2024 Feb 06.
Article in English | MEDLINE | ID: mdl-38267392

ABSTRACT

The ecological role of the viral community in the fate of antibiotic resistance genes (ARGs) (reduction vs proliferation) remains unclear in anaerobic digestion (AD). Metagenomics revealed a dominance of Siphoviridae and Podoviridae among 13,895 identified viral operational taxonomic units (vOTUs) within AD, and only 21 of the vOTUs carried ARGs, which accounted for only 0.57 ± 0.43% of the AD antibiotic resistome. Conversely, ARGs located on plasmids and integrative and conjugative elements accounted for more than 61.0%, indicating a substantial potential for conjugation in driving horizontal gene transfer of ARGs within AD. Virus-host prediction based on CRISPR spacer, tRNA, and homology matches indicated that most viruses (80.2%) could not infect across genera. Among 480 high-quality metagenome-assembled genomes, 95 carried ARGs and were considered putative antibiotic-resistant bacteria (pARB). Furthermore, lytic phages of 66 pARBs were identified and were devoid of ARGs, and virus/host abundance ratios with an average value of 71.7 indicated extensive viral activity and lysis. The infectivity of lytic phages was also elucidated through laboratory experiments concerning changes in the phage-to-host ratio, pH, and temperature. Although metagenomic evidence for dissemination of ARGs by phage transduction was found, the higher proportion of lytic phages infecting pARBs suggested that the viral community played a greater role in reducing ARB numbers than in spreading ARGs in AD.


Subjects
Anti-Bacterial Agents , Bacteriophages , Anti-Bacterial Agents/pharmacology , Anaerobiosis , Angiotensin Receptor Antagonists , Genes, Bacterial , Angiotensin-Converting Enzyme Inhibitors , Bacteria/genetics , Drug Resistance, Microbial/genetics , Bacteriophages/genetics , Metagenomics
5.
Environ Sci Technol ; 57(46): 18350-18361, 2023 Nov 21.
Article in English | MEDLINE | ID: mdl-37097211

ABSTRACT

For anaerobic mixed cultures performing microbial chain elongation, it is unclear how pH alterations affect the abundance of key players, microbial interactions, and community functioning in terms of medium-chain carboxylate yields. We explored pH effects on mixed cultures enriched in continuous anaerobic bioreactors representing closed model ecosystems. Gradual pH increase from 5.5 to 6.5 induced dramatic shifts in community composition, whereas product range and yields returned to previous states after transient fluctuations. To understand community responses to pH perturbations over long-term reactor operation, we applied Aitchison PCA clustering, linear mixed-effects models, and random forest classification on 16S rRNA gene amplicon sequencing and process data. Different pH preferences of two key chain elongation species (one Clostridium IV species related to Ruminococcaceae bacterium CPB6 and one Clostridium sensu stricto species related to Clostridium luticellarii) were determined. Network analysis revealed positive correlations of Clostridium IV with lactic acid bacteria, which switched from Olsenella to Lactobacillus along the pH increase, illustrating the plasticity of the food web in chain elongation communities. Despite long-term cultivation in closed systems over the pH shift experiment, the communities retained functional redundancy in fermentation pathways, reflected by the emergence of rare species and concomitant recovery of chain elongation functions.


Subjects
Resilience, Psychological , RNA, Ribosomal, 16S , Ecosystem , Bioreactors/microbiology , Fermentation , Hydrogen-Ion Concentration
6.
Nucleic Acids Res ; 49(D1): D743-D750, 2021 01 08.
Article in English | MEDLINE | ID: mdl-33221926

ABSTRACT

Metagenomics has become a standard strategy to comprehend the functional potential of microbial communities, including the human microbiome. Currently, the number of metagenomes in public repositories is increasing exponentially. The Sequence Read Archive (SRA) and MG-RAST are the two main repositories for metagenomic data. These databases allow scientists to reanalyze samples and explore new hypotheses. However, mining samples from them can be a limiting factor, since the metadata available in these repositories is often misannotated, misleading, and decentralized, creating an overly complex environment for sample reanalysis. The main goal of the HumanMetagenomeDB is to simplify the identification and use of public human metagenomes of interest. HumanMetagenomeDB version 1.0 contains metadata of 69 822 metagenomes. We standardized 203 attributes, based on standardized ontologies, describing host characteristics (e.g. sex, age and body mass index), diagnosis information (e.g. cancer, Crohn's disease and Parkinson's disease), location (e.g. country, longitude and latitude), sampling site (e.g. gut, lung and skin) and sequencing attributes (e.g. sequencing platform, average length and sequence quality). Further, HumanMetagenomeDB version 1.0 metagenomes encompass 58 countries, 9 main sample sites (i.e. body parts), 58 diagnoses and multiple ages, ranging from newborn to 91 years old. The HumanMetagenomeDB is publicly available at https://webapp.ufz.de/hmgdb/.


Subjects
Data Curation , Databases, Genetic/standards , Metadata/standards , Metagenome , Humans , Metagenomics , Reference Standards , User-Computer Interface
7.
J Appl Microbiol ; 132(4): 2795-2811, 2022 Apr.
Article in English | MEDLINE | ID: mdl-34995421

ABSTRACT

AIMS: How benzene is metabolized by microbes under anoxic conditions is not fully understood. Here, we studied the degradation pathways in a benzene-mineralizing, nitrate-reducing enrichment culture. METHODS AND RESULTS: Benzene mineralization was dependent on the presence of nitrate and correlated with the enrichment of a Peptococcaceae phylotype only distantly related to known anaerobic benzene degraders of this family. Its relative abundance decreased after benzene mineralization had terminated, while other abundant taxa (Ignavibacteriaceae, Rhodanobacteraceae and Brocadiaceae) slightly increased. Generally, the microbial community remained diverse despite the amendment of benzene as the sole organic carbon source, suggesting complex trophic interactions between different functional groups. A subunit of the putative anaerobic benzene carboxylase previously detected in Peptococcaceae was identified by metaproteomic analysis, suggesting that benzene was activated by carboxylation. Detection of proteins involved in anaerobic ammonium oxidation (anammox) indicates that benzene mineralization was accompanied by anammox, facilitated by nitrite accumulation and the presence of ammonium in the growth medium. CONCLUSIONS: The results suggest that benzene was activated by carboxylation and further assimilated by a novel Peptococcaceae phylotype. SIGNIFICANCE AND IMPACT OF THE STUDY: The results confirm the hypothesis that Peptococcaceae are important anaerobic benzene degraders.


Subjects
Microbiota , Nitrates , Anaerobiosis , Benzene/metabolism , Nitrates/metabolism , Oxidation-Reduction , Peptococcaceae/metabolism
8.
Nucleic Acids Res ; 48(D1): D626-D632, 2020 01 08.
Article in English | MEDLINE | ID: mdl-31728526

ABSTRACT

Microbiome studies focused on the genetic potential of microbial communities (metagenomics) have become standard within microbial ecology. MG-RAST and the Sequence Read Archive (SRA), the two main metagenome repositories, contain over 202 858 publicly available metagenomes, and this number has increased exponentially. However, mining databases can be challenging due to misannotated, misleading and decentralized data. The main goal of TerrestrialMetagenomeDB is to make it easier for scientists to find terrestrial metagenomes of interest that could be compared with novel datasets in meta-analyses. We defined terrestrial metagenomes as those that do not belong to marine environments. Further, we curated the database using text mining to assign potential descriptive keywords that better contextualize environmental aspects of terrestrial metagenomes, such as biomes and materials. TerrestrialMetagenomeDB release 1.0 includes 15 022 terrestrial metagenomes from SRA and MG-RAST. Together, the downloadable data amounts to 68 Tbp. In total, 199 terrestrial terms were divided into 14 categories. These metagenomes span 83 countries, 30 biomes and 7 main source materials. The TerrestrialMetagenomeDB is publicly available at https://webapp.ufz.de/tmdb.


Subjects
Computational Biology/methods , Databases, Genetic , Metadata , Metagenome , Data Mining , Ecology , Ecosystem , Genome, Bacterial , Geography , Internet , Soil Microbiology , User-Computer Interface
9.
Entropy (Basel) ; 24(10)2022 Oct 01.
Article in English | MEDLINE | ID: mdl-37420418

ABSTRACT

In recent years, there has been an exponential growth in sequencing projects due to accelerated technological advances, leading to a significant increase in the amount of data and resulting in new challenges for biological sequence analysis. Consequently, the use of techniques capable of analyzing large amounts of data has been explored, such as machine learning (ML) algorithms. ML algorithms are being used to analyze and classify biological sequences, despite the intrinsic difficulty in extracting and finding representative biological sequence methods suitable for them. Thereby, extracting numerical features to represent sequences makes it statistically feasible to use universal concepts from Information Theory, such as Tsallis and Shannon entropy. In this study, we propose a novel Tsallis entropy-based feature extractor to provide useful information to classify biological sequences. To assess its relevance, we prepared five case studies: (1) an analysis of the entropic index q; (2) performance testing of the best entropic indices on new datasets; (3) a comparison made with Shannon entropy and (4) generalized entropies; (5) an investigation of the Tsallis entropy in the context of dimensionality reduction. As a result, our proposal proved to be effective, being superior to Shannon entropy and robust in terms of generalization, and also potentially representative for collecting information in fewer dimensions compared with methods such as Singular Value Decomposition and Uniform Manifold Approximation and Projection.
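For orientation, the Tsallis entropy with entropic index q is commonly written as S_q = (1 - sum_i p_i^q) / (q - 1), and it reduces to Shannon entropy as q approaches 1. The sketch below applies this definition to k-mer frequencies of a sequence. It is an illustration of the general idea rather than the authors' extractor; the function name, the default parameters, and the example sequences are assumptions.

```python
import math
from collections import Counter

def tsallis_entropy(sequence, k=1, q=2.0):
    """Tsallis entropy of the k-mer distribution of a sequence.

    For q close to 1 the Shannon entropy (natural log) is returned,
    which is the limit of the Tsallis form.
    """
    counts = Counter(sequence[i:i + k] for i in range(len(sequence) - k + 1))
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence is shorter than k")
    probs = [c / total for c in counts.values()]
    if math.isclose(q, 1.0):
        return -sum(p * math.log(p) for p in probs)
    return (1.0 - sum(p ** q for p in probs)) / (q - 1.0)

# Hypothetical toy inputs: a uniform sequence and a homopolymer.
uniform_q2 = tsallis_entropy("ACGT", k=1, q=2.0)
uniform_q1 = tsallis_entropy("ACGT", k=1, q=1.0)
homopolymer = tsallis_entropy("AAAA", k=1, q=2.0)
```

Evaluating the function over several values of q and k yields a small numerical feature vector per sequence, which is the kind of representation the abstract compares against Shannon entropy.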

10.
Appl Microbiol Biotechnol ; 101(12): 5175-5188, 2017 Jun.
Article in English | MEDLINE | ID: mdl-28321487

ABSTRACT

Benzene is an aromatic compound and harmful for the environment. Biodegradation of benzene can reduce the toxicological risk after accidental or controlled release of this chemical in the environment. In this study, we further characterized an anaerobic continuous biofilm culture grown for more than 14 years on benzene with nitrate as electron acceptor. We determined steady state degradation rates, microbial community composition dynamics in the biofilm, and the initial anaerobic benzene degradation reactions. Benzene was degraded at a rate of 0.15 µmol/mg protein/day and a first-order rate constant of 3.04/day which was fourfold higher than rates reported previously. Bacteria belonging to the Peptococcaceae were found to play an important role in this anaerobic benzene-degrading biofilm culture, but also members of the Anaerolineaceae were predicted to be involved in benzene degradation or benzene metabolite degradation based on Illumina MiSeq analysis of 16S ribosomal RNA genes. Biomass retention in the reactor using a filtration finger resulted in reduction of benzene degradation capacity. Detection of the benzene carboxylase encoding gene, abcA, and benzoic acid in the culture vessel indicated that benzene degradation proceeds through an initial carboxylation step.
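To put the reported first-order rate constant of 3.04/day in perspective, the short calculation below converts it to a half-life. It assumes simple first-order decay, C(t) = C0 * exp(-k * t), which is a simplification of a continuous biofilm reactor; the variable names are illustrative.

```python
import math

k_per_day = 3.04  # first-order rate constant reported in the abstract

# Half-life under first-order kinetics: t_half = ln(2) / k.
half_life_days = math.log(2) / k_per_day
half_life_hours = half_life_days * 24

# Fraction of the initial benzene remaining after one day, exp(-k * t).
fraction_after_one_day = math.exp(-k_per_day * 1.0)
```

Under this assumption the half-life is roughly 0.23 days (about 5.5 hours), and a little under 5% of the initial benzene would remain after one day.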


Subjects
Bacteria/metabolism , Benzene/metabolism , Biodegradation, Environmental , Biofilms/growth & development , Denitrification , Microbial Consortia/physiology , Anaerobiosis , Bacteria/classification , Bacteria/drug effects , Bacteria/genetics , Benzene/pharmacology , Benzoic Acid/analysis , Biofilms/drug effects , Culture Media/chemistry , Microbial Consortia/drug effects , Microbial Consortia/genetics , Nitrates/metabolism , Peptococcaceae/classification , Peptococcaceae/genetics , Peptococcaceae/isolation & purification , Peptococcaceae/metabolism , RNA, Ribosomal, 16S/genetics
11.
Microbiol Spectr ; 12(3): e0291823, 2024 Mar 05.
Article in English | MEDLINE | ID: mdl-38289113

ABSTRACT

Wastewater is considered a reservoir of antimicrobial resistance genes (ARGs), where the abundant antimicrobial-resistant bacteria and mobile genetic elements facilitate horizontal gene transfer. However, the prevalence and extent of these phenomena in different taxonomic groups that inhabit wastewater are still not fully understood. Here, we determined the presence of ARGs in metagenome-assembled genomes (MAGs) and evaluated the risks of MAG-carrying ARGs in potential human pathogens. The potential of these ARGs to be transmitted horizontally or vertically was also determined. A total of 5,916 MAGs (completeness >50%, contamination <10%) were recovered, covering 68 phyla and 279 genera. MAGs were dereplicated into 1,204 genome operational taxonomic units (gOTUs) as a proxy for species ( average nucleotide identity >0.95). The dominant ARG classes detected were bacitracin, multi-drug, macrolide-lincosamide-streptogramin (MLS), glycopeptide, and aminoglycoside, and 10.26% of them were located on plasmids. The main hosts of ARGs belonged to Escherichia, Klebsiella, Acinetobacter, Gresbergeria, Mycobacterium, and Thauera. Our data showed that 253 MAGs carried virulence factor genes (VFGs) divided into 44 gOTUs, of which 45 MAGs were carriers of ARGs, indicating that potential human pathogens carried ARGs. Alarmingly, the MAG assigned as Escherichia coli contained 159 VFGs, of which 95 were located on chromosomes and 10 on plasmids. In addition to shedding light on the prevalence of ARGs in individual genomes recovered from activated sludge and wastewater, our study demonstrates a workflow that can identify antimicrobial-resistant pathogens in complex microbial communities. IMPORTANCE: Antimicrobial resistance (AMR) threatens the health of humans, animals, and natural ecosystems. In our study, an analysis of 165 metagenomes from wastewater revealed antibiotic-targeted alteration, efflux, and inactivation as the most prevalent AMR mechanisms. 
We identified several genera correlated with multiple ARGs, including Klebsiella, Escherichia, Acinetobacter, Nitrospira, Ottowia, Pseudomonas, and Thauera, which could have significant implications for AMR transmission. The abundance of bacA, mexL, and aph(3")-I in the genomes calls for their urgent management in wastewater. Our approach could be applied to different ecosystems to assess the risk of potential pathogens containing ARGs. Our findings highlight the importance of managing AMR in wastewater and can help design measures to reduce the transmission and evolution of AMR in these systems.


Subjects
Microbiota , Wastewater , Animals , Humans , Sewage/microbiology , Anti-Bacterial Agents/pharmacology , Metagenome , Genes, Bacterial/genetics , Drug Resistance, Bacterial/genetics , Bacteria , Interspersed Repetitive Sequences
12.
Comput Struct Biotechnol J ; 23: 2267-2276, 2024 Dec.
Article in English | MEDLINE | ID: mdl-38827228

ABSTRACT

Machine Learning (ML) algorithms have been important tools for the extraction of useful knowledge from biological sequences, particularly in healthcare, agriculture, and the environment. However, the categorical and unstructured nature of these sequences usually requires additional feature engineering steps before an ML algorithm can be efficiently applied. The addition of these steps to the ML algorithm creates a processing pipeline, known as end-to-end ML. Despite the excellent results obtained by applying end-to-end ML to biotechnology problems, the performance obtained depends on the expertise of the user in the components of the pipeline. In this work, we propose an end-to-end ML-based framework called BioPrediction-RPI, which can identify implicit interactions between sequences, such as pairs of non-coding RNA and proteins, without the need for specialized expertise in end-to-end ML. This framework applies feature engineering to represent each sequence by structural and topological features. These features are divided into feature groups and used to train partial models, whose partial decisions are combined into a final decision, which provides insights to the user through an interpretability report. In our experiments, the developed framework was competitive when compared with various expert-created models. We assessed BioPrediction-RPI with 12 datasets, where it presented equal or better performance than all tools in 40% to 100% of cases, depending on the experiment. Finally, BioPrediction-RPI can fine-tune models based on new data and perform at the same level as ML experts, democratizing end-to-end ML and increasing its access to those working in biological sciences.

13.
Mol Ecol Resour ; 24(2): e13904, 2024 Feb.
Article in English | MEDLINE | ID: mdl-37994269

ABSTRACT

Several computational frameworks and workflows exist that recover genomes of prokaryotes, eukaryotes and viruses from metagenomes. Yet, it is difficult for scientists with little bioinformatics experience to evaluate quality, annotate genes, dereplicate, assign taxonomy and calculate relative abundance and coverage of genomes belonging to different domains. MuDoGeR is a user-friendly tool tailored for those familiar with the Unix command-line environment that makes it easy to recover genomes of prokaryotes, eukaryotes and viruses from metagenomes, either alone or in combination. We tested MuDoGeR using 24 individual-isolated genomes and 574 metagenomes, demonstrating the applicability for a few samples and high throughput. While MuDoGeR can recover eukaryotic viral sequences, its characterization is predominantly skewed towards bacterial and archaeal viruses, reflecting the field's current state. However, acting as a dynamic wrapper, MuDoGeR is designed to constantly incorporate updates and integrate new tools, ensuring its ongoing relevance in the rapidly evolving field. MuDoGeR is open-source software available at https://github.com/mdsufz/MuDoGeR. Additionally, MuDoGeR is also available as a Singularity container.


Subjects
Metagenome , Viruses , Metagenomics , Software , Bacteria/genetics , Phylogeny , Viruses/genetics
14.
Microb Ecol ; 65(1): 232-44, 2013 Jan.
Article in English | MEDLINE | ID: mdl-22903086

ABSTRACT

To understand the functioning of sponges, knowledge of the structure of their associated microbial communities is necessary. However, our perception of sponge-associated microbiomes remains mainly restricted to marine ecosystems. Here, we report on the molecular diversity and composition of bacteria in the freshwater sponge Ephydatia fluviatilis inhabiting the artificial lake Vinkeveense Plassen, Utrecht, The Netherlands. Polymerase chain reaction-denaturing gradient gel electrophoresis (PCR-DGGE) fingerprints revealed that the apparent diversities within the domain Bacteria and the phylum Actinobacteria were lower in E. fluviatilis than in bulk water. Enrichment of specific PCR-DGGE bands in E. fluviatilis was detected. Furthermore, sponge- and bulk water-derived bacterial clone libraries differed with respect to bacterial community composition at the phylum level. E. fluviatilis-derived sequences were affiliated with six recognized phyla, i.e., Proteobacteria, Planctomycetes, Actinobacteria, Bacteroidetes, Chlamydiae and Verrucomicrobia, in order of relative abundance; next to the uncultured candidate phylum TM7 and one deeply rooted bacterial lineage of undefined taxonomy (BLUT). Actinobacteria, Proteobacteria, and Bacteroidetes were the dominant bacterial phyla in the freshwater clone library whereas sequences affiliated with Planctomycetes, Verrucomicrobia, Acidobacteria and Armatimonadetes were found at lower frequencies. Fine-tuned phylogenetic inference showed no or negligible overlaps between the E. fluviatilis and water-derived phylotypes within bacterial taxa such as Alphaproteobacteria, Bacteroidetes and Actinobacteria. We also ascertained the status of two alphaproteobacterial lineages as freshwater sponge-specific phylogenetic clusters, and report on high distinctiveness of other E. fluviatilis specific phylotypes, especially within the Bacteroidetes, Planctomycetes and Chlamydia taxa. 
This study supports the contention that the composition and diversity of bacteria in E. fluviatilis is partially driven by the host organism.


Subjects
Bacteria/classification , Metagenome , Phylogeny , Porifera/microbiology , Animals , Bacteria/genetics , Bacteria/isolation & purification , Biodiversity , DNA, Bacterial/genetics , Fresh Water/microbiology , Netherlands , RNA, Ribosomal, 16S/genetics , Sequence Analysis, DNA
15.
Viruses ; 15(12)2023 11 28.
Article in English | MEDLINE | ID: mdl-38140571

ABSTRACT

Huge phages have genomes larger than 200 kilobases, which are particularly interesting for their genetic inventory and evolution. We screened 165 wastewater metagenomes for the presence of viral sequences. After identifying over 600 potential huge phage genomes, we reduced the dataset using manual curation by excluding viral contigs that did not contain viral protein-coding genes or consisted of concatemers of several small phage genomes. This dataset showed seven fully annotated huge phage genomes. The phages grouped into distinct phylogenetic clades, likely forming new genera and families. A phylogenomic analysis between our huge phages and phages with smaller genomes, i.e., less than 200 kb, supported the hypothesis that huge phages have undergone convergent evolution. The genomes contained typical phage protein-coding genes, sequential gene cassettes for metabolic pathways, and complete inventories of tRNA genes covering all standard and rare amino acids. Our study showed a pipeline for huge phage analyses that may lead to new enzymes for therapeutic or biotechnological applications.


Subjects
Bacteriophages , Bacteriophages/genetics , Metagenome , Wastewater , Phylogeny , Genome, Viral
16.
Mol Ecol Resour ; 23(5): 1066-1076, 2023 Jul.
Article in English | MEDLINE | ID: mdl-36847735

ABSTRACT

As most eukaryotic genomes are yet to be sequenced, the mechanisms underlying their contribution to different ecosystem processes remain untapped. Although approaches to recovering prokaryotic genomes have become common in genome biology, few studies have tackled the recovery of eukaryotic genomes from metagenomes. This study assessed the reconstruction of microbial eukaryotic genomes using 6000 metagenomes from terrestrial and some transition environments using the EukRep pipeline. Only 215 metagenomic libraries yielded eukaryotic bins. Of the 447 eukaryotic bins recovered, 197 were classified at the phylum level. Streptophytes and fungi were the most represented clades with 83 and 73 bins, respectively. More than 78% of the obtained eukaryotic bins were recovered from samples whose biomes were classified as host-associated, aquatic, and anthropogenic terrestrial. However, only 93 bins were taxonomically assigned at the genus level and 17 bins at the species level. Completeness and contamination estimates were obtained for a total of 193 bins and averaged 44.64% (σ = 27.41%) and 3.97% (σ = 6.53%), respectively. Micromonas commoda was the most frequent taxon found, while Saccharomyces cerevisiae presented the highest completeness, probably because more reference genomes are available. Current measures of completeness are based on the presence of single-copy genes. However, mapping of the contigs from the recovered eukaryotic bins to the chromosomes of the reference genomes showed many gaps, suggesting that completeness measures should also include chromosome coverage. Recovering eukaryotic genomes will benefit significantly from long-read sequencing, development of tools for dealing with repeat-rich genomes, and improved reference genome databases.


Subjects
Eukaryota , Metagenome , Eukaryota/genetics , Ecosystem , Genome, Microbial , Fungi/genetics , Metagenomics
17.
mSystems ; 8(6): e0064323, 2023 Dec 21.
Article in English | MEDLINE | ID: mdl-38018967

ABSTRACT

IMPORTANCE: This is the most comprehensive study performed thus far on the biosynthetic potential within the Flavobacteriaceae family. Our findings reveal intertwined taxonomic and natural product biosynthesis diversification within the family. We posit that the carbohydrate, peptide, and secondary metabolism triad synergistically shaped the evolution of this keystone bacterial taxon, acting as major forces underpinning the broad host range and opportunistic-to-pathogenic behavior encompassed by species in the family. This study further breaks new ground for future research on select Flavobacteriaceae spp. as reservoirs of novel drug leads.


Subjects
Biological Products , Flavobacteriaceae , Biological Products/metabolism , Flavobacteriaceae/metabolism , Secondary Metabolism , Peptides/metabolism
18.
Microorganisms ; 11(1)2023 Jan 10.
Article in English | MEDLINE | ID: mdl-36677467

ABSTRACT

The recovery of metagenome-assembled genomes is biased towards the most abundant species in a given community. To improve the identification of species, even if only dominant species are recovered, we investigated the integration of flow cytometry cell sorting with bioinformatics tools to recover metagenome-assembled genomes. We used a cell culture of a wastewater microbial community as our model system. Cells were sorted by flow cytometry on the basis of fluorescence signals into sub-communities of the original community: dominant gates, low-abundance gates, and outer gates. Metagenome sequencing was performed for all groups. The unsorted community was used as control. We recovered a total of 24 metagenome-assembled genomes (MAGs) representing 11 species-level genome operational taxonomic units (gOTUs). In addition, 57 ribosomal operational taxonomic units (rOTUs) affiliated with 29 taxa at species level were reconstructed from metagenomic libraries. Our approach suggests a two-fold increase in the resolution when comparing sorted and unsorted communities. Our results also indicate that species abundance is one determinant of genome recovery from metagenomes, as we can recover taxa in the sorted libraries that are not present in the unsorted community. In conclusion, a combination of cell sorting and metagenomics allows the recovery of MAGs that go undetected without cell sorting.

19.
Front Microbiol ; 14: 1037845, 2023.
Article in English | MEDLINE | ID: mdl-36760505

ABSTRACT

Introduction: Every year, millions of deaths are associated with the increased spread of antimicrobial resistance genes (ARGs) in bacteria. With the increasing urbanization of the global population, the spread of ARGs in urban bacteria has become a more severe threat to human health. Methods: In this study, we used metagenome-assembled genomes (MAGs) recovered from 1,153 urban metagenomes in multiple urban locations to investigate the fate and occurrence of ARGs in urban bacteria. Additionally, we analyzed the occurrence of these ARGs on plasmids and estimated the virulence of the bacterial species. Results: Our results showed that multidrug and glycopeptide ARGs are ubiquitous among urban bacteria. Additionally, we analyzed the deterministic effects of phylogeny on the spread of these ARGs and found ARG classes that have a non-random distribution within the phylogeny of our recovered MAGs. However, few ARGs were found on plasmids and most of the recovered MAGs contained few virulence factors. Discussion: Our results suggest that the observed non-random spreads of ARGs are not due to the transfer of plasmids and that most of the bacteria observed in the study are unlikely to be virulent. Additional research is needed to evaluate whether the ubiquitous and widespread ARG classes will become entirely prevalent among urban bacteria and how they spread among phylogenetically distinct species.

20.
Front Microbiol ; 14: 1058350, 2023.
Article in English | MEDLINE | ID: mdl-36760511

ABSTRACT

Introduction: Currently there are sparse regulations regarding the discharge of antibiotics from wastewater treatment plants (WWTP) into river systems, making surface waters a latent reservoir for antibiotics and antibiotic resistance genes (ARGs). To better understand factors that influence the fate of ARGs in the environment and to foster surveillance of antibiotic resistance spreading in such habitats, several indicator genes have been proposed, including the integrase gene intI1 and the sulfonamide resistance genes sul1 and sul2. Methods: Here we used quantitative PCR and long-read nanopore sequencing to monitor the abundance of these indicator genes and ARGs present as class 1 integron gene cassettes in a river system from pristine source to WWTP-impacted water. ARG abundance was compared with the dynamics of the microbial communities determined via 16S rRNA gene amplicon sequencing, conventional water parameters and the concentration of sulfamethoxazole (SMX), sulfamethazine (SMZ) and sulfadiazine (SDZ). Results: Our results show that WWTP effluent was the principal source of all three sulfonamides with highest concentrations for SMX (median 8.6 ng/L), and of the indicator genes sul1, sul2 and intI1 with median relative abundance to 16S rRNA gene of 0.55, 0.77 and 0.65%, respectively. Downstream from the WWTP, water quality improved constantly, including lower sulfonamide concentrations, decreasing abundances of sul1 and sul2 and lower numbers and diversity of ARGs in the class 1 integron. The riverine microbial community partially recovered after receiving WWTP effluent, which was consolidated by a microbiome recovery model. Surprisingly, the relative abundance of intI1 increased 3-fold over 13 km of the river stretch, suggesting an internal gene multiplication. Discussion: We found no evidence that low amounts of sulfonamides in the aquatic environment stimulate the maintenance or even spread of corresponding ARGs. Nevertheless, class 1 integrons carrying various ARGs were still present 13 km downstream from the WWTP. Therefore, limiting the release of ARG-harboring microorganisms may be more crucial for restricting the environmental spread of antimicrobial resistance than attenuating ng/L concentrations of antibiotics.
