Results 1 - 20 of 936
1.
Vavilovskii Zhurnal Genet Selektsii ; 28(3): 308-316, 2024 Jun.
Article in English | MEDLINE | ID: mdl-38952705

ABSTRACT

We report the results of taxonomic studies on members of the family Micrococcaceae that, according to the 16S rRNA, internal transcribed spacer 1 (ITS1), average nucleotide identity (ANI), and average amino acid identity (AAI) tests, are related to Kocuria rosea strain RCAM04488, a plant-growth-promoting rhizobacterium (PGPR) isolated from the rhizosphere of potato (Solanum tuberosum L.). In these studies, we used whole-genome phylogenetic tests and pangenomic analysis. According to the ANI > 95 % criterion, several known members of K. salina, K. polaris, and K. rosea (including K. rosea type strain ATCC 186T) that are related most closely to isolate RCAM04488 in the ITS1 test should be assigned to the same species with appropriate strain verification. However, these strains were isolated from strongly contrasting ecological and geographical habitats, which could not but affect their genotypes and phenotypes and which should be taken into account in evaluation of their systematic position. This contradiction was resolved by a pangenomic analysis, which showed that the strains differed strongly in the number of accessory and strain-specific genes determining their individuality and possibly their potential for adaptation to different ecological niches. Similar results were obtained in a full-scale AAI test against the UniProt database (about 250 million records), by using the AAI-profiler program and the proteome of K. rosea strain ATCC 186T as a query. According to the AAI > 65 % criterion, members of the genus Arthrobacter and several other genera belonging to the class Actinomycetes, with a very wide geographical and ecological range of sources of isolation, should be placed into the same genus as Kocuria. Within the paradigm with vertically inherited phylogenetic markers, this could be regarded as a signal for their following taxonomic reclassification. 
An important factor in this case may be the detailing of the gene composition of the strains and the taxonomic ratios resulting from analysis of the pangenomes of the corresponding clades.

2.
Natl Sci Rev ; 11(6): nwae188, 2024 Jun.
Article in English | MEDLINE | ID: mdl-38962716

ABSTRACT

Transposable elements (TEs) are ubiquitous genomic components that are hard to study because they are highly repetitive. Here we assembled 232 chromosome-level genomes based on long-read sequencing data. Coupling the 232 genomes with 15 existing assemblies, we developed a pan-TE map comprising both cultivated and wild Asian rice. We detected 177 084 high-quality TE variations and inferred their derived state using outgroups. We found that TEs were one source of phenotypic variation during rice domestication and differentiation. We identified 1246 genes whose expression variation was associated with TEs but not single-nucleotide polymorphisms (SNPs), such as OsRbohB, and validated OsRbohB's relative expression activity using a dual-luciferase (LUC) reporter assay system. Our pan-TE map allowed us to detect multiple novel loci associated with agronomic traits. Collectively, our findings highlight the contributions of TEs to domestication, differentiation and agronomic traits in rice, and the high-quality Asian rice pan-TE map we generated offers massive potential for gene cloning and molecular breeding.

3.
Genome Biol ; 25(1): 170, 2024 07 01.
Article in English | MEDLINE | ID: mdl-38951884

ABSTRACT

Microbial pangenome analysis identifies present or absent genes in prokaryotic genomes. However, current tools are limited when analyzing species with higher sequence diversity or higher taxonomic orders such as genera or families. The Roary ILP Bacterial core Annotation Pipeline (RIBAP) uses an integer linear programming approach to refine gene clusters predicted by Roary for identifying core genes. RIBAP successfully handles the complexity and diversity of Chlamydia, Klebsiella, Brucella, and Enterococcus genomes, outperforming other established and recent pangenome tools for identifying all-encompassing core genes at the genus level. RIBAP is a freely available Nextflow pipeline at github.com/hoelzer-lab/ribap and zenodo.org/doi/10.5281/zenodo.10890871.


Subject(s)
Genome, Bacterial , Molecular Sequence Annotation , Software , Brucella/genetics , Brucella/classification , Bacteria/genetics , Bacteria/classification , Chlamydia/genetics , Enterococcus/genetics , Klebsiella/genetics
4.
Front Mol Biosci ; 11: 1395450, 2024.
Article in English | MEDLINE | ID: mdl-38974320

ABSTRACT

Bacteriophages are the most prevalent biological entities in the biosphere. However, limitations in both medical relevance and sequencing technologies have led to a systematic underestimation of the genetic diversity within phages. This underrepresentation not only creates a significant gap in our understanding of phage roles across diverse biosystems but also introduces biases in computational models reliant on these data for training and testing. In this study, we focused on publicly available genomes of bacteriophages infecting high-priority ESKAPE pathogens to show the extent and impact of this underrepresentation. First, we demonstrate a stark underrepresentation of ESKAPE phage genomes within the public genome and protein databases. Next, a pangenome analysis of these ESKAPE phages reveals extensive sharing of core genes among phages infecting the same host. Furthermore, genome analyses and clustering highlight close nucleotide-level relationships among the ESKAPE phages, raising concerns about the limited diversity within current public databases. Lastly, we uncover a scarcity of unique lytic phages and phage proteins with antimicrobial activities against ESKAPE pathogens. This comprehensive analysis of the ESKAPE phages underscores the severity of underrepresentation and its potential implications. This lack of diversity in phage genomes may restrict the resurgence of phage therapy and cause biased outcomes in data-driven computational models due to incomplete and unbalanced biological datasets.

5.
Article in English | MEDLINE | ID: mdl-38995188

ABSTRACT

A Gram-negative, ellipsoidal to short-rod-shaped, motile bacterium was isolated from Beijing's urban air. The isolate exhibited the closest kinship with Noviherbaspirillum aerium 122213-3T, exhibiting 98.4 % 16S rRNA gene sequence similarity. Phylogenetic analyses based on 16S rRNA gene sequences and genomes showed that it clustered closely with N. aerium 122213-3T, thus forming a distinct phylogenetic lineage within the genus Noviherbaspirillum. The average nucleotide identity and digital DNA-DNA hybridization values between strain I16B-00201T and N. aerium 122213-3T were 84.6 and 29.4 %, respectively. The respiratory ubiquinone was ubiquinone 8. The major fatty acids (>10 %) were summed feature 3 (C16:1ω6c/C16:1ω7c, 43.3 %), summed feature 8 (C18:1ω7c/C18:1ω6c, 15.9 %) and C12:0 (11.0 %). The polyamine profile showed putrescine as the predominant compound. The polar lipid profile consisted of diphosphatidylglycerol, phosphatidylglycerol, phosphatidylethanolamine, phosphatidylcholine, unknown lipids and unknown phosphatidylaminolipids. The phenotypic, phylogenetic and chemotaxonomic results consistently supported that strain I16B-00201T represented a novel species of the genus Noviherbaspirillum, for which the name Noviherbaspirillum album sp. nov. is proposed, with I16B-00201T (=CPCC 100848T=KCTC 52095T) designated as the type strain. Its DNA G+C content is 59.4 mol%. Pan-genome analysis indicated that some Noviherbaspirillum species possess diverse nitrogen and aromatic compound metabolism pathways, suggesting their potential value in pollutant treatment.


Subject(s)
Air Microbiology , Bacterial Typing Techniques , Base Composition , DNA, Bacterial , Fatty Acids , Nucleic Acid Hybridization , Phospholipids , Phylogeny , RNA, Ribosomal, 16S , Sequence Analysis, DNA , Ubiquinone , RNA, Ribosomal, 16S/genetics , Beijing , DNA, Bacterial/genetics , Fatty Acids/analysis , Phospholipids/analysis
6.
mLife ; 3(2): 277-290, 2024 Jun.
Article in English | MEDLINE | ID: mdl-38948139

ABSTRACT

Most in silico evolutionary studies commonly assumed that core genes are essential for cellular function, while accessory genes are dispensable, particularly in nutrient-rich environments. However, this assumption is seldom tested genetically within the pangenome context. In this study, we conducted a robust pangenomic Tn-seq analysis of fitness genes in a nutrient-rich medium for Sinorhizobium strains with a canonical open pangenome. To evaluate the robustness of fitness category assignment, Tn-seq data for three independent mutant libraries per strain were analyzed by three methods, which indicates that the Hidden Markov Model (HMM)-based method is most robust to variations between mutant libraries and not sensitive to data size, outperforming the Bayesian and Monte Carlo simulation-based methods. Consequently, the HMM method was used to classify the fitness category. Fitness genes, categorized as essential (ES), advantage (GA), and disadvantage (GD) genes for growth, are enriched in core genes, while nonessential genes (NE) are over-represented in accessory genes. Accessory ES/GA genes showed a lower fitness effect than core ES/GA genes. Connectivity degrees in the cofitness network decrease in the order of ES, GD, and GA/NE. In addition to accessory genes, 1599 out of 3284 core genes display differential essentiality across test strains. Within the pangenome core, both shared quasi-essential (ES and GA) and strain-dependent fitness genes are enriched in similar functional categories. Our analysis demonstrates a considerable fuzzy essential zone determined by cofitness connectivity degrees in Sinorhizobium pangenome and highlights the power of the cofitness network in understanding the genetic basis of ever-increasing prokaryotic pangenome data.

7.
Res Sq ; 2024 Jun 11.
Article in English | MEDLINE | ID: mdl-38947078

ABSTRACT

Background: The Borreliaceae family includes many obligate parasitic bacterial species which are etiologically associated with a myriad of zoonotic borrelioses including Lyme disease and vector-borne relapsing fevers. Infections by the Borreliaceae are difficult to detect by both direct and indirect methods, often leading to delayed and missed diagnoses. Efforts to improve diagnoses center around the development of molecular diagnostics (MDx), but due to deep tissue sequestration of the causative spirochaetes and the lack of persistent bacteremias, even MDx assays suffer from a lack of sensitivity. Additionally, the extensive genomic heterogeneity among isolates, even within the same species, contributes to the lack of assay sensitivity, as single-target assays cannot provide universal coverage. This within-species heterogeneity is partly due to differences in replicon repertoires and genomic structures that have likely arisen to support the complex Borreliaceae lifecycle, in which these parasites have to survive in multiple hosts, each with unique immune responses. Results: We constructed a Borreliaceae family-level pangenome and characterized the phylogenetic relationships among the constituent taxa, which support the recent taxonomy of splitting the family into at least two genera. Gene content profiles were created for the majority of the Borreliaceae replicons, providing for the first time their unambiguous molecular typing. Conclusion: Our characterization of the Borreliaceae pan-genome supports the splitting of the former Borrelia genus into two genera and provides for the phylogenetic placement of several non-species designated isolates. Mining this family-level pangenome will enable precision diagnostics corresponding to gene content-driven clinical outcomes while also providing targets for interventions.

8.
AMB Express ; 14(1): 78, 2024 Jul 04.
Article in English | MEDLINE | ID: mdl-38965152

ABSTRACT

Urinary tract infections (UTI) by antibiotic-resistant and virulent K. pneumoniae are a growing concern. Understanding the genome and validating the genomic profile along with pangenome analysis will facilitate surveillance of high-risk clones of K. pneumoniae to underpin management strategies toward early detection. The present study aimed to generate complete genome sequences of Klebsiella spp. and to analyse the correlation of resistome with phenotypic antimicrobial resistance and of virulome with pathogenicity. To understand the resistome, pangenome and virulome in Klebsiella spp., ResFinder, CARD, IS Finder, PlasmidFinder, PHASTER, Roary and VFDB were used. The phenotypic susceptibility profiling identified the uropathogenic kp3 as exhibiting multidrug resistance. The resistome and in vitro antimicrobial profiling showed concordance for all the tested antibiotics against the study strains. Hypermucoviscosity was not observed for any of the test isolates; this phenotypic character matches perfectly with the absence of rmpA and magA genes. To the best of our knowledge, this is the first report on the presence of the ste, stf, stc and sti major fimbrial operons of Salmonella enterica serotype Typhimurium in the K. pneumoniae genome. The study identifies the discordance of virulome and virulence in Klebsiella spp. The complete genome analysis and phenotypic correlation identify uropathogenic K. pneumoniae kp3 as a carbapenem-resistant and virulent pathogen. The pangenome of K. pneumoniae was open, suggesting high genetic diversity. Diverse K serotypes were observed. Sequence typing reveals the prevalence of K. pneumoniae high-risk clones in catheterised UTI patients. The study also highlights the concordance of resistome and in vitro susceptibility tests.
Importantly, the study identifies the necessity of virulome and phenotypic virulence markers for timely diagnosis and immediate treatment for the management of high-risk K. pneumoniae clones.

9.
J Fish Biol ; 2024 Jun 17.
Article in English | MEDLINE | ID: mdl-38885946

ABSTRACT

Dusky kob (Argyrosomus japonicus) is a commercially important finfish, indigenous to South Africa, Australia, and China. Previous studies highlighted differences in genetic composition, life history, and morphology of the species across geographic regions. A draft genome sequence of 0.742 Gb (N50 = 5.49 Mb; BUSCO completeness = 97.8%) and 22,438 predicted protein-coding genes was generated for the South African (SA) conspecific. A comparison with the Chinese (CN) conspecific revealed a core set of 32,068 orthologous protein clusters across both genomes. The SA genome exhibited 440 unique clusters compared to 1928 unique clusters in the CN genome. Transportation and immune response processes were overrepresented among the SA accessory genome, whereas the CN accessory genome was enriched for immune response, DNA transposition, and sensory detection (FDR-adjusted p < 0.01). These unique clusters may represent an adaptive component of the species' pangenome that could explain population divergence due to differential environmental specialisation. Furthermore, 700 single-copy orthologues (SCOs) displayed evidence of positive selection between the SA and CN genomes, and globally these genomes shared only 92% similarity, suggesting they might be distinct species. These genes primarily play roles in metabolism and digestion, illustrating the evolutionary pathways that differentiate the species. Understanding these genomic mechanisms underlying adaptation and evolution within and between species provides valuable insights into growth and maturation of kob, traits that are particularly relevant to commercial aquaculture.

10.
Front Plant Sci ; 15: 1383914, 2024.
Article in English | MEDLINE | ID: mdl-38872883

ABSTRACT

To assess the genomic diversity of Fusarium oxysporum f. sp. lini strains and compile a comprehensive gene repertoire, we constructed a pangenome using 13 isolates from four different clonal lineages, each exhibiting distinct levels of virulence. Syntenic analyses of two selected genomes revealed significant chromosomal rearrangements unique to each genome. A comprehensive examination of both core and accessory pangenome content and diversity points at an open genome state. Additionally, Gene Ontology (GO) enrichment analysis indicated that non-core pangenome genes are associated with pathogen recognition and immune signaling. Furthermore, the Folini pansecterome, encompassing secreted proteins critical for fungal pathogenicity, primarily consists of three functional classes: effector proteins, CAZYmes, and proteases. These three classes account for approximately 3.5% of the pangenome. Each functional class within the pansecterome was meticulously annotated and characterized with respect to pangenome category distribution, PFAM domain frequency, and strain virulence assessment. This analysis revealed that highly virulent isolates have specific types of PFAM domains that are exclusive to them. Upon examining the repertoire of SIX genes known for virulence in other formae speciales, it was found that all isolates had a similar gene content except for two, which lacked SIX genes entirely.

11.
Front Microbiol ; 15: 1379500, 2024.
Article in English | MEDLINE | ID: mdl-38873165

ABSTRACT

Introduction: Faecalibacterium is one of the most abundant bacteria in the gut microbiota of healthy adults and is highly regarded as a next-generation probiotic. However, the functions of Faecalibacterium genomes from cultured strains and the distribution of different species in populations may differ among different sources. Methods: We performed an extensive analysis of pan-genomes, functions, and safety evaluation of 136 Faecalibacterium genomes collected from 10 countries. Results: The genomes grouped into 11 clusters, of which only five have been characterized and validly named. Over 80% of the accessory genes and unique genes of Faecalibacterium have unknown function, which reflects the importance of expanding the collection of Faecalibacterium strains. All the genomes have the potential to produce acetic acid and butyric acid. Nine clusters of Faecalibacterium were significantly enriched in healthy individuals compared with patients with type II diabetes. Discussion: This study provides a comprehensive view of the genomic characteristics and functions of culturable Faecalibacterium from the human gut, and enables clinical advances in the future.

12.
mSystems ; : e0015624, 2024 Jun 26.
Article in English | MEDLINE | ID: mdl-38920366

ABSTRACT

Strains across the Lactobacillaceae family form the basis for a trillion-dollar industry. Our understanding of the genomic basis for their key traits is fragmented, however, including the metabolism that is foundational to their industrial uses. Pangenome analysis of publicly available Lactobacillaceae genomes allowed us to generate genome-scale metabolic network reconstructions for 26 species of industrial importance. Their manual curation led to more than 75,000 gene-protein-reaction associations that were deployed to generate 2,446 genome-scale metabolic models. Cross-referencing genomes and known metabolic traits allowed for manual metabolic network curation and validation of the metabolic models. As a result, we provide the first pangenomic basis for metabolism in the Lactobacillaceae family and a collection of predictive computational metabolic models that enable a variety of practical uses. IMPORTANCE: Lactobacillaceae, a bacterial family foundational to a trillion-dollar industry, is increasingly relevant to biosustainability initiatives. Our study, leveraging approximately 2,400 genome sequences, provides a pangenomic analysis of Lactobacillaceae metabolism, creating over 2,400 curated and validated genome-scale models (GEMs). These GEMs successfully predict (i) unique, species-specific metabolic reactions; (ii) niche-enriched reactions that increase organism fitness; (iii) essential media components, offering insights into the global amino acid essentiality of Lactobacillaceae; and (iv) fermentation capabilities across the family, shedding light on the metabolic basis of Lactobacillaceae-based commercial products. This quantitative understanding of Lactobacillaceae metabolic properties and their genomic basis will have profound implications for the food industry and biosustainability, offering new insights and tools for strain selection and manipulation.

13.
Microbiol Spectr ; : e0052724, 2024 Jun 25.
Article in English | MEDLINE | ID: mdl-38916315

ABSTRACT

The presence of intermittently dispersed insertion sequences and transposases in the Mycobacterium tuberculosis (Mtb) genome makes intra-genome recombination events inevitable. Understanding their effect on the gene repertoires (GR), which may contribute to the development of drug-resistant Mtb, is critical. In this study, publicly available WGS data of clinical Mtb isolates (endemic region n = 2,601; non-endemic region n = 1,130) were de novo assembled, filtered, scaffolded into assemblies, and functionally annotated. Out of 2,601 Mtb WGS data sets from endemic regions, 2,184 (drug resistant/sensitive: 1,386/798) qualified as high quality. We identified 3,784 core genes, 123 softcore genes, 224 shell genes, and 762 cloud genes in the pangenome of Mtb clinical isolates from endemic regions. Sets of 33 and 39 genes showed positive and negative associations (P < 0.01) with drug resistance status, respectively. Gene ontology clustering showed compromised immunity to phages and impaired DNA repair in drug-resistant Mtb clinical isolates compared to the sensitive ones. Multidrug efflux pump repressor genes (Rv3830c and Rv3855c) and CRISPR genes (Rv2816c-19c) were absent in the drug-resistant Mtb. A separate WGS data analysis of drug-resistant Mtb clinical isolates from the Netherlands (n = 1,130) also showed the absence of CRISPR genes (Rv2816c-17c). This study highlights the role of CRISPR genes in drug resistance development in Mtb clinical isolates, helps in understanding its evolutionary trajectory, and identifies useful targets for diagnostics development. IMPORTANCE: The results from the present Pan-GWAS study comparing gene sets in drug-resistant and drug-sensitive Mtb clinical isolates revealed intricate presence-absence patterns of genes encoding DNA-binding proteins having gene regulatory as well as DNA modification and DNA repair roles.
Apart from the genes with known functions, some uncharacterized and hypothetical genes that seem to have a potential role in drug resistance development in Mtb were identified. We have been able to extrapolate many findings of the present study with the existing literature on the molecular aspects of drug-resistant Mtb, further strengthening the relevance of the results presented in this study.

14.
ISME Commun ; 4(1): ycae078, 2024 Jan.
Article in English | MEDLINE | ID: mdl-38915450

ABSTRACT

Wolbachia is a maternally inherited intracellular bacterium that infects a wide range of arthropods including mosquitoes. The endosymbiont is widely used in biocontrol strategies due to its capacity to modulate arthropod reproduction and limit pathogen transmission. Wolbachia infections in Culex spp. are generally assumed to be monoclonal but the potential presence of genetically distinct Wolbachia subpopulations within and between individual organs has not been investigated using whole genome sequencing. Here we reconstructed Wolbachia genomes from ovary and midgut metagenomes of single naturally infected Culex pipiens mosquitoes from Southern France to investigate patterns of intra- and inter-individual differences across mosquito organs. Our analyses revealed a remarkable degree of intra-individual conservancy among Wolbachia genomes from distinct organs of the same mosquito both at the level of gene presence-absence signal and single-nucleotide polymorphisms (SNPs). Yet, we identified several synonymous and non-synonymous substitutions between individuals, demonstrating the presence of some level of genomic heterogeneity among Wolbachia that infect the same C. pipiens field population. Overall, the absence of genetic heterogeneity within Wolbachia populations in a single individual confirms the presence of a dominant Wolbachia that is maintained under strong purifying forces of evolution.

16.
G3 (Bethesda) ; 2024 Jun 27.
Article in English | MEDLINE | ID: mdl-38934790

ABSTRACT

Reniform and root-knot nematodes are two of the most destructive pests of conventional upland cotton, Gossypium hirsutum L., and continue to be a major threat to cotton fiber production in semi-arid regions of the southern United States and Central America. Fortunately, naturally occurring tolerance to these nematodes has been identified in the Pima cotton species (G. barbadense) and several upland cotton varieties (G. hirsutum), which has led to a robust breeding program that has successfully introgressed and stacked these independent resistant traits into several upland cotton lineages with superior agronomic traits, e.g. BAR 32-30 and BARBREN-713. This work identifies the genomic variations of these nematode-tolerant accessions by comparing their respective genomes to the susceptible, high-quality fiber-producing parental line of this lineage: Phytogen 355 (PSC355). We discover several large genomic differences within marker regions that harbor putative resistance genes as well as expression mechanisms shared by the two resistant lines, with respect to the susceptible PSC355 parental line. This work emphasizes the utility of whole genome comparisons as a means of elucidating large and small nuclear differences by lineage and phenotype.

17.
Mob DNA ; 15(1): 13, 2024 Jun 26.
Article in English | MEDLINE | ID: mdl-38926873

ABSTRACT

BACKGROUND: Transposable Elements (TEs) are segments of DNA, typically a few hundred base pairs up to several tens of thousands bases long, that have the ability to generate new copies of themselves in the genome. Most existing methods used to identify TEs in a newly sequenced genome are based on their repetitive character, together with detection based on homology and structural features. As new high quality assemblies become more common, including the availability of multiple independent assemblies from the same species, an alternative strategy for identification of TE families becomes possible in which we focus on the polymorphism at insertion sites caused by TE mobility. RESULTS: We develop the idea of using the structural polymorphisms found in pangenomes to create a library of the TE families recently active in a species, or in a closely related group of species. We present a tool, pantera, that achieves this task, and illustrate its use both on species with well-curated libraries, and on new assemblies. CONCLUSIONS: Our results show that pantera is sensitive and accurate, tending to correctly identify complete elements with precise boundaries, and is particularly well suited to detect larger, low copy number TEs that are often undetected with existing de novo methods.

18.
J Fungi (Basel) ; 10(6), 2024 May 30.
Article in English | MEDLINE | ID: mdl-38921378

ABSTRACT

Candida auris is an emerging multidrug-resistant and opportunistic pathogenic yeast. Whole-genome sequencing analysis has defined five major clades, each from a distinct geographic region. The current study aimed to examine the genome of the C. auris 20-1498 strain, which is the first isolate of this fungus identified in Mexico. Based on whole-genome sequencing, the draft genome was found to contain 70 contigs. It had a total genome size of 12.86 Mbp, an N50 value of 1.6 Mbp, and an average guanine-cytosine (GC) content of 45.5%. Genome annotation revealed a total of 5432 genes encoding 5515 proteins. According to the genomic analysis, the C. auris 20-1498 strain belongs to clade IV (containing strains endemic to South America). Of the two genes (ERG11 and FKS1) associated with drug resistance in C. auris, one carried a mutation: the K143R substitution, located in a mutation hotspot of ERG11 (lanosterol 14-α-demethylase), an antifungal drug target. The focus on whole-genome sequencing and the identification of mutations linked to the drug resistance of fungi could lead to the discovery of new therapeutic targets and new antifungal compounds.

19.
mSystems ; : e0051624, 2024 Jun 27.
Article in English | MEDLINE | ID: mdl-38934546

ABSTRACT

Bacteroides fragilis is a Gram-negative commensal bacterium commonly found in the human colon, which differentiates into two genomospecies termed divisions I and II. Through a comprehensive collection of 694 B. fragilis whole genome sequences, we identify novel features distinguishing these divisions. Our study reveals a distinct geographic distribution with division I strains predominantly found in North America and division II strains in Asia. Additionally, division II strains are more frequently associated with bloodstream infections, suggesting a distinct pathogenic potential. We report differences between the two divisions in gene abundance related to metabolism, virulence, stress response, and colonization strategies. Notably, division II strains harbor more antimicrobial resistance (AMR) genes than division I strains. These findings offer new insights into the functional roles of division I and II strains, indicating specialized niches within the intestine and potential pathogenic roles in extraintestinal sites. IMPORTANCE: Understanding the distinct functions of microbial species in the gut microbiome is crucial for deciphering their impact on human health. Classifying division II strains as Bacteroides fragilis can lead to erroneous associations, as researchers may mistakenly attribute characteristics observed in division II strains to the more extensively studied division I B. fragilis. Our findings underscore the necessity of recognizing these divisions as separate species with distinct functions. We unveil new findings of differential gene prevalence between division I and II strains in genes associated with intestinal colonization and survival strategies, potentially influencing their role as gut commensals and their pathogenicity in extraintestinal sites. 
Despite the significant niche overlap and colonization patterns between these groups, our study highlights the complex dynamics that govern strain distribution and behavior, emphasizing the need for a nuanced understanding of these microorganisms.

20.
mSystems ; : e0014324, 2024 Jun 27.
Article in English | MEDLINE | ID: mdl-38934646

ABSTRACT

Staphylococcus aureus causes both hospital- and community-acquired infections in humans worldwide. Due to the high incidence of infection, S. aureus is also one of the most sampled and sequenced pathogens today, providing an outstanding resource to understand variation at the bacterial subspecies level. We processed and downsampled 83,383 public S. aureus Illumina whole-genome shotgun sequences and 1,263 complete genomes to produce 7,954 representative substrains. Pairwise comparison of average nucleotide identity revealed a natural boundary of 99.5% that could be used to define 145 distinct strains within the species. We found that intermediate frequency genes in the pangenome (present in 10%-95% of genomes) could be divided into those closely linked to strain background ("strain-concentrated") and those highly variable within strains ("strain-diffuse"). Non-core genes had different patterns of chromosome location. Notably, strain-diffuse genes were associated with prophages; strain-concentrated genes were associated with the νSaβ genome island and rare genes (<10% frequency) concentrated near the origin of replication. Antibiotic resistance genes were enriched in the strain-diffuse class, while virulence genes were distributed between strain-diffuse, strain-concentrated, core, and rare classes. This study shows how different patterns of gene movement help create strains as distinct subspecies entities and provide insight into the diverse histories of important S. aureus functions. IMPORTANCE: We analyzed the genomic diversity of Staphylococcus aureus, a globally prevalent bacterial species that causes serious infections in humans. Our goal was to build a genetic picture of the different strains of S. aureus and which genes may be associated with them. We reprocessed >84,000 genomes and subsampled to remove redundancy. We found that individual samples sharing >99.5% of their genome could be grouped into strains.
We also showed that a portion of genes that are present in intermediate frequency in the species are strongly associated with some strains but completely absent from others, suggesting a role in strain specificity. This work lays the foundation for understanding individual gene histories of the S. aureus species and also outlines strategies for processing large bacterial genomic data sets.
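As a rough illustration of the frequency classes named in the abstract above (rare genes at under 10% frequency, intermediate frequency genes in 10%-95% of genomes), the following Python sketch bins a gene by the fraction of genomes that carry it. It is only a sketch under stated assumptions: the function name, the treatment of genes above 95% as "core", the handling of the exact boundary values, and the example counts are all hypothetical and are not taken from the study.

```python
def frequency_class(n_present: int, n_genomes: int) -> str:
    """Bin a gene by the fraction of genomes in which it is present.

    Thresholds follow the abstract's wording: rare is below 10%,
    intermediate is 10%-95%. Labelling everything above 95% as "core"
    is an assumption made for this illustration.
    """
    if n_genomes <= 0 or not 0 <= n_present <= n_genomes:
        raise ValueError("need n_genomes > 0 and 0 <= n_present <= n_genomes")
    freq = n_present / n_genomes
    if freq < 0.10:
        return "rare"
    if freq <= 0.95:
        return "intermediate"
    return "core"  # assumed label for genes present in more than 95% of genomes


if __name__ == "__main__":
    n_genomes = 7954  # number of representative substrains reported in the abstract
    # Hypothetical presence counts, chosen only to exercise each class.
    for n_present in (120, 4000, 7900):
        print(n_present, frequency_class(n_present, n_genomes))
```

Splitting the intermediate class further into "strain-concentrated" and "strain-diffuse" genes would need per-strain presence data, which this sketch does not model.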
