Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 526
Filter
1.
BMC Microbiol ; 24(1): 326, 2024 Sep 06.
Article in English | MEDLINE | ID: mdl-39243017

ABSTRACT

BACKGROUND: ​​The genus Fusarium poses significant threats to food security and safety worldwide because numerous species of the fungus cause destructive diseases and/or mycotoxin contamination in crops. The adverse effects of climate change are exacerbating some existing threats and causing new problems. These challenges highlight the need for innovative solutions, including the development of advanced tools to identify targets for control strategies. DESCRIPTION: In response to these challenges, we developed the Fusarium Protein Toolkit (FPT), a web-based tool that allows users to interrogate the structural and variant landscape within the Fusarium pan-genome. The tool displays both AlphaFold and ESMFold-generated protein structure models from six Fusarium species. The structures are accessible through a user-friendly web portal and facilitate comparative analysis, functional annotation inference, and identification of related protein structures. Using a protein language model, FPT predicts the impact of over 270 million coding variants in two of the most agriculturally important species, Fusarium graminearum and F. verticillioides. To facilitate the assessment of naturally occurring genetic variation, FPT provides variant effect scores for proteins in a Fusarium pan-genome based on 22 diverse species. The scores indicate potential functional consequences of amino acid substitutions and are displayed as intuitive heatmaps using the PanEffect framework. CONCLUSION: FPT fills a knowledge gap by providing previously unavailable tools to assess structural and missense variation in proteins produced by Fusarium. FPT has the potential to deepen our understanding of pathogenic mechanisms in Fusarium, and aid the identification of genetic targets for control strategies that reduce crop diseases and mycotoxin contamination. Such targets are vital to solving the agricultural problems incited by Fusarium, particularly evolving threats resulting from climate change. Thus, FPT has the potential to contribute to improving food security and safety worldwide.


Subject(s)
Fungal Proteins , Fusarium , Internet , Fusarium/genetics , Fusarium/metabolism , Fusarium/classification , Fungal Proteins/genetics , Fungal Proteins/chemistry , Fungal Proteins/metabolism , Genome, Fungal/genetics , Genetic Variation , Models, Molecular , Software , Protein Conformation
2.
PeerJ ; 12: e17964, 2024.
Article in English | MEDLINE | ID: mdl-39221271

ABSTRACT

Background: Next-generation sequencing of Mycobacterium tuberculosis, the infectious agent causing tuberculosis, is improving the understanding of genomic diversity of circulating lineages and strain-types, and informing knowledge of drug resistance mutations. An increasingly popular approach to characterizing M. tuberculosis genomes (size: 4.4 Mbp) and variants (e.g., single nucleotide polymorphisms (SNPs)) involves the de novo assembly of sequence data. Methods: We compared the performance of genome assembly tools (Unicycler, RagOut, and RagTag) on sequence data from nine drug resistant M. tuberculosis isolates (multi-drug (MDR) n = 1; pre-extensively-drug (pre-XDR) n = 8) generated using Illumina HiSeq, Oxford Nanopore Technology (ONT) PromethION, and PacBio platforms. Results: Our investigation found that Unicycler-based assemblies had significantly higher genome completeness (~98.7%; p values = 0.01) compared to other assembler tools (RagOut = 98.6%, and RagTag = 98.6%). The genome assembly sizes (bp) across isolates and sequencers based on RagOut was significantly longer (p values < 0.001) (4,418,574 ± 8,824 bp) than Unicycler and RagTag assemblies (Unicycler = 4,377,642 ± 55,257 bp, and RagTag = 4,380,711 ± 51,164 bp). RagOut-based assemblies had the fewest contigs (~32) and the longest genome size (4,418,574 bp; vs. H37Rv reference size 4,411,532 bp) and therefore were chosen for downstream analysis. Pan-genome analysis of Illumina and PacBio hybrid assemblies revealed the greatest number of detected genes (4,639 genes; H37Rv reference contains 3,976 genes), while Illumina and ONT hybrid assemblies produced the highest number of SNPs. The number of genes from hybrid assemblies with ONT and PacBio long-reads (mean: 4,620 genes) was greater than short-read assembly alone (4,478 genes). All nine RagOut hybrid genome assemblies detected known mutations in genes associated with MDR-TB and pre-XDR-TB. Conclusions: Unicycler software performed the best in terms of achieving contiguous genomes, whereas RagOut improved the quality of Unicycler's genome assemblies by providing a longer genome size. Overall, our approach has demonstrated that short-read and long-read hybrid assembly can provide a more complete genome assembly than short-read assembly alone by detecting pan-genomes and more genes, including IS6110, and SNPs.


Subject(s)
Genome, Bacterial , High-Throughput Nucleotide Sequencing , Mycobacterium tuberculosis , Mycobacterium tuberculosis/genetics , Genome, Bacterial/genetics , High-Throughput Nucleotide Sequencing/methods , Humans , Polymorphism, Single Nucleotide/genetics , Sequence Analysis, DNA/methods
3.
Heliyon ; 10(15): e35513, 2024 Aug 15.
Article in English | MEDLINE | ID: mdl-39170454

ABSTRACT

Conventional breeding approaches have played a significant role in meeting the food demand remarkably well until now. However, the increasing population, yield plateaus in certain crops, and limited recombination necessitate using genomic resources for genomics-assisted crop improvement programs. As a result of advancements in the next-generation sequence technology, GABs have developed dramatically to characterize allelic variants and facilitate their rapid and efficient incorporation in crop improvement programs. Genomics-assisted breeding (GAB) has played an important role in harnessing the potential of modern genomic tools, exploiting allelic variation from genetic resources and developing cultivars over the past decade. The availability of pangenomes for major crops has been a significant development, albeit with varying degrees of completeness. Even though adopting these technologies is essentially determined on economic grounds and cost-effective assays, which create a wealth of information that can be successfully used to exploit the latent potential of crops. GAB has been instrumental in harnessing the potential of modern genomic resources and exploiting allelic variation for genetic enhancement and cultivar development. GAB strategies will be indispensable for designing future crops and are expected to play a crucial role in breeding climate-smart crop cultivars with higher nutritional value.

4.
Plant Biotechnol J ; 2024 Aug 02.
Article in English | MEDLINE | ID: mdl-39095952

ABSTRACT

Structural variations (SVs) are major genetic variants that can be involved in the origin, adaptation and domestication of species. However, the identification and characterization of SVs in Spinacia species are rare due to the lack of a pan-genome. Here, we report eight chromosome-scale assemblies of cultivated spinach and its two wild species. After integration with five existing assemblies, we constructed a comprehensive Spinacia pan-genome and identified 193 661 pan-SVs, which were genotyped in 452 Spinacia accessions. Our pan-SVs enabled genome-wide association study identified signals associated with sex and clarified the evolutionary direction of spinach. Most sex-linked SVs (86%) were biased to occur on the Y chromosome during the evolution of the sex-linked region, resulting in reduced Y-linked gene expression. The frequency of pan-SVs among Spinacia accessions further illustrated the contribution of these SVs to domestication, such as bolting time and seed dormancy. Furthermore, compared with SNPs, pan-SVs act as efficient variants in genomic selection (GS) because of their ability to capture missing heritability information and higher prediction accuracy. Overall, this study provides a valuable resource for spinach genomics and highlights the potential utility of pan-SV in crop improvement and breeding programmes.

5.
Genome Biol Evol ; 16(8)2024 Aug 05.
Article in English | MEDLINE | ID: mdl-39101619

ABSTRACT

The plant Arabidopsis thaliana is a model system used by researchers through much of plant research. Recent efforts have focused on discovering the genomic variation found in naturally occurring ecotypes isolated from around the world. These ecotypes have come from diverse climates and therefore have faced and adapted to a variety of abiotic and biotic stressors. The sequencing and comparative analysis of these genomes can offer insight into the adaptive strategies of plants. While there are a large number of ecotype genome sequences available, the majority were created using short-read technology. Mapping of short-reads containing structural variation to a reference genome bereft of that variation leads to incorrect mapping of those reads, resulting in a loss of genetic information and introduction of false heterozygosity. For this reason, long-read de novo sequencing of genomes is required to resolve structural variation events. In this article, we sequenced the genomes of eight natural variants of A. thaliana using nanopore sequencing. This resulted in highly contiguous assemblies with >95% of the genome contained within five contigs. The sequencing results from this study include five ecotypes from relict and African populations, an area of untapped genetic diversity. With this study, we increase the knowledge of diversity we have across A. thaliana ecotypes and contribute to ongoing production of an A. thaliana pan-genome.


Subject(s)
Arabidopsis , Ecotype , Genome, Plant , Arabidopsis/genetics , Chromosomes, Plant/genetics , Molecular Sequence Annotation , Genetic Variation
6.
J Fungi (Basel) ; 10(8)2024 Aug 14.
Article in English | MEDLINE | ID: mdl-39194902

ABSTRACT

The "cost of domestication" hypothesis suggests that the domestication of wild species increases the number, frequency, and/or proportion of deleterious genetic variants, potentially reducing their fitness in the wild. While extensively studied in domesticated species, this phenomenon remains understudied in fungi. Here, we used Saccharomyces cerevisiae, the world's oldest domesticated fungus, as a model to investigate the genomic characteristics of deleterious variants arising from fungal domestication. Employing a graph-based pan-genome approach, we identified 1,297,761 single nucleotide polymorphisms (SNPs), 278,147 insertion/deletion events (indels; <30 bp), and 19,967 non-redundant structural variants (SVs; ≥30 bp) across 687 S. cerevisiae isolates. Comparing these variants with synonymous SNPs (sSNPs) as neutral controls, we found that the majority of the derived nonsynonymous SNPs (nSNPs), indels, and SVs were deleterious. Heterozygosity was positively correlated with the impact of deleterious SNPs, suggesting a role of genetic diversity in mitigating their effects. The domesticated isolates exhibited a higher additive burden of deleterious SNPs (dSNPs) than the wild isolates, but a lower burden of indels and SVs. Moreover, the domesticated S. cerevisiae showed reduced rates of adaptive evolution relative to the wild S. cerevisiae. In summary, deleterious variants tend to be heterozygous, which may mitigate their harmful effects, but they also constrain breeding potential. Addressing deleterious alleles and minimizing the genetic load are crucial considerations for future S. cerevisiae breeding efforts.

7.
J Bioinform Comput Biol ; : 2450019, 2024 Aug 31.
Article in English | MEDLINE | ID: mdl-39215522

ABSTRACT

The graph of sequences represents the genetic variations of pan-genome concisely and space-efficiently than multiple linear reference genome. In order to accelerate aligning reads to the graph, an index of graph-based reference genomes is used to obtain candidate locations. However, the potential combinatorial explosion of nodes on the sequence graph leads to increasing the index space and maximum memory usage of alignment process considerably, especially for large-scale datasets. For this, existing methods typically attempt to prune complex regions, or extend the length of seeds, which sacrifices the recall of alignment algorithm despite reducing space usage slightly. We present the Sparse-index of Graph (SIG) and alignment algorithm SIG-Aligner, capable of indexing and aligning at the lower memory cost. SIG builds the non-overlapping minimizers index inside nodes of sequence graph and SIG-Aligner filters out most of the false positive matches by the method based on the pigeonhole principle. Compared to Giraffe, the results of computational experiments show that SIG achieves a significant reduction in index memory space ranging from 50% to 75% for the human pan-genome graphs, while still preserving superior or comparable accuracy of alignment and the faster alignment time.

8.
Int J Mol Sci ; 25(16)2024 Aug 14.
Article in English | MEDLINE | ID: mdl-39201547

ABSTRACT

Lactiplantibacillus (Lb.) plantarum is known as a benign bacterium found in various habitats, including the intestines of animals and fermented foods. Since animal intestines lack oxygen, while fermented foods provide a limited or more oxygen environment, this study aimed to investigate whether there were genetic differences in the growth of Lb. plantarum under aerobic vs. anaerobic conditions. Genomic analysis of Lb. plantarum obtained from five sources-animals, dairy products, fermented meat, fermented vegetables, and humans-was conducted. The analysis included not only an examination of oxygen-utilizing genes but also a comparative pan-genomic analysis to investigate evolutionary relationships between genomes. The ancestral gene analysis of the evolutionary pathway classified Lb. plantarum into groups A and B, with group A further subdivided into A1 and A2. It was confirmed that group A1 does not possess the narGHIJ operon, which is necessary for energy production under limited oxygen conditions. Additionally, it was found that group A1 has experienced more gene acquisition and loss compared to groups A2 and B. Despite an initial assumption that there would be genetic distinctions based on the origin (aerobic or anaerobic conditions), it was observed that such differentiation could not be attributed to the origin. However, the evolutionary process indicated that the loss of genes related to nitrate metabolism was essential in anaerobic or limited oxygen conditions, contrary to the initial hypothesis.


Subject(s)
Evolution, Molecular , Oxygen , Oxygen/metabolism , Phylogeny , Lactobacillus plantarum/genetics , Lactobacillus plantarum/metabolism , Genome, Bacterial , Anaerobiosis , Animals , Humans
9.
bioRxiv ; 2024 Jul 10.
Article in English | MEDLINE | ID: mdl-39026826

ABSTRACT

Aspergillus flavus is a clinically and agriculturally important saprotrophic fungus responsible for severe human infections and extensive crop losses. We analyzed genomic data from 250 (95 clinical and 155 environmental) A. flavus isolates from 9 countries, including 70 newly sequenced clinical isolates, to examine population and pan-genome structure and their relationship to pathogenicity. We identified five A. flavus populations, including a new population, D, corresponding to distinct clades in the genome-wide phylogeny. Strikingly, > 75% of clinical isolates were from population D. Accessory genes, including genes within biosynthetic gene clusters, were significantly more common in some populations but rare in others. Population D was enriched for genes associated with zinc ion binding, lipid metabolism, and certain types of hydrolase activity. In contrast to the major human pathogen Aspergillus fumigatus, A. flavus pathogenicity in humans is strongly associated with population structure, making it a great system for investigating how population-specific genes contribute to pathogenicity.

10.
BMC Genomics ; 25(1): 691, 2024 Jul 14.
Article in English | MEDLINE | ID: mdl-39004696

ABSTRACT

BACKGROUND: Muskoxen are important ecosystem components and provide food, economic opportunities, and cultural well-being for Indigenous communities in the Canadian Arctic. Between 2010 and 2021, Erysipelothrix rhusiopathiae was isolated from carcasses of muskoxen, caribou, a seal, and an Arctic fox during multiple large scale mortality events in the Canadian Arctic Archipelago. A single strain ('Arctic clone') of E. rhusiopathiae was associated with the mortalities on Banks, Victoria and Prince Patrick Islands, Northwest Territories and Nunavut, Canada (2010-2017). The objectives of this study were to (i) characterize the genomes of E. rhusiopathiae isolates obtained from more recent muskox mortalities in the Canadian Arctic in 2019 and 2021; (ii) identify and compare common virulence traits associated with the core genome and mobile genetic elements (i.e. pathogenicity islands and prophages) among Arctic clone versus other E. rhusiopathiae genomes; and iii) use pan-genome wide association studies (GWAS) to determine unique genetic contents of the Arctic clone that may encode virulence traits and that could be used for diagnostic purposes. RESULTS: Phylogenetic analyses revealed that the newly sequenced E. rhusiopathiae isolates from Ellesmere Island, Nunavut (2021) also belong to the Arctic clone. Of 17 virulence genes analysed among 28 Arctic clone isolates, four genes - adhesin, rhusiopathiae surface protein-A (rspA), choline binding protein-B (cbpB) and CDP-glycerol glycerophosphotransferase (tagF) - had amino acid sequence variants unique to this clone when compared to 31 other E. rhusiopathiae genomes. These genes encode proteins that facilitate E. rhusiopathiae to attach to the host endothelial cells and form biofilms. GWAS analyses using Scoary found several unique genes to be overrepresented in the Arctic clone. CONCLUSIONS: The Arctic clone of E. rhusiopathiae was associated with multiple muskox mortalities spanning over a decade and multiple Arctic islands with distances over 1000 km, highlighting the extent of its spatiotemporal spread. This clone possesses unique gene content, as well as amino acid variants in multiple virulence genes that are distinct from the other closely related E. rhusiopathiae isolates. This study establishes an essential foundation on which to investigate whether these differences are correlated with the apparent virulence of this specific clone through in vitro and in vivo studies.


Subject(s)
Erysipelothrix , Arctic Regions , Erysipelothrix/genetics , Erysipelothrix/pathogenicity , Erysipelothrix/isolation & purification , Canada , Animals , Virulence/genetics , Genomics , Genome, Bacterial , Phylogeny , Erysipelothrix Infections/microbiology , Virulence Factors/genetics , Genome-Wide Association Study , Genomic Islands
11.
Genome Biol Evol ; 16(7)2024 Jul 03.
Article in English | MEDLINE | ID: mdl-39007295

ABSTRACT

This study delves into the genomic features of 10 Vibrio strains collected from deep-sea hydrothermal vents in the Pacific Ocean, providing insights into their evolutionary history and ecological adaptations. Through sequencing and pan-genome analysis involving 141 Vibrio species, we found that deep-sea strains exhibit larger genomes with unique gene distributions, suggesting adaptation to the vent environment. The phylogenomic reconstruction of the investigated isolates revealed the presence of 2 main clades: The first is monophyletic, consisting exclusively of Vibrio alginolyticus, while the second forms a monophyletic clade comprising both Vibrio antiquarius and Vibrio diabolicus species, which were previously isolated from deep-sea vents. All strains carry virulence and antibiotic resistance genes related to those found in human pathogenic Vibrio species which may play a wider ecological role other than host infection in these environments. In addition, functional genomic analysis identified genes potentially related to deep-sea survival and stress response, alongside candidate genes encoding for novel antimicrobial agents. Ultimately, the pan-genome we generated represents a valuable resource for future studies investigating the taxonomy, evolution, and ecology of Vibrio species.


Subject(s)
Genome, Bacterial , Hydrothermal Vents , Phylogeny , Vibrio , Vibrio/genetics , Hydrothermal Vents/microbiology , Evolution, Molecular , Adaptation, Physiological/genetics , Pacific Ocean
12.
Int J Mol Sci ; 25(13)2024 Jun 28.
Article in English | MEDLINE | ID: mdl-39000250

ABSTRACT

Beef is a major global source of protein, playing an essential role in the human diet. The worldwide production and consumption of beef continue to rise, reflecting a significant trend. However, despite the critical importance of beef cattle resources in agriculture, the diversity of cattle breeds faces severe challenges, with many breeds at risk of extinction. The initiation of the Beef Cattle Genome Project is crucial. By constructing a high-precision functional annotation map of their genome, it becomes possible to analyze the genetic mechanisms underlying important traits in beef cattle, laying a solid foundation for breeding more efficient and productive cattle breeds. This review details advances in genome sequencing and assembly technologies, iterative upgrades of the beef cattle reference genome, and its application in pan-genome research. Additionally, it summarizes relevant studies on the discovery of functional genes associated with key traits in beef cattle, such as growth, meat quality, reproduction, polled traits, disease resistance, and environmental adaptability. Finally, the review explores the potential of telomere-to-telomere (T2T) genome assembly, structural variations (SVs), and multi-omics techniques in future beef cattle genetic breeding. These advancements collectively offer promising avenues for enhancing beef cattle breeding and improving genetic traits.


Subject(s)
Genome , Animals , Cattle/genetics , Genomics/methods , Breeding/methods , Whole Genome Sequencing/methods , Red Meat , Quantitative Trait Loci
13.
Microorganisms ; 12(7)2024 Jul 12.
Article in English | MEDLINE | ID: mdl-39065181

ABSTRACT

A Gram-negative, aerobic, rod-shaped, non-motile, yellow-pigmented bacterium, KMM 9835T, was isolated from the sediment sample obtained from the Amur Bay of the Sea of Japan seashore, Russia. Phylogenetic analyses based on the 16S rRNA gene and whole genome sequences positioned the novel strain KMM 9835T in the genus Mariniflexile as a separate line sharing the highest 16S rRNA gene sequence similarities of 96.6% and 96.2% with Mariniflexile soesokkakense RSSK-9T and Mariniflexile fucanivorans SW5T, respectively, and similarity values of <96% to other recognized Mariniflexile species. The average nucleotide identity and digital DNA-DNA hybridization values between strain KMM 9835T and M. soesokkakense KCTC 32427T, Mariniflexile gromovii KCTC 12570T, M. fucanivorans DSM 18792T, and M. maritimum M5A1MT were 83.0%, 82.5%, 83.4%, and 78.3% and 30.7%, 29.6%, 29.5%, and 24.4%, respectively. The genomic DNA GC content of strain KMM 9835T was 32.5 mol%. The dominant menaquinone was MK-6, and the major fatty acids were iso-C15:0, iso-C15:1ω10c, and C15:0. The polar lipids of strain KMM 9835T consisted of phosphatidylethanolamine, two unidentified aminolipids, an unidentified phospholipid, and six unidentified lipids. A pan-genome analysis showed that the KMM 9835T genome encoded 753 singletons. The annotated singletons were more often related to transport protein systems (SusC), transcriptional regulators (AraC, LytTR, LacI), and enzymes (glycosylases). The KMM 9835T genome was highly enriched in CAZyme-encoding genes, the proportion of which reached 7.3%. Moreover, the KMM 9835T genome was characterized by a high abundance of CAZyme gene families (GH43, GH28, PL1, PL10, CE8, and CE12), indicating its potential to catabolize pectin. This may represent part of an adaptation strategy facilitating microbial consumption of plant polymeric substrates in aquatic environments near shorelines and freshwater sources. Based on the combination of phylogenetic and phenotypic characterization, the marine sediment strain KMM 9835T (=KCTC 92792T) represents a novel species of the genus Mariniflexile, for which the name Mariniflexile litorale sp. nov. is proposed.

14.
Res Sq ; 2024 Jun 11.
Article in English | MEDLINE | ID: mdl-38947078

ABSTRACT

Background: The Borreliaceae family includes many obligate parasitic bacterial species which are etiologically associated with a myriad of zoonotic borrelioses including Lyme disease and vector-borne relapsing fevers. Infections by the Borreliaceae are difficult to detect by both direct and indirect methods, often leading to delayed and missed diagnoses. Efforts to improve diagnoses center around the development of molecular diagnostics (MDx), but due to deep tissue sequestration of the causative spirochaetes and the lack of persistent bacteremias, even MDx assays suffer from a lack of sensitivity. Additionally, the highly extensive genomic heterogeneity among isolates, even within the same species, contributes to the lack of assay sensitivity as single target assays cannot provide universal coverage. This within-species heterogeneity is partly due to differences in replicon repertoires and genomic structures that have likely arisen to support the complex Borreliaceae lifecycle in which these parasites have to survive in multiple hosts each with unique immune responses. Results: We constructed a Borreliaceae family-level pangenome and characterized the phylogenetic relationships among the constituent taxa which supports the recent taxonomy of splitting the family into at least two genera. Gene content pro les were created for the majority of the Borreliaceae replicons, providing for the first time their unambiguous molecular typing. Conclusion: Our characterization of the Borreliaceae pan-genome supports the splitting of the former Borrelia genus into two genera and provides for the phylogenetic placement of several non-species designated isolates. Mining this family-level pangenome will enable precision diagnostics corresponding to gene content-driven clinical outcomes while also providing targets for interventions.

15.
Arch Microbiol ; 206(8): 342, 2024 Jul 05.
Article in English | MEDLINE | ID: mdl-38967823

ABSTRACT

A novel mangrove soil-derived actinomycete, strain S2-29T, was found to be most closely related to Saccharopolyspora karakumensis 5K548T based on 16 S rRNA sequence (99.24% similarity) and genomic phylogenetic analyses. However, significant divergence in digital DNA-DNA hybridization, average nucleotide identity, and unique biosynthetic gene cluster possession distinguished S2-29T as a distinct Saccharopolyspora species. Pan genome evaluation revealed exceptional genomic flexibility in genus Saccharopolyspora, with > 95% accessory genome content. Strain S2-29T harbored 718 unique genes, largely implicated in energetic metabolisms, indicating different metabolic capacities from its close relatives. Several uncharacterized biosynthetic gene clusters in strain S2-29T highlighted the strain's untapped capacity to produce novel functional compounds with potential biotechnological applications. Designation as novel species Saccharopolyspora mangrovi sp. nov. (type strain S2-29T = JCM 34,548T = CGMCC 4.7716T) was warranted, expanding the known Saccharopolyspora diversity and ecology. The discovery of this mangrove-adapted strain advances understanding of the genus while highlighting an untapped source of chemical diversity.


Subject(s)
DNA, Bacterial , Genome, Bacterial , Phylogeny , RNA, Ribosomal, 16S , Saccharopolyspora , Soil Microbiology , Saccharopolyspora/genetics , Saccharopolyspora/metabolism , Saccharopolyspora/classification , RNA, Ribosomal, 16S/genetics , DNA, Bacterial/genetics , Multigene Family , Genomics , Sequence Analysis, DNA , Wetlands , Nucleic Acid Hybridization , Bacterial Typing Techniques
16.
BMC Genomics ; 25(1): 723, 2024 Jul 25.
Article in English | MEDLINE | ID: mdl-39054411

ABSTRACT

BACKGROUND: The genus Geobacillus and its associated taxa have been the focal point of numerous thermophilic biotechnological investigations, both at the whole cell and enzyme level. By contrast, comparatively little research has been done on its recently delineated sister genus, Parageobacillus. Here we performed pan-genomic analyses on a subset of publicly available Parageobacillus and Saccharococcus genomes to elucidate their biotechnological potential. RESULTS: Phylogenomic analysis delineated the compared taxa into two distinct genera, Parageobacillus and Saccharococcus, with P. caldoxylosilyticus isolates clustering with S. thermophilus in the latter genus. Both genera present open pan-genomes, with the species P. toebii being characterized with the highest novel gene accrual. Diversification of the two genera is driven through the variable presence of plasmids, bacteriophages and transposable elements. Both genera present a range of potentially biotechnologically relevant features, including a source of novel antimicrobials, thermostable enzymes including DNA-active enzymes, carbohydrate active enzymes, proteases, lipases and carboxylesterases. Furthermore, they present a number of metabolic pathways pertinent to degradation of complex hydrocarbons and xenobiotics and for green energy production. CONCLUSIONS: Comparative genomic analyses of Parageobacillus and Saccharococcus suggest that taxa in both of these genera can serve as a rich source of biotechnologically and industrially relevant secondary metabolites, thermostable enzymes and metabolic pathways that warrant further investigation.


Subject(s)
Bacillaceae , Genome, Bacterial , Genomics , Phylogeny , Genomics/methods , Bacillaceae/genetics , Bacillaceae/classification , Biotechnology
17.
Biologicals ; 87: 101782, 2024 Aug.
Article in English | MEDLINE | ID: mdl-39003966

ABSTRACT

Human cytomegalovirus (HCMV) is accountable for high morbidity in neonates and immunosuppressed individuals. Due to the high genetic variability of HCMV, current prophylactic measures are insufficient. In this study, we employed a pan-genome and reverse vaccinology approach to screen the target for efficient vaccine candidates. Four proteins, envelope glycoprotein M, UL41A, US23, and US28, were shortlisted based on cellular localization, high solubility, antigenicity, and immunogenicity. A total of 29 B-cell and 44 T-cell highly immunogenic and antigenic epitopes with high global population coverage were finalized using immunoinformatics tools and algorithms. Further, the epitopes that were overlapping among the finalized B-cell and T-cell epitopes were linked with suitable linkers to form various combinations of multi-epitopic vaccine constructs. Among 16 vaccine constructs, Vc12 was selected based on physicochemical and structural properties. The docking and molecular simulations of VC12 were performed, which showed its high binding affinity (-23.35 kcal/mol) towards TLR4 due to intermolecular hydrogen bonds, salt bridges, and hydrophobic interactions, and there were only minimal fluctuations. Furthermore, Vc12 eliciting a good response was checked for its expression in Escherichia coli through in silico cloning and codon optimization, suggesting it to be a potent vaccine candidate.


Subject(s)
Cytomegalovirus , Epitopes, T-Lymphocyte , Humans , Cytomegalovirus/immunology , Cytomegalovirus/genetics , Epitopes, T-Lymphocyte/immunology , Epitopes, T-Lymphocyte/genetics , Epitopes, T-Lymphocyte/chemistry , Cytomegalovirus Vaccines/immunology , Cytomegalovirus Vaccines/genetics , Epitopes, B-Lymphocyte/immunology , Epitopes, B-Lymphocyte/genetics , Epitopes, B-Lymphocyte/chemistry , Vaccinology/methods , Genome, Viral , Cytomegalovirus Infections/prevention & control , Cytomegalovirus Infections/immunology , Molecular Docking Simulation
18.
Natl Sci Rev ; 11(6): nwae188, 2024 Jun.
Article in English | MEDLINE | ID: mdl-38962716

ABSTRACT

Transposable elements (TEs) are ubiquitous genomic components and hard to study due to being highly repetitive. Here we assembled 232 chromosome-level genomes based on long-read sequencing data. Coupling the 232 genomes with 15 existing assemblies, we developed a pan-TE map comprising both cultivated and wild Asian rice. We detected 177 084 high-quality TE variations and inferred their derived state using outgroups. We found TEs were one source of phenotypic variation during rice domestication and differentiation. We identified 1246 genes whose expression variation was associated with TEs but not single-nucleotide polymorphisms (SNPs), such as OsRbohB, and validated OsRbohB's relative expression activity using a dual-Luciferase (LUC) reporter assays system. Our pan-TE map allowed us to detect multiple novel loci associated with agronomic traits. Collectively, our findings highlight the contributions of TEs to domestication, differentiation and agronomic traits in rice, and there is massive potential for gene cloning and molecular breeding by the high-quality Asian pan-TE map we generated.

19.
Article in English | MEDLINE | ID: mdl-38995188

ABSTRACT

A Gram-negative, ellipsoidal to short-rod-shaped, motile bacterium was isolated from Beijing's urban air. The isolate exhibited the closest kinship with Noviherbaspirillum aerium 122213-3T, exhibiting 98.4 % 16S rRNA gene sequence similarity. Phylogenetic analyses based on 16S rRNA gene sequences and genomes showed that it clustered closely with N. aerium 122213-3T, thus forming a distinct phylogenetic lineage within the genus Noviherbaspirillum. The average nucleotide identity and digital DNA-DNA hybridization values between strain I16B-00201T and N. aerium 122213-3T were 84.6 and 29.4 %, respectively. The respiratory ubiquinone was ubiquinone 8. The major fatty acids (>10 %) were summed feature 3 (C16:1ω6c/C16:1ω7c, 43.3 %), summed feature 8 (C18:1ω7c/C18:1ω6c, 15.9 %) and C12:0 (11.0 %). The polyamine profile showed putrescine as the predominant compound. The polar lipid profile consisted of diphosphatidylglycerol, phosphatidylglycerol, phosphatidylethanolamine, phosphatidylcholine, unknown lipids and unknown phosphatidylaminolipids. The phenotypic, phylogenetic and chemotaxonomic results consistently supported that strain I16B-00201T represented a novel species of the genus Noviherbaspirillum, for which the name Noviherbaspirillum album sp. nov. is proposed, with I16B-00201T (=CPCC 100848T=KCTC 52095T) designated as the type strain. Its DNA G+C content is 59.4 mol%. Pan-genome analysis indicated that some Noviherbaspirillum species possess diverse nitrogen and aromatic compound metabolism pathways, suggesting their potential value in pollutant treatment.


Subject(s)
Air Microbiology , Bacterial Typing Techniques , Base Composition , DNA, Bacterial , Fatty Acids , Nucleic Acid Hybridization , Phospholipids , Phylogeny , RNA, Ribosomal, 16S , Sequence Analysis, DNA , Ubiquinone , RNA, Ribosomal, 16S/genetics , Beijing , DNA, Bacterial/genetics , Fatty Acids/analysis , Phospholipids/analysis
20.
J Fungi (Basel) ; 10(6)2024 May 30.
Article in English | MEDLINE | ID: mdl-38921378

ABSTRACT

Candida auris is an emerging multidrug-resistant and opportunistic pathogenic yeast. Whole-genome sequencing analysis has defined five major clades, each from a distinct geographic region. The current study aimed to examine the genome of the C. auris 20-1498 strain, which is the first isolate of this fungus identified in Mexico. Based on whole-genome sequencing, the draft genome was found to contain 70 contigs. It had a total genome size of 12.86 Mbp, an N50 value of 1.6 Mbp, and an average guanine-cytosine (GC) content of 45.5%. Genome annotation revealed a total of 5432 genes encoding 5515 proteins. According to the genomic analysis, the C. auris 20-1498 strain belongs to clade IV (containing strains endemic to South America). Of the two genes (ERG11 and FKS1) associated with drug resistance in C. auris, a mutation was detected in K143R, a gene located in a mutation hotspot of ERG11 (lanosterol 14-α-demethylase), an antifungal drug target. The focus on whole-genome sequencing and the identification of mutations linked to the drug resistance of fungi could lead to the discovery of new therapeutic targets and new antifungal compounds.

SELECTION OF CITATIONS
SEARCH DETAIL