RESUMO
The majority of pharmaceuticals are derived from natural products, bioactive compounds naturally synthesized by organisms to provide evolutionary advantages. Although the rich evolutionary history of eukaryotic algal species implicates a high potential for natural product-based drug discovery, it remains largely untouched. This study investigates 2762 putative biosynthetic gene clusters (BGCs) from 212 eukaryotic algal genomes. To analyze a vast set of structurally diverse BGCs, we employed comparative analysis based on the vectorization of biosynthetic domains, referred to as biosynthetic domain architecture (BDA). By characterizing core biosynthetic machineries through BDA, we identified key BDAs of modular BGCs in diverse eukaryotes and introduced 16 candidate modular BGCs with similar BDAs to previously validated BGCs. This study provides a global characterization of eukaryotic algal BGCs, offering an alternative to laborious manual curation for BGC prioritization.
Assuntos
Eucariotos , Família Multigênica , Eucariotos/genética , Vias Biossintéticas/genéticaRESUMO
Despite a continuous increase in algal genome sequencing, structural annotations of most algal genome assemblies remain unavailable. This pervasive scarcity of genome annotation has restricted rigorous investigation of these genomic resources and may have precipitated misleading biological interpretations. However, the annotation process for eukaryotic algal species is often challenging as genomic resources and transcriptomic evidence are not always available. To address this challenge, we benchmark the cutting-edge gene prediction methods that can be generalized for a broad range of non-model eukaryotes. Using the most accurate methods selected based on high-quality algal genomes, we predict structural annotations for 135 unannotated algal genomes. Using previously available genomic data pooled together with new data obtained in this study, we identified the core orthologous genes and the multi-gene phylogeny of eukaryotic algae, including of previously unexplored algal species. This study not only provides a benchmark for the use of structural annotation methods on a variety of non-model eukaryotes, but also compensates for missing data in the current spectrum of algal genomic resources. These results bring us one step closer to the full potential of eukaryotic algal genomics.
Assuntos
Eucariotos , Células Eucarióticas , Eucariotos/genética , Genômica/métodos , Genoma , Anotação de Sequência MolecularRESUMO
Species of Nannochloropsis are single-celled Stramenopiles commonly used in microalgae-based technologies for the manufacturing of bioproducts. Nannochloropsis oceanica QH25 was isolated from an algal cultivation pond located in Imperial, Texas (USA). We used PacBio continuous long read (CLR) sequencing to produce a highly contiguous 29.34 Mb genome.
RESUMO
High-quality genome sequences were generated for the nonaxenic marine microalga Nannochloris desiccata UTEX 2437 and eight of its associated environmental bacterial species. N. desiccata UTEX 2437 is diploid, and its 20.738-Mbp nuclear genome sequence is assembled in 29 contigs.
RESUMO
Microalgae are diverse, with many gaps remaining in phylogenetic and physiological understanding. Thus, studying new microalgae species increases our broader comprehension of biological diversity, and evaluation of new candidates as algal production platforms can lead to improved productivity under a variety of cultivation conditions. Chlorella is a genus of fast-growing species often isolated from freshwater habitats and cultivated as a source of nutritional supplements. However, the use of freshwater increases competition with other freshwater needs. We identified Chlorella desiccata to be worthy of further investigation as a potential algae production strain, due to its isolation from a marine environment and its promising growth and biochemical composition properties. Long-read genomic sequencing was conducted for C. desiccata UTEX 2526, resulting in a high-quality, near chromosome level, diploid genome with an assembly length of 21.55 Mbp in only 18 contigs. We also report complete circular mitochondrial and chloroplast genomes. Phylogenomic and phylogenetic analyses using nuclear, chloroplast, 18S rRNA, and actin sequences revealed that this species clades within strains currently identified as Nannochloris (Trebouxiophyceae, Chlorophyta), leading to its reclassification as Nannochloris sp. "desiccata" UTEX 2526. The mode of cell division for this species is autosporulation, differing from the type species N. bacillaris. As has occurred across multiple microalgae genera, there are repeated examples of Nannochloris species reclassification in the literature. This high-quality genome assembly and phylogenetic analysis of the potential algal production strain Nannochloris sp. "desiccata" UTEX 2526 provides an important reference and useful tool for further studying this region of the phylogenetic tree.
Assuntos
Chlorella , Genoma de Cloroplastos , Microalgas , Chlorella/genética , Microalgas/genética , Filogenia , RNA Ribossômico 18S/genéticaRESUMO
Bushmeat, the meat and organs derived from wildlife species, is a common source of animal protein in the diets of those living in sub-Saharan Africa and is frequently associated with zoonotic spillover of dangerous pathogens. Given the frequent consumption of bushmeat in this region and the lack of knowledge about the microbial communities associated with this meat, the microbiome of 56 fresh and processed bushmeat samples ascertained from three districts in the Western Serengeti ecosystem in Tanzania was characterized using 16S rRNA metagenomic sequencing. The results show that the most abundant phyla present in bushmeat samples include Firmicutes (67.8%), Proteobacteria (18.4%), Cyanobacteria (8.9%), and Bacteroidetes (3.1%). Regardless of wildlife species, sample condition, season, or region, the microbiome is diverse across all samples, with no significant difference in alpha or beta diversity. The findings also suggest the presence of DNA signatures of potentially dangerous zoonotic pathogens, including those from the genus Bacillus, Brucella, Coxiella, and others, in bushmeat. Together, this investigation provides a better understanding of the microbiome associated with this major food source in samples collected from the Western Serengeti in Tanzania and highlights a need for future investigations on the potential health risks associated with the harvesting, trade, and consumption of bushmeat in Sub-Saharan Africa.
Assuntos
Animais Selvagens/microbiologia , Carne/microbiologia , Animais , Bactérias/genética , Bactérias/isolamento & purificação , Ecossistema , Humanos , Carne/provisão & distribuição , Microbiota , RNA Ribossômico 16S/genética , Tanzânia , Zoonoses/etiologia , Zoonoses/microbiologiaRESUMO
Microalgae are promising biocatalysts for applications in sustainable fuel, food, and chemical production. Here, we describe culture collection screening, down-selection, and development of a high-productivity, halophilic, thermotolerant microalga, Picochlorum renovo. This microalga displays a rapid growth rate and high diel biomass productivity (34 g m-2 day-1), with a composition well-suited for downstream processing. P. renovo exhibits broad salinity tolerance (growth at 107.5 g L-1 salinity) and thermotolerance (growth up to 40 °C), beneficial traits for outdoor cultivation. We report complete genome sequencing and analysis, and genetic tool development suitable for expression of transgenes inserted into the nuclear or chloroplast genomes. We further evaluate mechanisms of halotolerance via comparative transcriptomics, identifying novel genes differentially regulated in response to high salinity cultivation. These findings will enable basic science inquiries into control mechanisms governing Picochlorum biology and lay the foundation for development of a microalga with industrially relevant traits as a model photobiology platform.
Assuntos
Clorófitas/metabolismo , Microalgas/metabolismo , Biocatálise , Biomassa , Biotecnologia , Clorófitas/genética , Clorófitas/crescimento & desenvolvimento , Perfilação da Expressão Gênica , Engenharia Genética , Genoma de Cloroplastos , Genoma Microbiano , Microbiologia Industrial/métodos , Microalgas/genética , Microalgas/crescimento & desenvolvimento , Processos Fototróficos , Tolerância ao Sal/genética , Termotolerância/genéticaRESUMO
The sequenced genome and the leaf transcriptome of a near relative of Abrus pulchellus and Abrus precatorius was analyzed to characterize the genetic basis of toxin gene expression. From the high-quality genome assembly, a total of 26 potential coding regions were identified that contain genes with abrin-like, pulchellin-like, and agglutinin-like homology, with full-length transcripts detected in leaf tissue for 9 of the 26 coding regions. All of the toxin-like genes were identified within only five isolated regions of the genome, with each region containing 1 to 16 gene variants within each genomic region (<1 Mbp). The Abrusprecatorius cultivar sequenced here contains genes which encode for proteins that are homologous to certain abrin and prepropulchellin genes previously identified, and we observed substantial diversity of genes and predicted gene products in Abrus precatorius and previously characterized toxins. This suggests diverse toxin repertoires within Abrus, potentially the results of rapid toxin evolution.
Assuntos
Abrina/genética , Abrus/genética , Genoma de Planta/genética , DNA de Plantas/química , DNA de Plantas/genética , Filogenia , Folhas de Planta/química , Lectinas de Plantas , Toxinas Biológicas , Transcriptoma , Sequenciamento Completo do GenomaRESUMO
A high-quality draft genome sequence of the microalgal species Tetraselmis striata was generated using PacBio sequencing. The assembled genome is 228 Mb, derived from 3,613 polished contigs at 84× coverage depth. This genome contains an average GC content of 57.9% and 48,906 predicted genes.
RESUMO
This study identified 35 new sites for targeted transgene insertion that have the potential to serve as new human genomic "safe harbor" sites (SHS). SHS potential for these 35 sites, located on 16 chromosomes, including both arms of the human X chromosome, and for the existing human SHS AAVS1, hROSA26, and CCR5 was assessed using eight different desirable, widely accepted criteria for SHS verifiable with human genomic data. Three representative newly identified sites on human chromosomes 2 and 4 were then experimentally validated by in vitro and in vivo cleavage-sensitivity tests, and analyzed for population-level and cell line-specific sequence variants that might confound site targeting. The highly ranked site on chromosome 4 (SHS231) was further characterized by targeted homology-dependent and -independent transgene insertion and expression in different human cell lines. The structure and fidelity of transgene insertions at this site were confirmed, together with analyses that demonstrated stable expression and function of transgene-encoded proteins, including fluorescent protein markers, selectable marker cassettes, and Cas9 protein variants. SHS-integrated transgene-encoded Cas9 proteins were shown to be capable of introducing a large (17 kb) gRNA-specified deletion in the PAX3/FOXO1 fusion oncogene in human rhabdomyosarcoma cells and as a Cas9-VPR fusion protein to upregulate expression of the muscle-specific transcription factor MYF5 in human rhabdomyosarcoma cells. An engineering "toolkit" was developed to enable easy use of the most extensively characterized of these new human sites, SHS231, located on the proximal long arm of chromosome 4. The target sites identified here have the potential to serve as additional human SHS to enable basic and clinical gene editing and genome-engineering applications.
Assuntos
Cromossomos Humanos , Mutagênese Insercional , Transgenes , Sequência de Bases , Sistemas CRISPR-Cas , Linhagem Celular , Pontos de Quebra do Cromossomo , Edição de Genes , Expressão Gênica , Técnicas de Introdução de Genes , Marcação de Genes , Loci Gênicos , Genoma Humano , Mapeamento Geográfico , HumanosRESUMO
To understand how complex genetic networks perform and regulate diverse cellular processes, the function of each individual component must be defined. Comprehensive phenotypic studies of mutant alleles have been successful in model organisms in determining what processes depend on the normal function of a gene. These results are often ported to newly sequenced genomes by using sequence homology. However, sequence similarity does not always mean identical function or phenotype, suggesting that new methods are required to functionally annotate newly sequenced species. We have implemented comparative analysis by high-throughput experimental testing of gene dispensability in Saccharomyces uvarum, a sister species of Saccharomyces cerevisiae. We created haploid and heterozygous diploid Tn7 insertional mutagenesis libraries in S. uvarum to identify species-dependent essential genes, with the goal of detecting genes with divergent functions and/or different genetic interactions. Comprehensive gene dispensability comparisons with S. cerevisiae predicted diverged dispensability at 12% of conserved orthologs, and validation experiments confirmed 22 differentially essential genes. Despite their differences in essentiality, these genes were capable of cross-species complementation, demonstrating that trans-acting factors that are background-dependent contribute to differential gene essentiality. This study shows that direct experimental testing of gene disruption phenotypes across species can inform comparative genomic analyses and improve gene annotations. Our method can be widely applied in microorganisms to further our understanding of genome evolution.
Assuntos
Elementos de DNA Transponíveis/genética , Regulação Fúngica da Expressão Gênica , Genes Essenciais , Saccharomyces/genética , Ativação Transcricional , Proteínas Fúngicas/genética , Proteínas Fúngicas/metabolismo , Mutagênese , Especificidade da Espécie , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismoRESUMO
Picochlorum soloecismus is a halotolerant, fast-growing, and moderate-lipid-producing microalga that is being evaluated as a renewable feedstock for biofuel production. Herein, we report on an improved high-quality draft assembly and annotation for the nuclear, chloroplast, and mitochondrial genomes of P. soloecismus DOE 101.
RESUMO
The microalgal division Haptophyta uses a range of nutritional sourcing, including mixotrophy. The genome of a member of this taxon, Chrysochromulina tobin, suggests that interactions with its bacterial cohort are critical for C. tobin physiology. Here, we report the genomes of eight bacterial species in coculture with C. tobin.
RESUMO
Haptophytes are recognized as seminal players in aquatic ecosystem function. These algae are important in global carbon sequestration, form destructive harmful blooms, and given their rich fatty acid content, serve as a highly nutritive food source to a broad range of eco-cohorts. Haptophyte dominance in both fresh and marine waters is supported by the mixotrophic nature of many taxa. Despite their importance the nuclear genome sequence of only one haptophyte, Emiliania huxleyi (Isochrysidales), is available. Here we report the draft genome sequence of Chrysochromulina tobin (Prymnesiales), and transcriptome data collected at seven time points over a 24-hour light/dark cycle. The nuclear genome of C. tobin is small (59 Mb), compact (â¼ 40% of the genome is protein coding) and encodes approximately 16,777 genes. Genes important to fatty acid synthesis, modification, and catabolism show distinct patterns of expression when monitored over the circadian photoperiod. The C. tobin genome harbors the first hybrid polyketide synthase/non-ribosomal peptide synthase gene complex reported for an algal species, and encodes potential anti-microbial peptides and proteins involved in multidrug and toxic compound extrusion. A new haptophyte xanthorhodopsin was also identified, together with two "red" RuBisCO activases that are shared across many algal lineages. The Chrysochromulina tobin genome sequence provides new information on the evolutionary history, ecology and economic importance of haptophytes.
Assuntos
Aptidão Genética , Genoma/genética , Haptófitas/genética , Ribulose-Bifosfato Carboxilase/genética , Sequência de Bases , Perfilação da Expressão Gênica , Anotação de Sequência Molecular , Filogenia , Análise de Sequência de DNARESUMO
BACKGROUND: Haptophytes are widely and abundantly distributed in both marine and freshwater ecosystems. Few genomic analyses of representatives within this taxon have been reported, despite their early evolutionary origins and their prominent role in global carbon fixation. RESULTS: The complete mitochondrial and chloroplast genome sequences of the haptophyte Chrysochromulina tobin (Prymnesiales) provide insight into the architecture and gene content of haptophyte organellar genomes. The mitochondrial genome (~34 kb) encodes 21 protein coding genes and contains a complex, 9 kb tandem repeat region. Similar to other haptophytes and rhodophytes, but not cryptophytes or stramenopiles, the mitochondrial genome has lost the nad7, nad9 and nad11 genes. The ~105 kb chloroplast genome encodes 112 protein coding genes, including ycf39 which has strong structural homology to NADP-binding nitrate transcriptional regulators; a divergent 'CheY-like' two-component response regulator (ycf55) and Tic/Toc (ycf60 and ycf80) membrane transporters. Notably, a zinc finger domain has been identified in the rpl36 ribosomal protein gene of all chloroplasts sequenced to date with the exception of haptophytes and cryptophytes--algae that have gained (via lateral gene transfer) an alternative rpl36 lacking the zinc finger motif. The two C. tobin chloroplast ribosomal RNA operon spacer regions differ in tRNA content. Additionally, each ribosomal operon contains multiple single nucleotide polymorphisms (SNPs)--a pattern observed in rhodophytes and cryptophytes, but few stramenopiles. Analysis of small (<200 bp) chloroplast encoded tandem and inverted repeats in C. tobin and 78 other algal chloroplast genomes show that repeat type, size and location are correlated with gene identity and taxonomic clade. CONCLUSION: The Chrysochromulina tobin organellar genomes provide new insight into organellar function and evolution. These are the first organellar genomes to be determined for the prymnesiales, a taxon that is present in both oceanic and freshwater systems and represents major primary photosynthetic producers and contributors to global ecosystem stability.
Assuntos
Genoma de Cloroplastos , Genoma Mitocondrial , Haptófitas/genética , Mapeamento Cromossômico , Sequência Conservada , Proteínas de Membrana Transportadoras/genética , Modelos Moleculares , Fases de Leitura Aberta , Óperon , Filogenia , Sequências Repetitivas de Ácido Nucleico , Proteínas Ribossômicas/genética , Análise de Sequência de DNA , Transdução de Sinais , Homologia Estrutural de ProteínaRESUMO
Homing endonucleases (HEs) promote the evolutionary persistence of selfish DNA elements by catalyzing element lateral transfer into new host organisms. The high site specificity of this lateral transfer reaction, termed homing, reflects both the length (14-40 bp) and the limited tolerance of target or homing sites for base pair changes. In order to better understand molecular determinants of homing, we systematically determined the binding and cleavage properties of all single base pair variant target sites of the canonical LAGLIDADG homing endonucleases I-CreI and I-MsoI. These Chlorophyta algal HEs have very similar three-dimensional folds and recognize nearly identical 22 bp target sites, but use substantially different sets of DNA-protein contacts to mediate site-specific recognition and cleavage. The site specificity differences between I-CreI and I-MsoI suggest different evolutionary strategies for HE persistence. These differences also provide practical guidance in target site finding, and in the generation of HE variants with high site specificity and cleavage activity, to enable genome engineering applications.
Assuntos
Clivagem do DNA , Enzimas de Restrição do DNA/metabolismo , Endodesoxirribonucleases/metabolismo , Evolução Molecular , Pareamento de Bases , DNA/química , DNA/metabolismo , Enzimas de Restrição do DNA/química , Endodesoxirribonucleases/química , Engenharia Genética , Genômica , Humanos , Especificidade por SubstratoRESUMO
Genetic methods of manipulating or eradicating disease vector populations have long been discussed as an attractive alternative to existing control measures because of their potential advantages in terms of effectiveness and species specificity. The development of genetically engineered malaria-resistant mosquitoes has shown, as a proof of principle, the possibility of targeting the mosquito's ability to serve as a disease vector. The translation of these achievements into control measures requires an effective technology to spread a genetic modification from laboratory mosquitoes to field populations. We have suggested previously that homing endonuclease genes (HEGs), a class of simple selfish genetic elements, could be exploited for this purpose. Here we demonstrate that a synthetic genetic element, consisting of mosquito regulatory regions and the homing endonuclease gene I-SceI, can substantially increase its transmission to the progeny in transgenic mosquitoes of the human malaria vector Anopheles gambiae. We show that the I-SceI element is able to invade receptive mosquito cage populations rapidly, validating mathematical models for the transmission dynamics of HEGs. Molecular analyses confirm that expression of I-SceI in the male germline induces high rates of site-specific chromosomal cleavage and gene conversion, which results in the gain of the I-SceI gene, and underlies the observed genetic drive. These findings demonstrate a new mechanism by which genetic control measures can be implemented. Our results also show in principle how sequence-specific genetic drive elements like HEGs could be used to take the step from the genetic engineering of individuals to the genetic engineering of populations.