RESUMO
In 2015, sweetpotato producers in the United States experienced one of the worst outbreaks of black rot recorded in history, with up to 60% losses reported in the field and packing houses and at shipping ports. Host resistance remains the ideal management tool to decrease crop losses. Lack of knowledge of Ceratocystis fimbriata biology represents a critical barrier for the deployment of resistance to black rot in sweetpotato. In this study, we scanned the recent near chromosomal-level assembly for putative secreted effectors in the sweetpotato C. fimbriata isolate AS236 using a custom fungal effector annotation pipeline. We identified a set of 188 putative effectors on the basis of secretion signal and in silico prediction in EffectorP. We conducted a deep RNA time-course sequencing experiment to determine whether C. fimbriata modulates effectors in planta and to define a candidate list of effectors expressed during infection. We examined the expression profile of two C. fimbriata isolates, a pre-epidemic (1990s) isolate and a post-epidemic (2015) isolate. Our in planta expression profiling revealed clusters of co-expressed secreted effector candidates. Based on fold-change differences of putative effectors in both isolates and over the course of infection, we suggested prioritization of 31 effectors for functional characterization. Among this set, we identified several effectors that provide evidence for a marked biotrophic phase in C. fimbriata during infection of sweetpotato storage roots. Our study revealed a catalog of effector proteins that provide insight into C. fimbriata infection mechanisms and represent a core catalog to implement effector-assisted breeding in sweetpotato. [Formula: see text] Copyright © 2024 The Author(s). This is an open access article distributed under the CC BY-NC-ND 4.0 International license.
Assuntos
Ascomicetos , Ascomicetos/genética , Melhoramento Vegetal , Ceratocystis/genética , Sequência de BasesRESUMO
KEY MESSAGE: The C. roseus ZCTs are jasmonate-responsive, can be induced by CrMYC2a, and can act as significant regulators of the terpenoid indole alkaloid pathway when highly expressed. Catharanthus roseus is the sole known producer of the anti-cancer terpenoid indole alkaloids (TIAs), vinblastine and vincristine. While the enzymatic steps of the pathway have been elucidated, an understanding of its regulation is still emerging. The present study characterizes an important subgroup of Cys2-His2 zinc finger transcription factors known as Zinc finger Catharanthus Transcription factors (ZCTs). We identified three new ZCT members (named ZCT4, ZCT5, and ZCT6) that clustered with the putative repressors of the TIA pathway, ZCT1, ZCT2, and ZCT3. We characterized the role of these six ZCTs as potential redundant regulators of the TIA pathway, and their tissue-specific and jasmonate-responsive expression. These ZCTs share high sequence conservation in their two Cys2-His2 zinc finger domains but differ in the spacer length and sequence between these zinc fingers. The transient overexpression of ZCTs in seedlings significantly repressed the promoters of the terpenoid (pLAMT) and condensation branch (pSTR1) of the TIA pathway, consistent with that previously reported for ZCT1, ZCT2, and ZCT3. In addition, ZCTs significantly repressed and indirectly activated several promoters of the vindoline pathway (not previously studied). The ZCTs differed in their tissue-specific expression but similarly increased with jasmonate in a dosage-dependent manner (except for ZCT5). We showed significant activation of the pZCT1 and pZCT3 promoters by the de-repressed CrMYC2a, suggesting that the jasmonate-responsive expression of the ZCTs can be mediated by CrMYC2a. In summary, the C. roseus ZCTs are jasmonate-responsive, can be induced by CrMYC2a, and can act as significant regulators of the TIA pathway when highly expressed.
Assuntos
Catharanthus , Ciclopentanos , Regulação da Expressão Gênica de Plantas , Oxilipinas , Proteínas de Plantas , Fatores de Transcrição , Catharanthus/genética , Catharanthus/metabolismo , Proteínas de Plantas/genética , Proteínas de Plantas/metabolismo , Fatores de Transcrição/metabolismo , Fatores de Transcrição/genética , Oxilipinas/metabolismo , Oxilipinas/farmacologia , Ciclopentanos/metabolismo , Ciclopentanos/farmacologia , Dedos de Zinco CYS2-HIS2/genética , Plantas Geneticamente Modificadas , Alcaloides de Triptamina e Secologanina/metabolismo , Filogenia , Dedos de ZincoRESUMO
The marine microalgae Nannochloropsis oceanica (CCMP1779) is a prolific producer of oil and is considered a viable and sustainable resource for biofuel feedstocks. Nitrogen (N) availability has a strong impact on the physiological status and metabolism of microalgal cells, but the exact nature of this response is poorly understood. To fill this gap we performed transcriptomic profiling combined with cellular and molecular analyses of N. oceanica CCMP1779 during the transition from quiescence to autotrophy. N deprivation-induced quiescence was accompanied by a strong reorganization of the photosynthetic apparatus and changes in the lipid homeostasis, leading to accumulation of triacylglycerol. Cell cycle activation and re-establishment of photosynthetic activity observed in response to resupply of the growth medium with N were accompanied by a rapid degradation of triacylglycerol stored in lipid droplets (LDs). Besides observing LD translocation into vacuoles, we also provide evidence for direct interaction between the LD surface protein (NoLDSP) and AUTOPHAGY-RELATED8 (NoATG8) protein and show a role of microlipophagy in LD turnover in N. oceanica CCMP1779. This knowledge is crucial not only for understanding the fundamental mechanisms controlling the cellular energy homeostasis in microalgal cells but also for development of efficient strategies to achieve higher algal biomass and better microalgal lipid productivity.
Assuntos
Processos Autotróficos/genética , Microalgas/metabolismo , Nitrogênio/metabolismo , Nutrigenômica , Fotossíntese/genética , Estramenópilas/metabolismo , Triglicerídeos/metabolismo , Autofagia/genética , Autofagia/fisiologia , Família da Proteína 8 Relacionada à Autofagia/metabolismo , Processos Autotróficos/fisiologia , Ciclo Celular/genética , Ciclo Celular/fisiologia , Análise por Conglomerados , Ácidos Graxos/biossíntese , Ácidos Graxos/metabolismo , Perfilação da Expressão Gênica , Regulação da Expressão Gênica/genética , Regulação da Expressão Gênica/fisiologia , Ontologia Genética , Homeostase/genética , Homeostase/fisiologia , Gotículas Lipídicas/metabolismo , Gotículas Lipídicas/ultraestrutura , Metabolismo dos Lipídeos/genética , Metabolismo dos Lipídeos/fisiologia , Microalgas/genética , Microscopia Eletrônica de Transmissão , Família Multigênica , Fotossíntese/fisiologia , Estramenópilas/genética , Vacúolos/metabolismo , Vacúolos/ultraestruturaRESUMO
Isoprene synthase converts dimethylallyl diphosphate to isoprene and appears to be necessary and sufficient to allow plants to emit isoprene at significant rates. Isoprene can protect plants from abiotic stress but is not produced naturally by all plants; for example, Arabidopsis (Arabidopsis thaliana) and tobacco (Nicotiana tabacum) do not produce isoprene. It is typically present at very low concentrations, suggesting a role as a signaling molecule; however, its exact physiological role and mechanism of action are not fully understood. We transformed Arabidopsis with a Eucalyptus globulus isoprene synthase The regulatory mechanisms of photosynthesis and isoprene emission were similar to those of native emitters, indicating that regulation of isoprene emission is not specific to isoprene-emitting species. Leaf chlorophyll and carotenoid contents were enhanced by isoprene, which also had a marked positive effect on hypocotyl, cotyledon, leaf, and inflorescence growth in Arabidopsis. By contrast, leaf and stem growth was reduced in tobacco engineered to emit isoprene. Expression of genes belonging to signaling networks or associated with specific growth regulators (e.g. gibberellic acid that promotes growth and jasmonic acid that promotes defense) and genes that lead to stress tolerance was altered by isoprene emission. Isoprene likely executes its effects on growth and stress tolerance through direct regulation of gene expression. Enhancement of jasmonic acid-mediated defense signaling by isoprene may trigger a growth-defense tradeoff leading to variations in the growth response. Our data support a role for isoprene as a signaling molecule.
Assuntos
Alquil e Aril Transferases/genética , Arabidopsis/genética , Hemiterpenos/fisiologia , Nicotiana/genética , Estresse Fisiológico , Arabidopsis/efeitos dos fármacos , Arabidopsis/crescimento & desenvolvimento , Arabidopsis/metabolismo , Butadienos/farmacologia , Carotenoides/metabolismo , Clorofila/metabolismo , Eucalyptus/genética , Regulação da Expressão Gênica de Plantas , Hemiterpenos/biossíntese , Hemiterpenos/farmacologia , Fotossíntese , Folhas de Planta/genética , Folhas de Planta/crescimento & desenvolvimento , Folhas de Planta/metabolismo , Transdução de Sinais , Nicotiana/crescimento & desenvolvimento , Nicotiana/metabolismo , Transformação GenéticaRESUMO
Chloroplast size varies considerably in nature, but the underlying mechanisms are unknown. By exploiting a near-isogenic line population derived from a cross between the Arabidopsis (Arabidopsis thaliana) accessions Cape Verde Islands (Cvi-1), which has larger chloroplasts, and Landsberg erecta (Ler-0), with smaller chloroplasts, we determined that the large-chloroplast phenotype in Cvi-1 is associated with allelic variation in the gene encoding the chloroplast-division protein FtsZ2-2, a tubulin-related cytoskeletal component of the contractile FtsZ ring inside chloroplasts. Sequencing revealed that the Cvi-1 FtsZ2-2 allele encodes a C-terminally truncated protein lacking a region required for FtsZ2-2 interaction with inner-envelope proteins, and functional complementation experiments in a Columbia-0 ftsZ2-2 null mutant confirmed this allele as causal for the increased chloroplast size in Cvi-1. Comparison of FtsZ2-2 coding sequences in the 1001 Genomes database showed that the Cvi-1 allele is rare and identified additional rare loss-of-function alleles, including a natural null allele, in three other accessions, all of which had enlarged-chloroplast phenotypes. The ratio of nonsynonymous to synonymous substitutions was higher among the FtsZ2-2 genes than among the two other FtsZ family members in Arabidopsis, FtsZ2-1, a close paralog of FtsZ2-2, and the functionally distinct FtsZ1-1, indicating more relaxed constraint on the FtsZ2-2 coding sequence than on those of FtsZ2-1 or FtsZ1-1 Our results establish that allelic variation in FtsZ2-2 contributes to natural variation in chloroplast size in Arabidopsis, and they also demonstrate that natural variation in Arabidopsis can be used to decipher the genetic basis of differences in fundamental cell biological traits, such as organelle size.
Assuntos
Proteínas de Arabidopsis/metabolismo , Arabidopsis/metabolismo , Cloroplastos/metabolismo , Alelos , Arabidopsis/genética , Proteínas de Arabidopsis/genética , Cloroplastos/genética , Fases de Leitura Aberta/genéticaRESUMO
Recent studies have shown that one of the parental subgenomes in ancient polyploids is generally more dominant, having retained more genes and being more highly expressed, a phenomenon termed subgenome dominance. The genomic features that determine how quickly and which subgenome dominates within a newly formed polyploid remain poorly understood. To investigate the rate of emergence of subgenome dominance, we examined gene expression, gene methylation, and transposable element (TE) methylation in a natural, <140-year-old allopolyploid (Mimulus peregrinus), a resynthesized interspecies triploid hybrid (M. robertsii), a resynthesized allopolyploid (M. peregrinus), and progenitor species (M. guttatus and M. luteus). We show that subgenome expression dominance occurs instantly following the hybridization of divergent genomes and significantly increases over generations. Additionally, CHH methylation levels are reduced in regions near genes and within TEs in the first-generation hybrid, intermediate in the resynthesized allopolyploid, and are repatterned differently between the dominant and recessive subgenomes in the natural allopolyploid. Subgenome differences in levels of TE methylation mirror the increase in expression bias observed over the generations following hybridization. These findings provide important insights into genomic and epigenomic shock that occurs following hybridization and polyploid events and may also contribute to uncovering the mechanistic basis of heterosis and subgenome dominance.
Assuntos
Genoma de Planta , Hibridização Genética , Mimulus/genética , Poliploidia , Metilação de DNA/genética , Duplicação Gênica , Regulação da Expressão Gênica de Plantas , Filogenia , Especificidade da EspécieRESUMO
Few clades of plants have proven as difficult to classify as cacti. One explanation may be an unusually high level of convergent and parallel evolution (homoplasy). To evaluate support for this phylogenetic hypothesis at the molecular level, we sequenced the genomes of four cacti in the especially problematic tribe Pachycereeae, which contains most of the large columnar cacti of Mexico and adjacent areas, including the iconic saguaro cactus (Carnegiea gigantea) of the Sonoran Desert. We assembled a high-coverage draft genome for saguaro and lower coverage genomes for three other genera of tribe Pachycereeae (Pachycereus, Lophocereus, and Stenocereus) and a more distant outgroup cactus, Pereskia We used these to construct 4,436 orthologous gene alignments. Species tree inference consistently returned the same phylogeny, but gene tree discordance was high: 37% of gene trees having at least 90% bootstrap support conflicted with the species tree. Evidently, discordance is a product of long generation times and moderately large effective population sizes, leading to extensive incomplete lineage sorting (ILS). In the best supported gene trees, 58% of apparent homoplasy at amino sites in the species tree is due to gene tree-species tree discordance rather than parallel substitutions in the gene trees themselves, a phenomenon termed "hemiplasy." The high rate of genomic hemiplasy may contribute to apparent parallelisms in phenotypic traits, which could confound understanding of species relationships and character evolution in cacti.
Assuntos
Cactaceae/genética , Genoma de Planta/genética , Sequência de Bases , Evolução Molecular , Genômica/métodos , México , Modelos Genéticos , América do Norte , FilogeniaRESUMO
[This corrects the article DOI: 10.1371/journal.pgen.1003064.].
RESUMO
BACKGROUND: Gene is a key step in genome annotation. Ab initio gene prediction enables gene annotation of new genomes regardless of availability of homologous sequences. There exist a number of ab initio gene prediction tools and they have been widely used for gene annotation for various species. However, existing tools are not optimized for identifying genes with highly variable GC content. In addition, some genes in grass genomes exhibit a sharp 5 '- 3' decreasing GC content gradient, which is not carefully modeled by available gene prediction tools. Thus, there is still room to improve the sensitivity and accuracy for predicting genes with GC gradients. RESULTS: In this work, we designed and implemented a new hidden Markov model (HMM)-based ab initio gene prediction tool, which is optimized for finding genes with highly variable GC contents, such as the genes with negative GC gradients in grass genomes. We tested the tool on three datasets from Arabidopsis thaliana and Oryza sativa. The results showed that our tool can identify genes missed by existing tools due to the highly variable GC contents. CONCLUSIONS: GPRED-GC can effectively predict genes with highly variable GC contents without manual intervention. It provides a useful complementary tool to existing ones such as Augustus for more sensitive gene discovery. The source code is freely available at https://sourceforge.net/projects/gpred-gc/.
Assuntos
Composição de Bases , Genoma , Genômica , Anotação de Sequência Molecular , SoftwareRESUMO
BACKGROUND: Macrophomina phaseolina is a fungal plant pathogen with a broad host range, but one genotype was shown to exhibit host preference/specificity on strawberry. This pathogen lacked a high-quality genome assembly and annotation, and little was known about genomic differences among isolates from different hosts. RESULTS: We used PacBio sequencing and Hi-C scaffolding to provide nearly complete genome assemblies for M. phaseolina isolates representing the strawberry-specific genotype and another genotype recovered from alfalfa. The strawberry isolate had 59 contigs/scaffolds with an N50 of 4.3 Mb. The isolate from alfalfa had an N50 of 5.0 Mb and 14 nuclear contigs with half including telomeres. Both genomes were annotated with MAKER using transcript evidence generated in this study with over 13,000 protein-coding genes predicted. Unique groups of genes for each isolate were identified when compared to closely related fungal species. Structural comparisons between the isolates reveal large-scale rearrangements including chromosomal inversions and translocations. To include isolates representing a range of pathogen genotypes, an additional 30 isolates were sequenced with Illumina, assembled, and compared to the strawberry genotype assembly. Within the limits of comparing Illumina and PacBio assemblies, no conserved structural rearrangements were identified among the isolates from the strawberry genotype compared to those from other hosts, but some candidate genes were identified that were largely present in isolates of the strawberry genotype and absent in other genotypes. CONCLUSIONS: High-quality reference genomes of M. phaseolina have allowed for the identification of structural changes associated with a genotype that has a host preference toward strawberry and will enable future comparative genomics studies. Having more complete assemblies allows for structural rearrangements to be more fully assessed and ensures a greater representation of all the genes. Work with Illumina data from additional isolates suggests that some genes are predominately present in isolates of the strawberry genotype, but additional work is needed to confirm the role of these genes in pathogenesis. Additional work is also needed to complete the scaffolding of smaller contigs identified in the strawberry genotype assembly and to determine if unique genes in the strawberry genotype play a role in pathogenicity.
Assuntos
Ascomicetos/genética , Ascomicetos/fisiologia , Fragaria/microbiologia , Genômica , Especificidade de Hospedeiro/genética , Anotação de Sequência Molecular , Animais , Ascomicetos/isolamento & purificação , Rearranjo Gênico , Camundongos , Família Multigênica/genéticaRESUMO
Intense artificial selection over the last 100 years has produced elite maize (Zea mays) inbred lines that combine to produce high-yielding hybrids. To further our understanding of how genome and transcriptome variation contribute to the production of high-yielding hybrids, we generated a draft genome assembly of the inbred line PH207 to complement and compare with the existing B73 reference sequence. B73 is a founder of the Stiff Stalk germplasm pool, while PH207 is a founder of Iodent germplasm, both of which have contributed substantially to the production of temperate commercial maize and are combined to make heterotic hybrids. Comparison of these two assemblies revealed over 2500 genes present in only one of the two genotypes and 136 gene families that have undergone extensive expansion or contraction. Transcriptome profiling revealed extensive expression variation, with as many as 10,564 differentially expressed transcripts and 7128 transcripts expressed in only one of the two genotypes in a single tissue. Genotype-specific genes were more likely to have tissue/condition-specific expression and lower transcript abundance. The availability of a high-quality genome assembly for the elite maize inbred PH207 expands our knowledge of the breadth of natural genome and transcriptome variation in elite maize inbred lines across heterotic pools.
Assuntos
Genoma de Planta/genética , Transcriptoma/genética , Zea mays/genética , Perfilação da Expressão Gênica , Regulação da Expressão Gênica de Plantas/genética , Variação Genética/genéticaRESUMO
Nannochloropsis oceanica is an oleaginous microalga rich in ω3 long-chain polyunsaturated fatty acids (LC-PUFAs) content, in the form of eicosapentaenoic acid (EPA). We identified the enzymes involved in LC-PUFA biosynthesis in N. oceanica CCMP1779 and generated multigene expression vectors aiming at increasing LC-PUFA content in vivo. We isolated the cDNAs encoding four fatty acid desaturases (FAD) and determined their function by heterologous expression in S. cerevisiae. To increase the expression of multiple fatty acid desaturases in N. oceanica CCMP1779, we developed a genetic engineering toolkit that includes an endogenous bidirectional promoter and optimized peptide bond skipping 2A peptides. The toolkit also includes multiple epitopes for tagged fusion protein production and two antibiotic resistance genes. We applied this toolkit, towards building a gene stacking system for N. oceanica that consists of two vector series, pNOC-OX and pNOC-stacked. These tools for genetic engineering were employed to test the effects of the overproduction of one, two or three desaturase-encoding cDNAs in N. oceanica CCMP1779 and prove the feasibility of gene stacking in this genetically tractable oleaginous microalga. All FAD overexpressing lines had considerable increases in the proportion of LC-PUFAs, with the overexpression of Δ12 and Δ5 FAD encoding sequences leading to an increase in the final ω3 product, EPA.
Assuntos
Ácido Eicosapentaenoico/metabolismo , Ácidos Graxos Insaturados/metabolismo , Engenharia Genética/métodos , Ácidos Graxos Dessaturases/metabolismo , Microalgas/metabolismoRESUMO
BACKGROUND: Accurate structural annotation depends on well-trained gene prediction programs. Training data for gene prediction programs are often chosen randomly from a subset of high-quality genes that ideally represent the variation found within a genome. One aspect of gene variation is GC content, which differs across species and is bimodal in grass genomes. When gene prediction programs are trained on a subset of grass genes with random GC content, they are effectively being trained on two classes of genes at once, and this can be expected to result in poor results when genes are predicted in new genome sequences. RESULTS: We find that gene prediction programs trained on grass genes with random GC content do not completely predict all grass genes with extreme GC content. We show that gene prediction programs that are trained with grass genes with high or low GC content can make both better and unique gene predictions compared to gene prediction programs that are trained on genes with random GC content. By separately training gene prediction programs with genes from multiple GC ranges and using the programs within the MAKER genome annotation pipeline, we were able to improve the annotation of the Oryza sativa genome compared to using the standard MAKER annotation protocol. Gene structure was improved in over 13% of genes, and 651 novel genes were predicted by the GC-specific MAKER protocol. CONCLUSIONS: We present a new GC-specific MAKER annotation protocol to predict new and improved gene models and assess the biological significance of this method in Oryza sativa. We expect that this protocol will also be beneficial for gene prediction in any organism with bimodal or other unusual gene GC content.
Assuntos
Genoma de Planta , Anotação de Sequência Molecular/métodos , Oryza/genética , Composição de Bases , Cadeias de Markov , RNA de Plantas/química , RNA de Plantas/isolamento & purificação , RNA de Plantas/metabolismo , Ribossomos/metabolismo , Análise de Sequência de RNARESUMO
The tropane alkaloids, hyoscyamine and scopolamine, are medicinal compounds that are the active components of several therapeutics. Hyoscyamine and scopolamine are synthesized in the roots of specific genera of the Solanaceae in a multistep pathway that is only partially elucidated. To facilitate greater understanding of tropane alkaloid biosynthesis, a de novo transcriptome assembly was developed for Deadly Nightshade (Atropa belladonna). Littorine is a key intermediate in hyoscyamine and scopolamine biosynthesis that is produced by the condensation of tropine and phenyllactic acid. Phenyllactic acid is derived from phenylalanine via its transamination to phenylpyruvate, and mining of the transcriptome identified a phylogenetically distinct aromatic amino acid aminotransferase (ArAT), designated Ab-ArAT4, that is coexpressed with known tropane alkaloid biosynthesis genes in the roots of A. belladonna. Silencing of Ab-ArAT4 disrupted synthesis of hyoscyamine and scopolamine through reduction of phenyllactic acid levels. Recombinant Ab-ArAT4 preferentially catalyzes the first step in phenyllactic acid synthesis, the transamination of phenylalanine to phenylpyruvate. However, rather than utilizing the typical keto-acid cosubstrates, 2-oxoglutarate, pyruvate, and oxaloacetate, Ab-ArAT4 possesses strong substrate preference and highest activity with the aromatic keto-acid, 4-hydroxyphenylpyruvate. Thus, Ab-ArAT4 operates at the interface between primary and specialized metabolism, contributing to both tropane alkaloid biosynthesis and the direct conversion of phenylalanine to tyrosine.
Assuntos
Atropa belladonna/enzimologia , Vias Biossintéticas , Fenilalanina/metabolismo , Ácidos Fenilpirúvicos/metabolismo , Raízes de Plantas/enzimologia , Transaminases/metabolismo , Tropanos/metabolismo , Atropa belladonna/genética , Vias Biossintéticas/genética , Simulação por Computador , Regulação da Expressão Gênica de Plantas , Inativação Gênica , Cinética , Anotação de Sequência Molecular , Dados de Sequência Molecular , Família Multigênica , Filogenia , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Análise de Sequência de RNA , Transaminases/genética , Transcriptoma/genética , Tropanos/químicaRESUMO
The medicinal plant Madagascar periwinkle, Catharanthus roseus (L.) G. Don, produces hundreds of biologically active monoterpene-derived indole alkaloid (MIA) metabolites and is the sole source of the potent, expensive anti-cancer compounds vinblastine and vincristine. Access to a genome sequence would enable insights into the biochemistry, control, and evolution of genes responsible for MIA biosynthesis. However, generation of a near-complete, scaffolded genome is prohibitive to small research communities due to the expense, time, and expertise required. In this study, we generated a genome assembly for C. roseus that provides a near-comprehensive representation of the genic space that revealed the genomic context of key points within the MIA biosynthetic pathway including physically clustered genes, tandem gene duplication, expression sub-functionalization, and putative neo-functionalization. The genome sequence also facilitated high resolution co-expression analyses that revealed three distinct clusters of co-expression within the components of the MIA pathway. Coordinated biosynthesis of precursors and intermediates throughout the pathway appear to be a feature of vinblastine/vincristine biosynthesis. The C. roseus genome also revealed localization of enzyme-rich genic regions and transporters near known biosynthetic enzymes, highlighting how even a draft genome sequence can empower the study of high-value specialized metabolites.
Assuntos
Produtos Biológicos/metabolismo , Catharanthus/metabolismo , Regulação da Expressão Gênica de Plantas , Genoma de Planta/genética , Vimblastina/metabolismoRESUMO
BACKGROUND: The cyclic peptide toxins of Amanita mushrooms, such as α-amanitin and phalloidin, are encoded by the "MSDIN" gene family and ribosomally biosynthesized. Based on partial genome sequence and PCR analysis, some members of the MSDIN family were previously identified in Amanita bisporigera, and several other members are known from other species of Amanita. However, the complete complement in any one species, and hence the genetic capacity for these fungi to make cyclic peptides, remains unknown. RESULTS: Draft genome sequences of two cyclic peptide-producing mushrooms, the "Death Cap" A. phalloides and the "Destroying Angel" A. bisporigera, were obtained. Each species has ~30 MSDIN genes, most of which are predicted to encode unknown cyclic peptides. Some MSDIN genes were duplicated in one or the other species, but only three were common to both species. A gene encoding cycloamanide B, a previously described nontoxic cyclic heptapeptide, was also present in A. phalloides, but genes for antamanide and cycloamanides A, C, and D were not. In A. bisporigera, RNA expression was observed for 20 of the MSDIN family members. Based on their predicted sequences, novel cyclic peptides were searched for by LC/MS/MS in extracts of A. phalloides. The presence of two cyclic peptides, named cycloamanides E and F with structures cyclo(SFFFPVP) and cyclo(IVGILGLP), was thereby demonstrated. Of the MSDIN genes reported earlier from another specimen of A. bisporigera, 9 of 14 were not found in the current genome assembly. Differences between previous and current results for the complement of MSDIN genes and cyclic peptides in the two fungi probably represents natural variation among geographically dispersed isolates of A. phalloides and among the members of the poorly defined A. bisporigera species complex. Both A. phalloides and A. bisporigera contain two prolyl oligopeptidase genes, one of which (POPB) is probably dedicated to cyclic peptide biosynthesis as it is in Galerina marginata. CONCLUSION: The MSDIN gene family has expanded and diverged rapidly in Amanita section Phalloideae. Together, A. bisporigera and A. phalloides are predicted to have the capacity to make more than 50 cyclic hexa-, hepta-, octa-, nona- and decapeptides.
Assuntos
Agaricales/genética , Peptídeos Cíclicos/genética , Toxinas Biológicas/genética , Sequência de Aminoácidos , Biologia Computacional/métodos , Sequência Conservada , DNA Espaçador Ribossômico , Perfilação da Expressão Gênica , Genoma Fúngico , Genômica/métodos , Anotação de Sequência Molecular , Família Multigênica , Peptídeos Cíclicos/química , Toxinas Biológicas/química , TranscriptomaRESUMO
BACKGROUND: Switchgrass (Panicum virgatum L.) is a warm-season perennial grass that can be used as a second generation bioenergy crop. However, foliar fungal pathogens, like switchgrass rust, have the potential to significantly reduce switchgrass biomass yield. Despite its importance as a prominent bioenergy crop, a genome-wide comprehensive analysis of NB-LRR disease resistance genes has yet to be performed in switchgrass. RESULTS: In this study, we used a homology-based computational approach to identify 1011 potential NB-LRR resistance gene homologs (RGHs) in the switchgrass genome (v 1.1). In addition, we identified 40 RGHs that potentially contain unique domains including major sperm protein domain, jacalin-like binding domain, calmodulin-like binding, and thioredoxin. RNA-sequencing analysis of leaf tissue from 'Alamo', a rust-resistant switchgrass cultivar, and 'Dacotah', a rust-susceptible switchgrass cultivar, identified 2634 high quality variants in the RGHs between the two cultivars. RNA-sequencing data from field-grown cultivar 'Summer' plants indicated that the expression of some of these RGHs was developmentally regulated. CONCLUSIONS: Our results provide useful insight into the molecular structure, distribution, and expression patterns of members of the NB-LRR gene family in switchgrass. These results also provide a foundation for future work aimed at elucidating the molecular mechanisms underlying disease resistance in this important bioenergy crop.
Assuntos
Resistência à Doença/genética , Perfilação da Expressão Gênica , Genes de Plantas , Estudos de Associação Genética , Panicum/genética , Alelos , Sequência de Aminoácidos , Biologia Computacional/métodos , Bases de Dados de Ácidos Nucleicos , Predisposição Genética para Doença , Genoma de Planta , Genômica/métodos , Panicum/classificação , Filogenia , Polimorfismo de Nucleotídeo Único , Matrizes de Pontuação de Posição Específica , Domínios e Motivos de Interação entre Proteínas/genética , Reprodutibilidade dos TestesRESUMO
The large size and relative complexity of many plant genomes make creation, quality control, and dissemination of high-quality gene structure annotations challenging. In response, we have developed MAKER-P, a fast and easy-to-use genome annotation engine for plants. Here, we report the use of MAKER-P to update and revise the maize (Zea mays) B73 RefGen_v3 annotation build (5b+) in less than 3 h using the iPlant Cyberinfrastructure. MAKER-P identified and annotated 4,466 additional, well-supported protein-coding genes not present in the 5b+ annotation build, added additional untranslated regions to 1,393 5b+ gene models, identified 2,647 5b+ gene models that lack any supporting evidence (despite the use of large and diverse evidence data sets), identified 104,215 pseudogene fragments, and created an additional 2,522 noncoding gene annotations. We also describe a method for de novo training of MAKER-P for the annotation of newly sequenced grass genomes. Collectively, these results lead to the 6a maize genome annotation and demonstrate the utility of MAKER-P for rapid annotation, management, and quality control of grasses and other difficult-to-annotate plant genomes.
Assuntos
Genes de Plantas/genética , Genoma de Planta/genética , Anotação de Sequência Molecular/métodos , Zea mays/genética , Bases de Dados Genéticas/normas , Éxons/genética , Íntrons/genética , Modelos Genéticos , Anotação de Sequência Molecular/normas , Pseudogenes/genética , Controle de Qualidade , RNA não Traduzido/genéticaRESUMO
KEY MESSAGE: SrTA10187 was fine-mapped to a 1.1 cM interval, candidate genes were identified in the region of interest, and molecular markers were developed for marker-assisted selection and Sr gene pyramiding. Stem rust (Puccinia graminis f. sp. tritici, Pgt) races belonging to the Ug99 (TTKSK) race group pose a serious threat to global wheat (Triticum aestivum L.) production. To improve Pgt host resistance, the Ug99-effective resistance gene SrTA10187 previously identified in Aegilops tauschii Coss. was introgressed into wheat, and mapped to the short arm of wheat chromosome 6D. In this study, high-resolution mapping of SrTA10187 was done using a population of 1,060 plants. Pgt resistance was screened using race QFCSC. PCR-based SNP and STS markers were developed from genotyping-by-sequencing tags and SNP sequences available in online databases. SrTA10187 segregated as expected in a 3:1 ratio of resistant to susceptible individuals in three out of six BC3F2 families, and was fine-mapped to a 1.1 cM region on wheat chromosome 6DS. Marker context sequence was aligned to the reference Ae. tauschii genome to identify the physical region encompassing SrTA10187. Due to the size of the corresponding region, candidate disease resistance genes could not be identified with confidence. Comparisons with the Ae. tauschii genetic map developed by Luo et al. (PNAS 110(19):7940-7945, 2013) enabled identification of a discrete genetic locus and a BAC minimum tiling path of the region spanning SrTA10187. Annotation of pooled BAC library sequences led to the identification of candidate genes in the region of interest-including a single NB-ARC-LRR gene. The shorter genetic interval and flanking KASP™ and STS markers developed in this study will facilitate marker-assisted selection, gene pyramiding, and positional cloning of SrTA10187.
Assuntos
Mapeamento Cromossômico , Resistência à Doença/genética , Genes de Plantas , Doenças das Plantas/genética , Triticum/genética , Basidiomycota , Cromossomos de Plantas , Ligação Genética , Fenótipo , Mapeamento Físico do Cromossomo , Doenças das Plantas/microbiologia , Poaceae/genética , Polimorfismo de Nucleotídeo Único , Sitios de Sequências Rotuladas , Triticum/microbiologiaRESUMO
Switchgrass (Panicum virgatum) is a polyploid, outcrossing grass species native to North America and has recently been recognized as a potential biofuel feedstock crop. Significant phenotypic variation including ploidy is present across the two primary ecotypes of switchgrass, referred to as upland and lowland switchgrass. The tetraploid switchgrass genome is approximately 1400 Mbp, split between two subgenomes, with significant repetitive sequence content limiting the efficiency of re-sequencing approaches for determining genome diversity. To characterize genetic diversity in upland and lowland switchgrass as a first step in linking genotype to phenotype, we designed an exome capture probe set based on transcript assemblies that represent approximately 50 Mb of annotated switchgrass exome sequences. We then evaluated and optimized the probe set using solid phase comparative genome hybridization and liquid phase exome capture followed by next-generation sequencing. Using the optimized probe set, we assessed variation in the exomes of eight switchgrass genotypes representing tetraploid lowland and octoploid upland cultivars to benchmark our exome capture probe set design. We identified ample variation in the switchgrass genome including 1,395,501 single nucleotide polymorphisms (SNPs), 8173 putative copy number variants and 3336 presence/absence variants. While the majority of the SNPs (84%) detected was bi-allelic, a substantial number was tri-allelic with limited occurrence of tetra-allelic polymorphisms consistent with the heterozygous and polyploid nature of the switchgrass genome. Collectively, these data demonstrate the efficacy of exome capture for discovery of genome variation in a polyploid species with a large, repetitive and heterozygous genome.