ABSTRACT
The development of perennial crops holds great promise for sustainable agriculture and food security. However, the evolution of the transition between perenniality and annuality is poorly understood. Here, using two Brassicaceae species, Crucihimalaya himalaica and Erysimum nevadense, as polycarpic perennial models, we reveal that the transition from polycarpic perennial to biennial and annual flowering behavior is a continuum determined by the dosage of three closely related MADS-box genes. Diversification of the expression patterns, functional strengths, and combinations of these genes endows species with the potential to adopt various life-history strategies. Remarkably, we find that a single gene among these three is sufficient to convert winter-annual or annual Brassicaceae plants into polycarpic perennial flowering plants. Our work delineates a genetic basis for the evolution of diverse life-history strategies in plants and lays the groundwork for the generation of diverse perennial Brassicaceae crops in the future.
Subject(s)
Brassicaceae , Flowers , Gene Expression Regulation, Plant , Brassicaceae/genetics , Brassicaceae/physiology , Crops, Agricultural/genetics , Flowers/genetics , Flowers/physiology , MADS Domain Proteins/genetics , MADS Domain Proteins/metabolism , Phylogeny , Plant Proteins/genetics , Plant Proteins/metabolism , Genome, Plant , Plant Physiological Phenomena , Chromosome Mapping , MutationABSTRACT
Potato breeding efforts have long been hindered by the genetic consequences of millennia of clonal propagation. To mitigate genomic constraints, Zhang et al. leverage an unprecedented scale of sequencing and marker-assisted breeding to unlock traits that have not been possible through classical breeding, providing a blueprint for plant genome design.
Subject(s)
Solanum tuberosum , Genome, Plant , Genomics , Phenotype , Plant Breeding , Solanum tuberosum/geneticsABSTRACT
The functional regulatory elements of agronomically important plant genomes have been long sought after. Marand et. al. generate a comprehensive atlas of cis-regulatory elements at single cell resolution in maize, providing a powerful resource for inquiries into the rules of multicellular development and for precision crop engineering.
Subject(s)
Gene Expression Regulation, Plant , Zea mays , Genome, Plant , Regulatory Sequences, Nucleic Acid/genetics , Zea mays/geneticsABSTRACT
Feeding the ever-growing population is a major challenge, especially in light of rapidly changing climate conditions. Genome editing is set to revolutionize plant breeding and could help secure the global food supply. Here, I review the development and application of genome editing tools in plants while highlighting newly developed techniques. I describe new plant breeding strategies based on genome editing and discuss their impact on crop production, with an emphasis on recent advancements in genome editing-based plant improvements that could not be achieved by conventional breeding. I also discuss challenges facing genome editing that must be overcome before realizing the full potential of this technology toward future crops and food production.
Subject(s)
Agriculture , Crops, Agricultural/genetics , Genetic Engineering , Genome, Plant , Plant Breeding , Quantitative Trait Loci/geneticsABSTRACT
Structural variations (SVs) and gene copy number variations (gCNVs) have contributed to crop evolution, domestication, and improvement. Here, we assembled 31 high-quality genomes of genetically diverse rice accessions. Coupling with two existing assemblies, we developed pan-genome-scale genomic resources including a graph-based genome, providing access to rice genomic variations. Specifically, we discovered 171,072 SVs and 25,549 gCNVs and used an Oryza glaberrima assembly to infer the derived states of SVs in the Oryza sativa population. Our analyses of SV formation mechanisms, impacts on gene expression, and distributions among subpopulations illustrate the utility of these resources for understanding how SVs and gCNVs shaped rice environmental adaptation and domestication. Our graph-based genome enabled genome-wide association study (GWAS)-based identification of phenotype-associated genetic variations undetectable when using only SNPs and a single reference assembly. Our work provides rich population-scale resources paired with easy-to-access tools to facilitate rice breeding as well as plant functional genomics and evolutionary biology research.
Subject(s)
Ecotype , Genetic Variation , Genome, Plant , Oryza/genetics , Adaptation, Physiological/genetics , Agriculture , Domestication , Gene Expression Profiling , Gene Expression Regulation, Plant , Genes, Plant , Genomic Structural Variation , Molecular Sequence Annotation , PhenotypeABSTRACT
Cultivated rice varieties are all diploid, and polyploidization of rice has long been desired because of its advantages in genome buffering, vigorousness, and environmental robustness. However, a workable route remains elusive. Here, we describe a practical strategy, namely de novo domestication of wild allotetraploid rice. By screening allotetraploid wild rice inventory, we identified one genotype of Oryza alta (CCDD), polyploid rice 1 (PPR1), and established two important resources for its de novo domestication: (1) an efficient tissue culture, transformation, and genome editing system and (2) a high-quality genome assembly discriminated into two subgenomes of 12 chromosomes apiece. With these resources, we show that six agronomically important traits could be rapidly improved by editing O. alta homologs of the genes controlling these traits in diploid rice. Our results demonstrate the possibility that de novo domesticated allotetraploid rice can be developed into a new staple cereal to strengthen world food security.
Subject(s)
Crops, Agricultural/genetics , Domestication , Oryza/genetics , CRISPR-Cas Systems , Food Security , Gene Editing , Genetic Variation , Genome, Plant , Oryza/classification , PolyploidyABSTRACT
Reinventing potato from a clonally propagated tetraploid into a seed-propagated diploid, hybrid potato, is an important innovation in agriculture. Due to deleterious mutations, it has remained a challenge to develop highly homozygous inbred lines, a prerequisite to breed hybrid potato. Here, we employed genome design to develop a generation of pure and fertile potato lines and thereby the uniform, vigorous F1s. The metrics we applied in genome design included the percentage of genome homozygosity and the number of deleterious mutations in the starting material, the number of segregation distortions in the S1 population, the haplotype information to infer the break of tight linkage between beneficial and deleterious alleles, and the genome complementarity of the parental lines. This study transforms potato breeding from a slow, non-accumulative mode into a fast-iterative one, thereby potentiating a broad spectrum of benefits to farmers and consumers.
Subject(s)
Genome, Plant , Hybridization, Genetic , Solanum tuberosum/genetics , Crosses, Genetic , Diploidy , Fertility/genetics , Genes, Plant , Genetic Variation , Genetics, Population , Heterozygote , Homozygote , Hybrid Vigor/genetics , Mutation/genetics , Pedigree , Plant Breeding , Principal Component Analysis , Selection, GeneticABSTRACT
Structural variants (SVs) underlie important crop improvement and domestication traits. However, resolving the extent, diversity, and quantitative impact of SVs has been challenging. We used long-read nanopore sequencing to capture 238,490 SVs in 100 diverse tomato lines. This panSV genome, along with 14 new reference assemblies, revealed large-scale intermixing of diverse genotypes, as well as thousands of SVs intersecting genes and cis-regulatory regions. Hundreds of SV-gene pairs exhibit subtle and significant expression changes, which could broadly influence quantitative trait variation. By combining quantitative genetics with genome editing, we show how multiple SVs that changed gene dosage and expression levels modified fruit flavor, size, and production. In the last example, higher order epistasis among four SVs affecting three related transcription factors allowed introduction of an important harvesting trait in modern tomato. Our findings highlight the underexplored role of SVs in genotype-to-phenotype relationships and their widespread importance and utility in crop improvement.
Subject(s)
Crops, Agricultural/genetics , Gene Expression Regulation, Plant , Genomic Structural Variation , Solanum lycopersicum/genetics , Alleles , Cytochrome P-450 Enzyme System/genetics , Ecotype , Epistasis, Genetic , Fruit/genetics , Gene Duplication , Genome, Plant , Genotype , Inbreeding , Molecular Sequence Annotation , Phenotype , Plant Breeding , Quantitative Trait Loci/geneticsABSTRACT
Soybean is one of the most important vegetable oil and protein feed crops. To capture the entire genomic diversity, it is needed to construct a complete high-quality pan-genome from diverse soybean accessions. In this study, we performed individual de novo genome assemblies for 26 representative soybeans that were selected from 2,898 deeply sequenced accessions. Using these assembled genomes together with three previously reported genomes, we constructed a graph-based genome and performed pan-genome analysis, which identified numerous genetic variations that cannot be detected by direct mapping of short sequence reads onto a single reference genome. The structural variations from the 2,898 accessions that were genotyped based on the graph-based genome and the RNA sequencing (RNA-seq) data from the representative 26 accessions helped to link genetic variations to candidate genes that are responsible for important traits. This pan-genome resource will promote evolutionary and functional genomics studies in soybean.
Subject(s)
Genome, Plant , Glycine max/growth & development , Glycine max/genetics , Base Sequence , Chromosomes, Plant/genetics , Domestication , Ecotype , Gene Duplication , Gene Expression Regulation, Plant , Gene Fusion , Geography , Molecular Sequence Annotation , Phylogeny , Polymorphism, Single Nucleotide/genetics , PolyploidyABSTRACT
Banyan trees are distinguished by their extraordinary aerial roots. The Ficus genus includes species that have evolved a species-specific mutualism system with wasp pollinators. We sequenced genomes of the Chinese banyan tree, F. microcarpa, and a species lacking aerial roots, F. hispida, and one wasp genome coevolving with F. microcarpa, Eupristina verticillata. Comparative analysis of the two Ficus genomes revealed dynamic karyotype variation associated with adaptive evolution. Copy number expansion of auxin-related genes from duplications and elevated auxin production are associated with aerial root development in F. microcarpa. A male-specific AGAMOUS paralog, FhAG2, was identified as a candidate gene for sex determination in F. hispida. Population genomic analyses of Ficus species revealed genomic signatures of morphological and physiological coadaptation with their pollinators involving terpenoid- and benzenoid-derived compounds. These three genomes offer insights into and genomic resources for investigating the geneses of aerial roots, monoecy and dioecy, and codiversification in a symbiotic system.
Subject(s)
Biological Evolution , Ficus/genetics , Genome, Plant , Pollination/physiology , Trees/genetics , Wasps/physiology , Animals , Chromosomes, Plant/genetics , DNA Transposable Elements/genetics , Female , Gene Expression Profiling , Gene Expression Regulation, Plant , Genes, Plant , Indoleacetic Acids/metabolism , Molecular Sequence Annotation , Phylogeny , Plant Roots/growth & development , Segmental Duplications, Genomic/genetics , Sex Chromosomes/genetics , Volatile Organic Compounds/analysisABSTRACT
Infectious disease is both a major force of selection in nature and a prime cause of yield loss in agriculture. In plants, disease resistance is often conferred by nucleotide-binding leucine-rich repeat (NLR) proteins, intracellular immune receptors that recognize pathogen proteins and their effects on the host. Consistent with extensive balancing and positive selection, NLRs are encoded by one of the most variable gene families in plants, but the true extent of intraspecific NLR diversity has been unclear. Here, we define a nearly complete species-wide pan-NLRome in Arabidopsis thaliana based on sequence enrichment and long-read sequencing. The pan-NLRome largely saturates with approximately 40 well-chosen wild strains, with half of the pan-NLRome being present in most accessions. We chart NLR architectural diversity, identify new architectures, and quantify selective forces that act on specific NLRs and NLR domains. Our study provides a blueprint for defining pan-NLRomes.
Subject(s)
Arabidopsis Proteins/genetics , Arabidopsis/genetics , NLR Proteins/genetics , Alleles , Arabidopsis Proteins/metabolism , Disease Resistance/genetics , Genetic Variation , Genome, Plant , NLR Proteins/metabolism , Plant Diseases/genetics , Plant Immunity , Species SpecificityABSTRACT
The transition to a terrestrial environment, termed terrestrialization, is generally regarded as a pivotal event in the evolution and diversification of the land plant flora that changed the surface of our planet. Through phylogenomic studies, a group of streptophyte algae, the Zygnematophyceae, have recently been recognized as the likely sister group to land plants (embryophytes). Here, we report genome sequences and analyses of two early diverging Zygnematophyceae (Spirogloea muscicola gen. nov. and Mesotaenium endlicherianum) that share the same subaerial/terrestrial habitat with the earliest-diverging embryophytes, the bryophytes. We provide evidence that genes (i.e., GRAS and PYR/PYL/RCAR) that increase resistance to biotic and abiotic stresses in land plants, in particular desiccation, originated or expanded in the common ancestor of Zygnematophyceae and embryophytes, and were gained by horizontal gene transfer (HGT) from soil bacteria. These two Zygnematophyceae genomes represent a cornerstone for future studies to understand the underlying molecular mechanism and process of plant terrestrialization.
Subject(s)
Biological Evolution , Embryophyta/genetics , Genome, Plant , Streptophyta/genetics , Abscisic Acid/pharmacology , Amino Acid Sequence , Multigene Family , Phylogeny , Plant Proteins/chemistry , Protein Domains , Streptophyta/classification , Symbiosis/genetics , Synteny/geneticsABSTRACT
Land plants evolved from charophytic algae, among which Charophyceae possess the most complex body plans. We present the genome of Chara braunii; comparison of the genome to those of land plants identified evolutionary novelties for plant terrestrialization and land plant heritage genes. C. braunii employs unique xylan synthases for cell wall biosynthesis, a phragmoplast (cell separation) mechanism similar to that of land plants, and many phytohormones. C. braunii plastids are controlled via land-plant-like retrograde signaling, and transcriptional regulation is more elaborate than in other algae. The morphological complexity of this organism may result from expanded gene families, with three cases of particular note: genes effecting tolerance to reactive oxygen species (ROS), LysM receptor-like kinases, and transcription factors (TFs). Transcriptomic analysis of sexual reproductive structures reveals intricate control by TFs, activity of the ROS gene network, and the ancestral use of plant-like storage and stress protection proteins in the zygote.
Subject(s)
Chara/genetics , Genome, Plant , Biological Evolution , Cell Wall/metabolism , Chara/growth & development , Embryophyta/genetics , Gene Regulatory Networks , Pentosyltransferases/genetics , Plant Growth Regulators/metabolism , Plant Proteins/genetics , Plant Proteins/metabolism , Protein Serine-Threonine Kinases/genetics , Protein Serine-Threonine Kinases/metabolism , Reactive Oxygen Species/metabolism , Transcription Factors/genetics , Transcription Factors/metabolism , TranscriptomeABSTRACT
Major advances in crop yields are needed in the coming decades. However, plant breeding is currently limited by incremental improvements in quantitative traits that often rely on laborious selection of rare naturally occurring mutations in gene-regulatory regions. Here, we demonstrate that CRISPR/Cas9 genome editing of promoters generates diverse cis-regulatory alleles that provide beneficial quantitative variation for breeding. We devised a simple genetic scheme, which exploits trans-generational heritability of Cas9 activity in heterozygous loss-of-function mutant backgrounds, to rapidly evaluate the phenotypic impact of numerous promoter variants for genes regulating three major productivity traits in tomato: fruit size, inflorescence branching, and plant architecture. Our approach allows immediate selection and fixation of novel alleles in transgene-free plants and fine manipulation of yield components. Beyond a platform to enhance variation for diverse agricultural traits, our findings provide a foundation for dissecting complex relationships between gene-regulatory changes and control of quantitative traits.
Subject(s)
Crops, Agricultural/genetics , Gene Editing , Genome, Plant , CRISPR-Cas Systems , Promoter Regions, Genetic , Quantitative Trait LociABSTRACT
The evolution of land flora transformed the terrestrial environment. Land plants evolved from an ancestral charophycean alga from which they inherited developmental, biochemical, and cell biological attributes. Additional biochemical and physiological adaptations to land, and a life cycle with an alternation between multicellular haploid and diploid generations that facilitated efficient dispersal of desiccation tolerant spores, evolved in the ancestral land plant. We analyzed the genome of the liverwort Marchantia polymorpha, a member of a basal land plant lineage. Relative to charophycean algae, land plant genomes are characterized by genes encoding novel biochemical pathways, new phytohormone signaling pathways (notably auxin), expanded repertoires of signaling pathways, and increased diversity in some transcription factor families. Compared with other sequenced land plants, M. polymorpha exhibits low genetic redundancy in most regulatory pathways, with this portion of its genome resembling that predicted for the ancestral land plant. PAPERCLIP.
Subject(s)
Biological Evolution , Embryophyta/genetics , Genome, Plant , Marchantia/genetics , Adaptation, Biological , Embryophyta/physiology , Gene Expression Regulation, Plant , Marchantia/physiology , Molecular Sequence Annotation , Signal Transduction , Transcription, GeneticABSTRACT
Rice feeds half the world's population, and rice blast is often a destructive disease that results in significant crop loss. Non-race-specific resistance has been more effective in controlling crop diseases than race-specific resistance because of its broad spectrum and durability. Through a genome-wide association study, we report the identification of a natural allele of a C2H2-type transcription factor in rice that confers non-race-specific resistance to blast. A survey of 3,000 sequenced rice genomes reveals that this allele exists in 10% of rice, suggesting that this favorable trait has been selected through breeding. This allele causes a single nucleotide change in the promoter of the bsr-d1 gene, which results in reduced expression of the gene through the binding of the repressive MYB transcription factor and, consequently, an inhibition of H2O2 degradation and enhanced disease resistance. Our discovery highlights this novel allele as a strategy for breeding durable resistance in rice.
Subject(s)
Oryza/genetics , Plant Proteins/genetics , Transcription Factors/genetics , Base Sequence , Breeding , Disease Resistance , Gene Knockout Techniques , Genome, Plant , Genome-Wide Association Study , Plant Diseases , Promoter Regions, GeneticABSTRACT
The epigenome orchestrates genome accessibility, functionality, and three-dimensional structure. Because epigenetic variation can impact transcription and thus phenotypes, it may contribute to adaptation. Here, we report 1,107 high-quality single-base resolution methylomes and 1,203 transcriptomes from the 1001 Genomes collection of Arabidopsis thaliana. Although the genetic basis of methylation variation is highly complex, geographic origin is a major predictor of genome-wide DNA methylation levels and of altered gene expression caused by epialleles. Comparison to cistrome and epicistrome datasets identifies associations between transcription factor binding sites, methylation, nucleotide variation, and co-expression modules. Physical maps for nine of the most diverse genomes reveal how transposons and other structural variants shape the epigenome, with dramatic effects on immunity genes. The 1001 Epigenomes Project provides a comprehensive resource for understanding how variation in DNA methylation contributes to molecular and non-molecular phenotypes in natural populations of the most studied model plant.
Subject(s)
Arabidopsis/genetics , Epigenesis, Genetic , DNA Methylation , Epigenomics , Gene Expression Regulation, Plant , Genome, Plant , TranscriptomeABSTRACT
Arabidopsis thaliana serves as a model organism for the study of fundamental physiological, cellular, and molecular processes. It has also greatly advanced our understanding of intraspecific genome variation. We present a detailed map of variation in 1,135 high-quality re-sequenced natural inbred lines representing the native Eurasian and North African range and recently colonized North America. We identify relict populations that continue to inhabit ancestral habitats, primarily in the Iberian Peninsula. They have mixed with a lineage that has spread to northern latitudes from an unknown glacial refugium and is now found in a much broader spectrum of habitats. Insights into the history of the species and the fine-scale distribution of genetic diversity provide the basis for full exploitation of A. thaliana natural variation through integration of genomes and epigenomes with molecular and non-molecular phenotypes.
Subject(s)
Arabidopsis/genetics , Genome, Plant , Polymorphism, Genetic , Epigenesis, Genetic , Epigenomics , Genome-Wide Association Study , PhenotypeABSTRACT
The cistrome is the complete set of transcription factor (TF) binding sites (cis-elements) in an organism, while an epicistrome incorporates tissue-specific DNA chemical modifications and TF-specific chemical sensitivities into these binding profiles. Robust methods to construct comprehensive cistrome and epicistrome maps are critical for elucidating complex transcriptional networks that underlie growth, behavior, and disease. Here, we describe DNA affinity purification sequencing (DAP-seq), a high-throughput TF binding site discovery method that interrogates genomic DNA with in-vitro-expressed TFs. Using DAP-seq, we defined the Arabidopsis cistrome by resolving motifs and peaks for 529 TFs. Because genomic DNA used in DAP-seq retains 5-methylcytosines, we determined that >75% (248/327) of Arabidopsis TFs surveyed were methylation sensitive, a property that strongly impacts the epicistrome landscape. DAP-seq datasets also yielded insight into the biology and binding site architecture of numerous TFs, demonstrating the value of DAP-seq for cost-effective cistromic and epicistromic annotation in any organism.
Subject(s)
Arabidopsis/genetics , DNA, Plant/genetics , Genome, Plant , Response Elements , Sequence Analysis, DNA/methods , Transcription Factors/metabolism , Amino Acid Motifs , DNA, Plant/metabolism , Epigenesis, Genetic , Indoleacetic Acids/metabolism , Plant Proteins/geneticsABSTRACT
Within the life cycle of a living organism, another life cycle exists for the selfish genome inhabitants, which are called transposable elements (TEs). These mobile sequences invade, duplicate, amplify, and diversify within a genome, increasing the genome's size and generating new mutations. Cells act to defend their genome, but rather than permanently destroying TEs, they use chromatin-level repression and epigenetic inheritance to silence TE activity. This level of silencing is ephemeral and reversible, leading to a dynamic equilibrium between TE suppression and reactivation within a host genome. The coexistence of the TE and host genome can also lead to the domestication of the TE to serve in host genome evolution and function. In this review, we describe the life cycle of a TE, with emphasis on how epigenetic regulation is harnessed to control TEs for host genome stability and innovation.