ABSTRACT
The availability of public genomic resources can greatly assist biodiversity assessment, conservation, and restoration efforts by providing evidence for scientifically informed management decisions. Here we survey the main approaches and applications in biodiversity and conservation genomics, considering practical factors, such as cost, time, prerequisite skills, and current shortcomings of applications. Most approaches perform best in combination with reference genomes from the target species or closely related species. We review case studies to illustrate how reference genomes can facilitate biodiversity research and conservation across the tree of life. We conclude that the time is ripe to view reference genomes as fundamental resources and to integrate their use as a best practice in conservation genomics.
Subject(s)
Biodiversity , Conservation of Natural Resources , Genomics , GenomeABSTRACT
The gynoecium is critical for the reproduction of flowering plants as it contains the ovules and the tissues that foster pollen germination, growth, and guidance. These tissues, known as the reproductive tract (ReT), comprise the stigma, style, and transmitting tract (TT). The ReT and ovules originate from the carpel margin meristem (CMM) within the pistil. SHOOT MERISTEMLESS (STM) is a key transcription factor for meristem formation and maintenance. In all above-ground meristems, including the CMM, local STM downregulation is required for organ formation. However, how this downregulation is achieved in the CMM is unknown. Here, we have studied the role of HISTONE DEACETYLASE 19 (HDA19) in Arabidopsis (Arabidopsis thaliana) during ovule and ReT differentiation based on the observation that the hda19-3 mutant displays a reduced ovule number and fails to differentiate the TT properly. Fluorescence-activated cell sorting coupled with RNA-sequencing revealed that in the CMM of hda19-3 mutants, genes promoting organ development are downregulated while meristematic markers, including STM, are upregulated. HDA19 was essential to downregulate STM in the CMM, thereby allowing ovule formation and TT differentiation. STM is ectopically expressed in hda19-3 at intermediate stages of pistil development, and its downregulation by RNA interference alleviated the hda19-3 phenotype. Chromatin immunoprecipitation assays indicated that STM is a direct target of HDA19 during pistil development and that the transcription factor SEEDSTICK is also required to regulate STM via histone acetylation. Thus, we identified factors required for the downregulation of STM in the CMM, which is necessary for organogenesis and tissue differentiation.
Subject(s)
Arabidopsis Proteins , Arabidopsis , Arabidopsis Proteins/genetics , Arabidopsis Proteins/metabolism , Histones/genetics , Ovule/genetics , Ovule/metabolism , Arabidopsis/physiology , Transcription Factors/metabolism , Meristem , Gene Expression Regulation, Plant , MADS Domain Proteins/genetics , Histone Deacetylases/metabolismABSTRACT
Flowering plants adjust their reproductive period to maximize the success of the offspring. Monocarpic plants, those with a single reproductive cycle that precedes plant senescence and death, tightly regulate both flowering initiation and flowering cessation. The end of the flowering period involves the arrest of the inflorescence meristem activity, known as proliferative arrest, in what has been interpreted as an evolutionary adaptation to maximize the allocation of resources to seed production and the viability of the progeny. Factors influencing proliferative arrest were described for several monocarpic plant species many decades ago, but only in the last few years studies performed in Arabidopsis have allowed to approach proliferative arrest regulation in a comprehensive manner by studying the physiology, hormone dynamics, and genetic factors involved in its regulation. However, these studies remain restricted to Arabidopsis and there is a need to expand our knowledge to other monocarpic species to propose general mechanisms controlling the process. In this work, we have characterized proliferative arrest in Pisum sativum, trying to parallel available studies in Arabidopsis to maximize this comparative framework. We have assessed quantitatively the role of fruits/seeds in the process, the influence of the positional effect of these fruits/seeds in the behavior of the inflorescence meristem, and the transcriptomic changes in the inflorescence associated with the arrested state of the meristem. Our results support a high conservation of the factors triggering arrest in pea and Arabidopsis, but also reveal differences reinforcing the need to perform similar studies in other species.
Subject(s)
Flowers , Gene Expression Regulation, Plant , Inflorescence , Meristem , Pisum sativum , Seeds , Meristem/genetics , Meristem/growth & development , Meristem/physiology , Pisum sativum/genetics , Pisum sativum/physiology , Pisum sativum/growth & development , Inflorescence/genetics , Inflorescence/physiology , Inflorescence/growth & development , Flowers/genetics , Flowers/physiology , Flowers/growth & development , Seeds/genetics , Seeds/growth & development , Seeds/physiology , Plant Dormancy/genetics , Plant Dormancy/physiologyABSTRACT
SUMMARY: EasyGDB is an easy-to-implement low-maintenance tool developed to create genomic data management web platforms. It can be used for any species, group of species, or multiple genome or annotation versions. EasyGDB provides a framework to develop a web portal that includes the general information about species, projects and members, and bioinformatics tools such as file downloads, BLAST, genome browser, annotation search, gene expression visualization, annotation and sequence download, and gene ids and orthologs lookup. The code of EasyGDB facilitates data maintenance and update for non-experienced bioinformaticians, using BLAST databases to store and retrieve sequence data in gene annotation pages and bioinformatics tools, and JSON files to customize metadata. EasyGDB is a highly customizable tool. Any section and tool can be enabled or disabled like a switch through a single configuration file. This tool aims to simplify the development of genomics portals in non-model species, providing a modern web style with embedded interactive bioinformatics tools to cover all the common needs derived from genomics projects. AVAILABILITY AND IMPLEMENTATION: The code and manual to use EasyGDB can be found at https://github.com/noefp/easy_gdb.
Subject(s)
Genome , Genomics , Software , Computational Biology , Molecular Sequence AnnotationABSTRACT
Speciation begins with the isolation of some individuals or subpopulations due to drivers promoting a diverging genetic distribution. Such isolation may occur, followed by different processes and pressures. Isolation-by-distance (IBD), isolation-by-adaptation (IBA), and isolation-by-colonization (IBC) have been recognized as the main divergence patterns. Still, it is not easy to distinguish which one is the main pattern as each one may act at different points in time or even simultaneously. Using an extensive genome coverage from a Petunia species complex with coastal and inland distribution and multiple analytical approaches on population genomics and phylogeography, we showed a complex interplay between neutral and selective forces acting on the divergence process. We found 18,887 SNPs potentially neutral and 924 potentially under selection (outlier) loci. All analyses pointed that each subspecies displays its own genetic component and evolutionary history. We suggested plausible ecological drivers for such divergence in a southernmost South Atlantic coastal plain in Brazil and Uruguay and identified a connection between adaptation and environment heterogeneity.
Subject(s)
Genetics, Population , Solanaceae , Biological Evolution , Humans , Phylogeography , Polymorphism, Single Nucleotide/geneticsABSTRACT
BACKGROUND: Mango, Mangifera indica L., an important tropical fruit crop, is grown for its sweet and aromatic fruits. Past improvement of this species has predominantly relied on chance seedlings derived from over 1000 cultivars in the Indian sub-continent with a large variation for fruit size, yield, biotic and abiotic stress resistance, and fruit quality among other traits. Historically, mango has been an orphan crop with very limited molecular information. Only recently have molecular and genomics-based analyses enabled the creation of linkage maps, transcriptomes, and diversity analysis of large collections. Additionally, the combined analysis of genomic and phenotypic information is poised to improve mango breeding efficiency. RESULTS: This study sequenced, de novo assembled, analyzed, and annotated the genome of the monoembryonic mango cultivar 'Tommy Atkins'. The draft genome sequence was generated using NRGene de-novo Magic on high molecular weight DNA of 'Tommy Atkins', supplemented by 10X Genomics long read sequencing to improve the initial assembly. A hybrid population between 'Tommy Atkins' x 'Kensington Pride' was used to generate phased haplotype chromosomes and a highly resolved phased SNP map. The final 'Tommy Atkins' genome assembly was a consensus sequence that included 20 pseudomolecules representing the 20 chromosomes of mango and included ~ 86% of the ~ 439 Mb haploid mango genome. Skim sequencing identified ~ 3.3 M SNPs using the 'Tommy Atkins' x 'Kensington Pride' mapping population. Repeat masking identified 26,616 genes with a median length of 3348 bp. A whole genome duplication analysis revealed an ancestral 65 MYA polyploidization event shared with Anacardium occidentale. Two regions, one on LG4 and one on LG7 containing 28 candidate genes, were associated with the commercially important fruit size characteristic in the mapping population. CONCLUSIONS: The availability of the complete 'Tommy Atkins' mango genome will aid global initiatives to study mango genetics.
Subject(s)
Crops, Agricultural/growth & development , Crops, Agricultural/genetics , Fruit/growth & development , Fruit/genetics , Mangifera/growth & development , Mangifera/genetics , Taste/genetics , Genetic Variation , Genome, Plant , Genotype , Plant Breeding/methodsABSTRACT
MOTIVATION: Transposable elements (TEs) classification is an essential step to decode their roles in genome evolution. With a large number of genomes from non-model species becoming available, accurate and efficient TE classification has emerged as a new challenge in genomic sequence analysis. RESULTS: We developed a novel tool, DeepTE, which classifies unknown TEs using convolutional neural networks (CNNs). DeepTE transferred sequences into input vectors based on k-mer counts. A tree structured classification process was used where eight models were trained to classify TEs into super families and orders. DeepTE also detected domains inside TEs to correct false classification. An additional model was trained to distinguish between non-TEs and TEs in plants. Given unclassified TEs of different species, DeepTE can classify TEs into seven orders, which include 15, 24 and 16 super families in plants, metazoans and fungi, respectively. In several benchmarking tests, DeepTE outperformed other existing tools for TE classification. In conclusion, DeepTE successfully leverages CNN for TE classification, and can be used to precisely classify TEs in newly sequenced eukaryotic genomes. AVAILABILITY AND IMPLEMENTATION: DeepTE is accessible at https://github.com/LiLabAtVT/DeepTE. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Subject(s)
DNA Transposable Elements , Eukaryota , Base Sequence , DNA Transposable Elements/genetics , Eukaryota/genetics , Humans , Neural Networks, ComputerABSTRACT
Orchardgrass (Dactylis glomerata L.) is an important forage grass for cultivating livestock worldwide. Here, we report an ~1.84-Gb chromosome-scale diploid genome assembly of orchardgrass, with a contig N50 of 0.93 Mb, a scaffold N50 of 6.08 Mb and a super-scaffold N50 of 252.52 Mb, which is the first chromosome-scale assembled genome of a cool-season forage grass. The genome includes 40 088 protein-coding genes, and 69% of the assembled sequences are transposable elements, with long terminal repeats (LTRs) being the most abundant. The LTRretrotransposons may have been activated and expanded in the grass genome in response to environmental changes during the Pleistocene between 0 and 1 million years ago. Phylogenetic analysis reveals that orchardgrass diverged after rice but before three Triticeae species, and evolutionarily conserved chromosomes were detected by analysing ancient chromosome rearrangements in these grass species. We also resequenced the whole genome of 76 orchardgrass accessions and found that germplasm from Northern Europe and East Asia clustered together, likely due to the exchange of plants along the 'Silk Road' or other ancient trade routes connecting the East and West. Last, a combined transcriptome, quantitative genetic and bulk segregant analysis provided insights into the genetic network regulating flowering time in orchardgrass and revealed four main candidate genes controlling this trait. This chromosome-scale genome and the online database of orchardgrass developed here will facilitate the discovery of genes controlling agronomically important traits, stimulate genetic improvement of and functional genetic research on orchardgrass and provide comparative genetic resources for other forage grasses.
Subject(s)
Dactylis , Evolution, Molecular , Flowers , Gene Regulatory Networks , Dactylis/genetics , Flowers/genetics , Microsatellite Repeats , Phenotype , PhylogenyABSTRACT
Resolving complex plant pathogen genomes is important for identifying the genomic shifts associated with rapid adaptation to selective agents such as hosts and fungicides, yet assembling these genomes remains challenging and expensive. Phytophthora capsici is an important, globally distributed plant pathogen that exhibits widespread fungicide resistance and a broad host range. As with other pathogenic oomycetes, P. capsici has a complex life history and a complex genome. Here, we leverage Oxford Nanopore Technologies and existing short-read resources to rapidly generate a low-cost, improved assembly. We generated 10 Gbp from a single MinION flow cell resulting in >1.25 million reads with an N50 of 13 kb. The resulting assembly is 95.2 Mbp in 424 scaffolds with an N50 length of 313 kb. This assembly is approximately 30 Mbp bigger than the current reference genome of 64 Mbp. We confirmed this larger genome size using flow cytometry, with an estimated size of 110 Mbp. BUSCO analysis identified 97.4% complete orthologs (19.2% duplicated). Evolutionary analysis supports a recent whole-genome duplication in this group. Our work provides a blueprint for rapidly integrating benchtop long-read sequencing with existing short-read data, to dramatically improve assembly quality and integrity of complex genomes and offer novel insights into pathogen genome function and evolution.
Subject(s)
Genome, Protozoan , Phytophthora , Sequence Analysis, DNA , Genome Size , Genome, Protozoan/genetics , Genomics , High-Throughput Nucleotide Sequencing , Phytophthora/geneticsABSTRACT
Plant domestication is the process of adapting plants to human use by selecting specific traits. The selection process often involves the modification of some components of the plant reproductive mechanisms. Allelic variants of genes associated with flowering time, vernalization, and the circadian clock are responsible for the adaptation of crops, such as rice, maize, barley, wheat, and tomato, to non-native latitudes. Modifications in the plant architecture and branching have been selected for higher yields and easier harvests. These phenotypes are often produced by alterations in the regulation of the transition of shoot apical meristems to inflorescences, and then to floral meristems. Floral homeotic mutants are responsible for popular double-flower phenotypes in Japanese cherries, roses, camellias, and lilies. The rise of peloric flowers in ornamentals such as snapdragon and florists' gloxinia is associated with non-functional alleles that control the relative expansion of lateral and ventral petals. Mechanisms to force outcrossing such as self-incompatibility have been removed in some tree crops cultivars such as almonds and peaches. In this review, we revisit some of these important concepts from the plant domestication perspective, focusing on four topics related to the pre-fertilization mechanisms: flowering time, inflorescence architecture, flower development, and pre-fertilization self-incompatibility mechanisms.
Subject(s)
Crops, Agricultural/genetics , Domestication , Flowers/growth & development , Inflorescence/anatomy & histology , Self-Incompatibility in Flowering Plants/physiology , Crops, Agricultural/physiology , FertilizationABSTRACT
Polyploidization is a significant source of genomic and organism diversification during plant evolution, and leads to substantial alterations in plant phenotypes and natural fitness. To help understand the phenotypic and molecular impacts of autopolyploidization, we conducted epigenetic and full-transcriptomic analyses of a synthesized autopolyploid accession of switchgrass (Panicum virgatum) in order to interpret the molecular and phenotypic changes. We found that mCHH levels were decreased in both genic and transposable element (TE) regions, and that TE methylation near genes was decreased as well. Among 142 differentially expressed genes involved in cell division, cellulose biosynthesis, auxin response, growth, and reproduction processes, 75 of them were modified by 122 differentially methylated regions, 10 miRNAs, and 15 siRNAs. In addition, up-regulated PvTOE1 and suppressed PvFT probably contribute to later flowering time of the autopolyploid. The expression changes were probably associated with modification of nearby methylation sites and siRNAs. We also experimentally demonstrated that expression levels of PvFT and PvTOE1 were regulated by DNA methylation, supporting the link between alterations in methylation induced by polyploidization and the phenotypic changes that were observed. Collectively, our results show epigenetic modifications in synthetic autopolyploid switchgrass for the first time, and support the hypothesis that polyploidization-induced methylation is an important cause of phenotypic alterations and is potentially important for plant evolution and improved fitness.
Subject(s)
Epigenome/genetics , Panicum/genetics , DNA Transposable Elements/genetics , Gene Expression Profiling , Gene Expression Regulation, Plant/genetics , Transcriptome/geneticsABSTRACT
The Cucurbita genus (squashes, pumpkins and gourds) includes important domesticated species such as C. pepo, C. maxima and C. moschata. In this study, we present a high-quality draft of the zucchini (C. pepo) genome. The assembly has a size of 263 Mb, a scaffold N50 of 1.8 Mb and 34 240 gene models. It includes 92% of the conserved BUSCO core gene set, and it is estimated to cover 93.0% of the genome. The genome is organized in 20 pseudomolecules that represent 81.4% of the assembly, and it is integrated with a genetic map of 7718 SNPs. Despite the small genome size, three independent lines of evidence support that the C. pepo genome is the result of a whole-genome duplication: the topology of the gene family phylogenies, the karyotype organization and the distribution of 4DTv distances. Additionally, 40 transcriptomes of 12 species of the genus were assembled and analysed together with all the other published genomes of the Cucurbitaceae family. The duplication was detected in all the Cucurbita species analysed, including C. maxima and C. moschata, but not in the more distant cucurbits belonging to the Cucumis and Citrullus genera, and it is likely to have occurred 30 ± 4 Mya in the ancestral species that gave rise to the genus.
Subject(s)
Biological Evolution , Cucurbita/genetics , Gene Duplication , Genome, Plant , Transcriptome , Cucurbita/metabolismABSTRACT
BACKGROUND: The development of long-read sequencing technologies, such as single-molecule real-time (SMRT) sequencing by PacBio, has produced a revolution in the sequencing of small genomes. Sequencing organelle genomes using PacBio long-read data is a cost effective, straightforward approach. Nevertheless, the availability of simple-to-use software to perform the assembly from raw reads is limited at present. RESULTS: We present Organelle-PBA, a Perl program designed specifically for the assembly of chloroplast and mitochondrial genomes. For chloroplast genomes, the program selects the chloroplast reads from a whole genome sequencing pool, maps the reads to a reference sequence from a closely related species, and then performs read correction and de novo assembly using Sprai. Organelle-PBA completes the assembly process with the additional step of scaffolding by SSPACE-LongRead. The program then detects the chloroplast inverted repeats and reassembles and re-orients the assembly based on the organelle origin of the reference. We have evaluated the performance of the software using PacBio reads from different species, read coverage, and reference genomes. Finally, we present the assembly of two novel chloroplast genomes from the species Picea glauca (Pinaceae) and Sinningia speciosa (Gesneriaceae). CONCLUSION: Organelle-PBA is an easy-to-use Perl-based software pipeline that was written specifically to assemble mitochondrial and chloroplast genomes from whole genome PacBio reads. The program is available at https://github.com/aubombarely/Organelle_PBA .
Subject(s)
Genome, Chloroplast/genetics , Genome, Mitochondrial/genetics , Genomics/methods , Sequence Analysis, DNA , Software , Animals , Arabidopsis/cytology , Arabidopsis/genetics , Base Sequence , Mice , RatsABSTRACT
BACKGROUND: Allopolyploids contain genomes composed of more than two complete sets of chromosomes that originate from at least two species. Allopolyploidy has been suggested as an important evolutionary mechanism that can lead to instant speciation. Arabidopsis suecica is a relatively recent allopolyploid species, suggesting that its natural accessions might be genetically very similar to each other. Nonetheless, subtle phenotypic differences have been described between different geographic accessions of A. suecica grown in a common garden. RESULTS: To determine the degree of genomic similarity between different populations of A. suecica, we obtained transcriptomic sequence, quantified SNP variation within the gene space, and analyzed gene expression levels genome-wide from leaf material grown in controlled lab conditions. Despite their origin from the same progenitor species, the two accessions of A. suecica used in our study show genomic and transcriptomic variation. We report significant gene expression differences between the accessions, mostly in genes with stress-related functions. Among the differentially expressed genes, there are a surprising number of homoeologs coordinately regulated between sister accessions. CONCLUSIONS: Many of these homoeologous genes and other differentially expressed genes affect transpiration and stomatal regulation, suggesting that they might be involved in the establishment of the phenotypic differences between the two accessions.
Subject(s)
Arabidopsis/genetics , Arabidopsis/physiology , Genes, Plant/genetics , Genetic Variation , Polyploidy , Stress, Physiological/genetics , Gene Ontology , GenomicsABSTRACT
The Arabidopsis thaliana gene XYLEM NAC DOMAIN1 (XND1) is upregulated in xylem tracheary elements. Yet overexpression of XND1 blocks differentiation of tracheary elements. The molecular mechanism of XND1 action was investigated. Phylogenetic and motif analyses indicated that XND1 and its homologs are present only in angiosperms and possess a highly conserved C-terminal region containing linear motifs (CKII-acidic, LXCXE, E2FTD -like and LXCXE-mimic) predicted to interact with the cell cycle and differentiation regulator RETINOBLASTOMA-RELATED (RBR). Protein-protein interaction and functional analyses of XND1 deletion mutants were used to test the importance of RBR-interaction motifs. Deletion of either the LXCXE or the LXCXE-mimic motif reduced both the XND1-RBR interaction and XND1 efficacy as a repressor of differentiation, with loss of the LXCXE motif having the strongest negative impacts. The function of the XND1 C-terminal domain could be partially replaced by RBR fused to the N-terminal domain of XND1. XND1 also transactivated gene expression in yeast and plants. The properties of XND1, a transactivator that depends on multiple linear RBR-interaction motifs to inhibit differentiation, have not previously been described for a plant protein. XND1 harbors an apparently angiosperm-specific combination of interaction motifs potentially linking the general differentiation regulator RBR with a xylem-specific pathway for inhibition of differentiation.
Subject(s)
Arabidopsis Proteins/chemistry , Arabidopsis Proteins/metabolism , Cell Differentiation , Conserved Sequence , DNA-Binding Proteins/chemistry , DNA-Binding Proteins/metabolism , Transcription Factors/chemistry , Transcription Factors/metabolism , Xylem/cytology , Amino Acid Motifs , Amino Acid Sequence , Arabidopsis , Phenotype , Phosphorylation , Promoter Regions, Genetic/genetics , Protein Binding , Protein Domains , Sequence Homology, Amino Acid , Trans-Activators/metabolismABSTRACT
The Sol Genomics Network (SGN, http://solgenomics.net) is a web portal with genomic and phenotypic data, and analysis tools for the Solanaceae family and close relatives. SGN hosts whole genome data for an increasing number of Solanaceae family members including tomato, potato, pepper, eggplant, tobacco and Nicotiana benthamiana. The database also stores loci and phenotype data, which researchers can upload and edit with user-friendly web interfaces. Tools such as BLAST, GBrowse and JBrowse for browsing genomes, expression and map data viewers, a locus community annotation system and a QTL analysis tools are available. A new tool was recently implemented to improve Virus-Induced Gene Silencing (VIGS) constructs called the SGN VIGS tool. With the growing genomic and phenotypic data in the database, SGN is now advancing to develop new web-based breeding tools and implement the code and database structure for other species or clade-specific databases.
Subject(s)
Databases, Nucleic Acid , Genome, Plant , Solanaceae/genetics , Breeding , Crosses, Genetic , Genomics , Genotype , Internet , Phenotype , Solanaceae/metabolismABSTRACT
BACKGROUND: Switchgrass (Panicum virgatum L.) is a warm-season perennial grass that can be used as a second generation bioenergy crop. However, foliar fungal pathogens, like switchgrass rust, have the potential to significantly reduce switchgrass biomass yield. Despite its importance as a prominent bioenergy crop, a genome-wide comprehensive analysis of NB-LRR disease resistance genes has yet to be performed in switchgrass. RESULTS: In this study, we used a homology-based computational approach to identify 1011 potential NB-LRR resistance gene homologs (RGHs) in the switchgrass genome (v 1.1). In addition, we identified 40 RGHs that potentially contain unique domains including major sperm protein domain, jacalin-like binding domain, calmodulin-like binding, and thioredoxin. RNA-sequencing analysis of leaf tissue from 'Alamo', a rust-resistant switchgrass cultivar, and 'Dacotah', a rust-susceptible switchgrass cultivar, identified 2634 high quality variants in the RGHs between the two cultivars. RNA-sequencing data from field-grown cultivar 'Summer' plants indicated that the expression of some of these RGHs was developmentally regulated. CONCLUSIONS: Our results provide useful insight into the molecular structure, distribution, and expression patterns of members of the NB-LRR gene family in switchgrass. These results also provide a foundation for future work aimed at elucidating the molecular mechanisms underlying disease resistance in this important bioenergy crop.
Subject(s)
Disease Resistance/genetics , Gene Expression Profiling , Genes, Plant , Genetic Association Studies , Panicum/genetics , Alleles , Amino Acid Sequence , Computational Biology/methods , Databases, Nucleic Acid , Genetic Predisposition to Disease , Genome, Plant , Genomics/methods , Panicum/classification , Phylogeny , Polymorphism, Single Nucleotide , Position-Specific Scoring Matrices , Protein Interaction Domains and Motifs/genetics , Reproducibility of ResultsABSTRACT
Retention or loss of paralogs following duplication correlates strongly with the function of the gene and whether the gene was duplicated by whole-genome duplication (WGD) or by small-scale duplication. Selection on relative gene dosage (to maintain proper stoichiometry among interacting proteins) has been invoked to explain these patterns of duplicate gene retention and loss. In order for gene dosage to be visible to natural selection, there must necessarily be a correlation between gene copy number and gene expression level (transcript abundance), but this has rarely been examined. We used RNA-Seq data from seven Glycine subgenus Glycine species (three recently formed allotetraploids and their four diploid progenitors) to determine if expression patterns and gene dosage responses at the level of transcription are consistent with selection on relative gene dosage. As predicted, metabolic pathways and gene ontologies that are putatively dosage-sensitive based on duplication history exhibited reduced expression variance across species, and more coordinated expression responses to recent WGD, relative to putatively dosage-insensitive networks. We conclude that selection on relative dosage has played an important role in shaping gene networks in Glycine.
Subject(s)
Diploidy , Gene Dosage , Gene Expression Regulation, Plant , Glycine max/genetics , Polyploidy , Gene Duplication , Gene Ontology , Gene Regulatory Networks , Genes, PlantABSTRACT
The 3-hydroxy-3-methylglutaryl-CoA reductase (HMGR) enzyme catalyzes the major rate-limiting step of the mevalonic acid (MVA) pathway from which sterols and other isoprenoids are synthesized. In contrast with our extensive knowledge of the regulation of HMGR in yeast and animals, little is known about this process in plants. To identify regulatory components of the MVA pathway in plants, we performed a genetic screen for second-site suppressor mutations of the Arabidopsis thaliana highly drought-sensitive drought hypersensitive2 (dry2) mutant that shows decreased squalene epoxidase activity. We show that mutations in SUPPRESSOR OF DRY2 DEFECTS1 (SUD1) gene recover most developmental defects in dry2 through changes in HMGR activity. SUD1 encodes a putative E3 ubiquitin ligase that shows sequence and structural similarity to yeast Degradation of α factor (Doα10) and human TEB4, components of the endoplasmic reticulum-associated degradation C (ERAD-C) pathway. While in yeast and animals, the alternative ERAD-L/ERAD-M pathway regulates HMGR activity by controlling protein stability, SUD1 regulates HMGR activity without apparent changes in protein content. These results highlight similarities, as well as important mechanistic differences, among the components involved in HMGR regulation in plants, yeast, and animals.
Subject(s)
Arabidopsis Proteins/metabolism , Arabidopsis/metabolism , Hydroxymethylglutaryl CoA Reductases/metabolism , Ubiquitin-Protein Ligases/metabolism , Arabidopsis/genetics , Arabidopsis Proteins/genetics , Humans , Membrane Proteins/genetics , Mevalonic Acid/metabolism , Mutation , Phenotype , Plant Roots/genetics , Plant Roots/growth & development , Plant Roots/metabolism , Plant Shoots/genetics , Plant Shoots/metabolism , Plants, Genetically Modified , Saccharomyces cerevisiae Proteins/genetics , Sequence Homology, Amino Acid , Sterols/metabolism , Ubiquitin-Protein Ligases/geneticsABSTRACT
The Genome Database for Rosaceae (GDR, http:/www.rosaceae.org), the long-standing central repository and data mining resource for Rosaceae research, has been enhanced with new genomic, genetic and breeding data, and improved functionality. Whole genome sequences of apple, peach and strawberry are available to browse or download with a range of annotations, including gene model predictions, aligned transcripts, repetitive elements, polymorphisms, mapped genetic markers, mapped NCBI Rosaceae genes, gene homologs and association of InterPro protein domains, GO terms and Kyoto Encyclopedia of Genes and Genomes pathway terms. Annotated sequences can be queried using search interfaces and visualized using GBrowse. New expressed sequence tag unigene sets are available for major genera, and Pathway data are available through FragariaCyc, AppleCyc and PeachCyc databases. Synteny among the three sequenced genomes can be viewed using GBrowse_Syn. New markers, genetic maps and extensively curated qualitative/Mendelian and quantitative trait loci are available. Phenotype and genotype data from breeding projects and genetic diversity projects are also included. Improved search pages are available for marker, trait locus, genetic diversity and publication data. New search tools for breeders enable selection comparison and assistance with breeding decision making.