Search | Nursing VHL Search Portal

1.

Differences in activity and stability drive transposable element variation in tropical and temperate maize.

Ou, Shujun; Scheben, Armin; Collins, Tyler; Qiu, Yinjie; Seetharam, Arun S; Menard, Claire C; Manchanda, Nancy; Gent, Jonathan I; Schatz, Michael C; Anderson, Sarah N; Hufford, Matthew B; Hirsch, Candice N.

Genome Res ; 34(8): 1140-1153, 2024 Sep 20.

Article in English | MEDLINE | ID: mdl-39251347

ABSTRACT

Much of the profound interspecific variation in genome content has been attributed to transposable elements (TEs). To explore the extent of TE variation within species, we developed an optimized open-source algorithm, panEDTA, to de novo annotate TEs in a pangenome context. We then generated a unified TE annotation for a maize pangenome derived from 26 reference-quality genomes, which reveals an excess of 35.1 Mb of TE sequences per genome in tropical maize relative to temperate maize. A small number (n = 216) of TE families, mainly LTR retrotransposons, drive these differences. Evidence from the methylome, transcriptome, LTR age distribution, and LTR insertional polymorphisms reveals that 64.7% of the variability is contributed by LTR families that are young, less methylated, and more expressed in tropical maize, whereas 18.5% is driven by LTR families with removal or loss in temperate maize. Additionally, we find enrichment for Young LTR families adjacent to nucleotide-binding and leucine-rich repeat (NLR) clusters of varying copy number across lines, suggesting TE activity may be associated with disease resistance in maize.

Subject(s)

DNA Transposable Elements , Genome, Plant , Retroelements , Terminal Repeat Sequences , Zea mays , Zea mays/genetics , Retroelements/genetics , Genetic Variation , Molecular Sequence Annotation , Tropical Climate , DNA Methylation

2.

An adaptive teosinte mexicana introgression modulates phosphatidylcholine levels and is associated with maize flowering time.

Barnes, Allison C; Rodríguez-Zapata, Fausto; Juárez-Núñez, Karla A; Gates, Daniel J; Janzen, Garrett M; Kur, Andi; Wang, Li; Jensen, Sarah E; Estévez-Palmas, Juan M; Crow, Taylor M; Kavi, Heli S; Pil, Hannah D; Stokes, Ruthie L; Knizner, Kevan T; Aguilar-Rangel, Maria R; Demesa-Arévalo, Edgar; Skopelitis, Tara; Pérez-Limón, Sergio; Stutts, Whitney L; Thompson, Peter; Chiu, Yu-Chun; Jackson, David; Muddiman, David C; Fiehn, Oliver; Runcie, Daniel; Buckler, Edward S; Ross-Ibarra, Jeffrey; Hufford, Matthew B; Sawers, Ruairidh J H; Rellán-Álvarez, Rubén.

Proc Natl Acad Sci U S A ; 119(27): e2100036119, 2022 07 05.

Article in English | MEDLINE | ID: mdl-35771940

ABSTRACT

Native Americans domesticated maize (Zea mays ssp. mays) from lowland teosinte parviglumis (Zea mays ssp. parviglumis) in the warm Mexican southwest and brought it to the highlands of Mexico and South America where it was exposed to lower temperatures that imposed strong selection on flowering time. Phospholipids are important metabolites in plant responses to low-temperature and phosphorus availability and have been suggested to influence flowering time. Here, we combined linkage mapping with genome scans to identify High PhosphatidylCholine 1 (HPC1), a gene that encodes a phospholipase A1 enzyme, as a major driver of phospholipid variation in highland maize. Common garden experiments demonstrated strong genotype-by-environment interactions associated with variation at HPC1, with the highland HPC1 allele leading to higher fitness in highlands, possibly by hastening flowering. The highland maize HPC1 variant resulted in impaired function of the encoded protein due to a polymorphism in a highly conserved sequence. A meta-analysis across HPC1 orthologs indicated a strong association between the identity of the amino acid at this position and optimal growth in prokaryotes. Mutagenesis of HPC1 via genome editing validated its role in regulating phospholipid metabolism. Finally, we showed that the highland HPC1 allele entered cultivated maize by introgression from the wild highland teosinte Zea mays ssp. mexicana and has been maintained in maize breeding lines from the Northern United States, Canada, and Europe. Thus, HPC1 introgressed from teosinte mexicana underlies a large metabolic QTL that modulates phosphatidylcholine levels and has an adaptive effect at least in part via induction of early flowering time.

Subject(s)

Adaptation, Physiological , Flowers , Gene-Environment Interaction , Phosphatidylcholines , Phospholipases A1 , Plant Proteins , Zea mays , Alleles , Chromosome Mapping , Flowers/genetics , Flowers/metabolism , Genes, Plant , Genetic Linkage , Phosphatidylcholines/metabolism , Phospholipases A1/classification , Phospholipases A1/genetics , Phospholipases A1/metabolism , Plant Proteins/classification , Plant Proteins/genetics , Plant Proteins/metabolism , Zea mays/genetics , Zea mays/growth & development

3.

Unraveling Prevalence and Effects of Deleterious Mutations in Maize Elite Lines across Decades of Modern Breeding.

Sun, Shichao; Wang, Baobao; Li, Changyu; Xu, Gen; Yang, Jinliang; Hufford, Matthew B; Ross-Ibarra, Jeffrey; Wang, Haiyang; Wang, Li.

Mol Biol Evol ; 40(8)2023 08 03.

Article in English | MEDLINE | ID: mdl-37494285

ABSTRACT

Future breeding is likely to involve the detection and removal of deleterious alleles, which are mutations that negatively affect crop fitness. However, little is known about the prevalence of such mutations and their effects on phenotypic traits in the context of modern crop breeding. To address this, we examined the number and frequency of deleterious mutations in 350 elite maize inbred lines developed over the past few decades in China and the United States. Our findings reveal an accumulation of weakly deleterious mutations and a decrease in strongly deleterious mutations, indicating the dominant effects of genetic drift and purifying selection for the two types of mutations, respectively. We also discovered that slightly deleterious mutations, when at lower frequencies, were more likely to be heterozygous in the developed hybrids. This is consistent with complementation as a potential explanation for heterosis. Subsequently, we found that deleterious mutations accounted for more of the variation in phenotypic traits than nondeleterious mutations with matched minor allele frequencies, especially for traits related to leaf angle and flowering time. Moreover, we detected fewer deleterious mutations in the promoter and gene body regions of differentially expressed genes across breeding eras than in nondifferentially expressed genes. Overall, our results provide a comprehensive assessment of the prevalence and impact of deleterious mutations in modern maize breeding and establish a useful baseline for future maize improvement efforts.

Subject(s)

Plant Breeding , Zea mays , Zea mays/genetics , Prevalence , Gene Frequency , Mutation

4.

Conserved noncoding sequences provide insights into regulatory sequence and loss of gene expression in maize.

Song, Baoxing; Buckler, Edward S; Wang, Hai; Wu, Yaoyao; Rees, Evan; Kellogg, Elizabeth A; Gates, Daniel J; Khaipho-Burch, Merritt; Bradbury, Peter J; Ross-Ibarra, Jeffrey; Hufford, Matthew B; Romay, M Cinta.

Genome Res ; 31(7): 1245-1257, 2021 Jul.

Article in English | MEDLINE | ID: mdl-34045362

ABSTRACT

Thousands of species will be sequenced in the next few years; however, understanding how their genomes work, without an unlimited budget, requires both molecular and novel evolutionary approaches. We developed a sensitive sequence alignment pipeline to identify conserved noncoding sequences (CNSs) in the Andropogoneae tribe (multiple crop species descended from a common ancestor â¼18 million years ago). The Andropogoneae share similar physiology while being tremendously genomically diverse, harboring a broad range of ploidy levels, structural variation, and transposons. These contribute to the potential of Andropogoneae as a powerful system for studying CNSs and are factors we leverage to understand the function of maize CNSs. We found that 86% of CNSs were comprised of annotated features, including introns, UTRs, putative cis-regulatory elements, chromatin loop anchors, noncoding RNA (ncRNA) genes, and several transposable element superfamilies. CNSs were enriched in active regions of DNA replication in the early S phase of the mitotic cell cycle and showed different DNA methylation ratios compared to the genome-wide background. More than half of putative cis-regulatory sequences (identified via other methods) overlapped with CNSs detected in this study. Variants in CNSs were associated with gene expression levels, and CNS absence contributed to loss of gene expression. Furthermore, the evolution of CNSs was associated with the functional diversification of duplicated genes in the context of maize subgenomes. Our results provide a quantitative understanding of the molecular processes governing the evolution of CNSs in maize.

5.

Fine mapping of major QTL qshgd1 for spontaneous haploid genome doubling in maize (Zea mays L.).

Foster, Tyler L; Kloiber-Maitz, Monika; Gilles, Laurine; Frei, Ursula K; Pfeffer, Sarah; Chen, Yu-Ru; Dutta, Somak; Seetharam, Arun S; Hufford, Matthew B; Lübberstedt, Thomas.

Theor Appl Genet ; 137(5): 117, 2024 May 03.

Article in English | MEDLINE | ID: mdl-38700534

ABSTRACT

KEY MESSAGE: A large-effect QTL was fine mapped, which revealed 79 gene models, with 10 promising candidate genes, along with a novel inversion. In commercial maize breeding, doubled haploid (DH) technology is arguably the most efficient resource for rapidly developing novel, completely homozygous lines. However, the DH strategy, using in vivo haploid induction, currently requires the use of mutagenic agents which can be not only hazardous, but laborious. This study focuses on an alternative approach to develop DH lines-spontaneous haploid genome duplication (SHGD) via naturally restored haploid male fertility (HMF). Inbred lines A427 and Wf9, the former with high HMF and the latter with low HMF, were selected to fine-map a large-effect QTL associated with SHGD-qshgd1. SHGD alleles were derived from A427, with novel haploid recombinant groups having varying levels of the A427 chromosomal region recovered. The chromosomal region of interest is composed of 45 megabases (Mb) of genetic information on chromosome 5. Significant differences between haploid recombinant groups for HMF were identified, signaling the possibility of mapping the QTL more closely. Due to suppression of recombination from the proximity of the centromere, and a newly discovered inversion region, the associated QTL was only confined to a 25 Mb region, within which only a single recombinant was observed among ca. 9,000 BC1 individuals. Nevertheless, 79 gene models were identified within this 25 Mb region. Additionally, 10 promising candidate genes, based on RNA-seq data, are described for future evaluation, while the narrowed down genome region is accessible for straightforward introgression into elite germplasm by BC methods.

Subject(s)

Chromosome Mapping , Haploidy , Quantitative Trait Loci , Zea mays , Zea mays/genetics , Chromosome Mapping/methods , Plant Breeding , Genome, Plant , Phenotype , Alleles , Chromosomes, Plant/genetics , Genes, Plant

6.

Allele-specific Expression Reveals Multiple Paths to Highland Adaptation in Maize.

Hu, Haixiao; Crow, Taylor; Nojoomi, Saghi; Schulz, Aimee J; Estévez-Palmas, Juan M; Hufford, Matthew B; Flint-Garcia, Sherry; Sawers, Ruairidh; Rellán-Álvarez, Rubén; Ross-Ibarra, Jeffrey; Runcie, Daniel E.

Mol Biol Evol ; 39(11)2022 11 03.

Article in English | MEDLINE | ID: mdl-36327321

ABSTRACT

Maize is a staple food of smallholder farmers living in highland regions up to 4,000âm above sea level worldwide. Mexican and South American highlands are two major highland maize growing regions, and population genetic data suggest the maize's adaptation to these regions occurred largely independently, providing a case study for convergent evolution. To better understand the mechanistic basis of highland adaptation, we crossed maize landraces from 108 highland and lowland sites of Mexico and South America with the inbred line B73 to produce F1 hybrids and grew them in both highland and lowland sites in Mexico. We identified thousands of genes with divergent expression between highland and lowland populations. Hundreds of these genes show patterns of convergent evolution between Mexico and South America. To dissect the genetic architecture of the divergent gene expression, we developed a novel allele-specific expression analysis pipeline to detect genes with divergent functional cis-regulatory variation between highland and lowland populations. We identified hundreds of genes with divergent cis-regulation between highland and lowland landrace alleles, with 20 in common between regions, further suggesting convergence in the genes underlying highland adaptation. Further analyses suggest multiple mechanisms contribute to this convergence in gene regulation. Although the vast majority of evolutionary changes associated with highland adaptation were region specific, our findings highlight an important role for convergence at the gene expression and gene regulation levels as well.

Subject(s)

Adaptation, Physiological , Zea mays , Zea mays/genetics , Alleles , Adaptation, Physiological/genetics , Genetics, Population , Acclimatization

7.

Molecular Parallelism Underlies Convergent Highland Adaptation of Maize Landraces.

Wang, Li; Josephs, Emily B; Lee, Kristin M; Roberts, Lucas M; Rellán-Álvarez, Rubén; Ross-Ibarra, Jeffrey; Hufford, Matthew B.

Mol Biol Evol ; 38(9): 3567-3580, 2021 08 23.

Article in English | MEDLINE | ID: mdl-33905497

ABSTRACT

Convergent phenotypic evolution provides some of the strongest evidence for adaptation. However, the extent to which recurrent phenotypic adaptation has arisen via parallelism at the molecular level remains unresolved, as does the evolutionary origin of alleles underlying such adaptation. Here, we investigate genetic mechanisms of convergent highland adaptation in maize landrace populations and evaluate the genetic sources of recurrently selected alleles. Population branch excess statistics reveal substantial evidence of parallel adaptation at the level of individual single-nucleotide polymorphism (SNPs), genes, and pathways in four independent highland maize populations. The majority of convergently selected SNPs originated via migration from a single population, most likely in the Mesoamerican highlands, while standing variation introduced by ancient gene flow was also a contributor. Polygenic adaptation analyses of quantitative traits reveal that alleles affecting flowering time are significantly associated with elevation, indicating the flowering time pathway was targeted by highland adaptation. In addition, repeatedly selected genes were significantly enriched in the flowering time pathway, indicating their significance in adapting to highland conditions. Overall, our study system represents a promising model to study convergent evolution in plants with potential applications to crop adaptation across environmental gradients.

Subject(s)

Acclimatization , Altitude , Zea mays , Acclimatization/genetics , Alleles , Phenotype , Zea mays/genetics

8.

SequelTools: a suite of tools for working with PacBio Sequel raw sequence data.

Hufnagel, David E; Hufford, Matthew B; Seetharam, Arun S.

BMC Bioinformatics ; 21(1): 429, 2020 Oct 01.

Article in English | MEDLINE | ID: mdl-33004007

ABSTRACT

BACKGROUND: PacBio sequencing is an incredibly valuable third-generation DNA sequencing method due to very long read lengths, ability to detect methylated bases, and its real-time sequencing methodology. Yet, hitherto no tool was available for analyzing the quality of, subsampling, and filtering PacBio data. RESULTS: Here we present SequelTools, a command-line program containing three tools: Quality Control, Read Subsampling, and Read Filtering. The Quality Control tool quickly processes PacBio Sequel raw sequence data from multiple SMRTcells producing multiple statistics and publication-quality plots describing the quality of the data including N50, read length and count statistics, PSR, and ZOR. The Read Subsampling tool allows the user to subsample reads by one or more of the following criteria: longest subreads per CLR or random CLR selection. The Read Filtering tool provides options for normalizing data by filtering out certain low-quality scraps reads and/or by minimum CLR length. SequelTools is implemented in bash, R, and Python using only standard libraries and packages and is platform independent. CONCLUSIONS: SequelTools is a program that provides the only free, fast, and easy-to-use quality control tool, and the only program providing this kind of read subsampling and read filtering for PacBio Sequel raw sequence data, and is available at https://github.com/ISUgenomics/SequelTools .

Subject(s)

High-Throughput Nucleotide Sequencing/methods , Software , Arabidopsis/genetics , Benchmarking , High-Throughput Nucleotide Sequencing/standards , Quality Control

9.

GenomeQC: a quality assessment tool for genome assemblies and gene structure annotations.

Manchanda, Nancy; Portwood, John L; Woodhouse, Margaret R; Seetharam, Arun S; Lawrence-Dill, Carolyn J; Andorf, Carson M; Hufford, Matthew B.

BMC Genomics ; 21(1): 193, 2020 Mar 02.

Article in English | MEDLINE | ID: mdl-32122303

ABSTRACT

BACKGROUND: Genome assemblies are foundational for understanding the biology of a species. They provide a physical framework for mapping additional sequences, thereby enabling characterization of, for example, genomic diversity and differences in gene expression across individuals and tissue types. Quality metrics for genome assemblies gauge both the completeness and contiguity of an assembly and help provide confidence in downstream biological insights. To compare quality across multiple assemblies, a set of common metrics are typically calculated and then compared to one or more gold standard reference genomes. While several tools exist for calculating individual metrics, applications providing comprehensive evaluations of multiple assembly features are, perhaps surprisingly, lacking. Here, we describe a new toolkit that integrates multiple metrics to characterize both assembly and gene annotation quality in a way that enables comparison across multiple assemblies and assembly types. RESULTS: Our application, named GenomeQC, is an easy-to-use and interactive web framework that integrates various quantitative measures to characterize genome assemblies and annotations. GenomeQC provides researchers with a comprehensive summary of these statistics and allows for benchmarking against gold standard reference assemblies. CONCLUSIONS: The GenomeQC web application is implemented in R/Shiny version 1.5.9 and Python 3.6 and is freely available at https://genomeqc.maizegdb.org/ under the GPL license. All source code and a containerized version of the GenomeQC pipeline is available in the GitHub repository https://github.com/HuffordLab/GenomeQC.

Subject(s)

Genomics/methods , Chromosome Mapping , Computational Biology/methods , High-Throughput Nucleotide Sequencing , Humans , Molecular Sequence Annotation , Sequence Analysis, DNA , Software

10.

Haplotype structure in commercial maize breeding programs in relation to key founder lines.

Coffman, Stephanie M; Hufford, Matthew B; Andorf, Carson M; Lübberstedt, Thomas.

Theor Appl Genet ; 133(2): 547-561, 2020 Feb.

Article in English | MEDLINE | ID: mdl-31749017

ABSTRACT

KEY MESSAGE: High-density haplotype analysis revealed significant haplotype sharing between ex-PVPs registered from 1976 to 1992 and key maize founders, and uncovered similarities and differences in haplotype sharing patterns by company and heterotic group. Proprietary inbreds developed by the private seed industry have been the major source for driving genetic gain in successful North American maize hybrids for decades. Much of the history of industry germplasm can be traced back to key founder lines, some of which were pivotal in the development of prominent heterotic groups. Previous studies have summarized pedigree-based relationships, genetic diversity and population structure among commercial inbreds with expired Plant Variety Protection (ex-PVP). However, less is known about the extent of haplotype sharing between historical founders and ex-PVPs. A better understanding of the relationships between founders and ex-PVPs provides insight into the haplotype and heterotic group structure among industry germplasm. We performed high-density haplotype analysis with 11.3 million SNPs on 212 maize inbreds, which included 157 ex-PVPs registered 1976-1992 and 55 public lines relevant to PVPs. Among these lines were 12 key founders identified in literature review: 207, A632, B14, B37, B73, LH123HT, LH82, Mo17, Oh43, OH7, PHG39 and Wf9. Our results revealed that, on average, 81.6% of an ex-PVP's genome is shared with at least 1 of these 12 founder lines and more than half when limited to B73, Mo17 and 207. Quantifiable similarities and contrasts among heterotic groups and major US seed industry companies were also observed. The results from this study provide high-resolution haplotype data on ex-PVP germplasm, confirm founder relationship trends observed in previous studies, uncover region-specific haplotype structure differences and demonstrate how haplotype sharing analysis can be used as a tool to explore germplasm diversity.

Subject(s)

Crops, Agricultural/genetics , Haplotypes , Plant Breeding/history , Zea mays/genetics , Genetic Variation/genetics , Genotype , History, 20th Century , Hybrid Vigor , Polymorphism, Single Nucleotide

11.

Evidence for a Unique DNA-Dependent RNA Polymerase in Cereal Crops.

Trujillo, Joshua T; Seetharam, Arun S; Hufford, Matthew B; Beilstein, Mark A; Mosher, Rebecca A.

Mol Biol Evol ; 35(10): 2454-2462, 2018 10 01.

Article in English | MEDLINE | ID: mdl-30053133

ABSTRACT

Gene duplication is an important driver for the evolution of new genes and protein functions. Duplication of DNA-dependent RNA polymerase (Pol) II subunits within plants led to the emergence of RNA Pol IV and V complexes, each of which possess unique functions necessary for RNA-directed DNA Methylation. Comprehensive identification of Pol V subunit orthologs across the monocot radiation revealed a duplication of the largest two subunits within the grasses (Poaceae), including critical cereal crops. These paralogous Pol subunits display sequence conservation within catalytic domains, but their carboxy terminal domains differ in length and character of the Ago-binding platform, suggesting unique functional interactions. Phylogenetic analysis of the catalytic region indicates positive selection on one paralog following duplication, consistent with retention via neofunctionalization. Positive selection on residue pairs that are predicted to interact between subunits suggests that paralogous subunits have evolved specific assembly partners. Additional Pol subunits as well as Pol-interacting proteins also possess grass-specific paralogs, supporting the hypothesis that a novel Pol complex with distinct function has evolved in the grass family, Poaceae.

Subject(s)

DNA-Directed RNA Polymerases/genetics , Edible Grain/enzymology , Gene Duplication , Poaceae/enzymology , Selection, Genetic , Amino Acid Sequence , Edible Grain/genetics , Grain Proteins , Phylogeny , Poaceae/genetics

12.

The extent of adaptive wild introgression in crops.

Janzen, Garrett M; Wang, Li; Hufford, Matthew B.

New Phytol ; 221(3): 1279-1288, 2019 02.

Article in English | MEDLINE | ID: mdl-30368812

ABSTRACT

The study of crop evolution has focused primarily on the process of initial domestication. Post-domestication adaptation during the expansion of crops from their centers of origin has received considerably less attention. Recent research has revealed that, in at least some instances, crops have received introgression from their wild relatives that has facilitated adaptation to novel conditions encountered during expansion. Such adaptive introgression could have an important impact on the basic study of domestication, affecting estimates of several evolutionary processes of interest (e.g. the strength of the domestication bottleneck, the timing of domestication, the targets of selection during domestication). Identification of haplotypes introgressed from the wild may also help in the identification of alleles that are beneficial under particular environmental conditions. Here we review mounting evidence for substantial adaptive wild introgression in several crops and consider the implications of such gene flow to our understanding of crop histories.

Subject(s)

Adaptation, Physiological , Crops, Agricultural/physiology , Inbreeding , Crops, Agricultural/genetics , Domestication , Genetic Variation , Selection, Genetic

13.

Climate structures genetic variation across a species' elevation range: a test of range limits hypotheses.

Sexton, Jason P; Hufford, Matthew B; Bateman, Ashley C; Lowry, David B; Meimberg, Harald; Strauss, Sharon Y; Rice, Kevin J.

Mol Ecol ; 25(4): 911-28, 2016 02.

Article in English | MEDLINE | ID: mdl-26756973

ABSTRACT

Gene flow may influence the formation of species range limits, and yet little is known about the patterns of gene flow with respect to environmental gradients or proximity to range limits. With rapid environmental change, it is especially important to understand patterns of gene flow to inform conservation efforts. Here we investigate the species range of the selfing, annual plant, Mimulus laciniatus, in the California Sierra Nevada. We assessed genetic variation, gene flow, and population abundance across the entire elevation-based climate range. Contrary to expectations, within-population plant density increased towards both climate limits. Mean genetic diversity of edge populations was equivalent to central populations; however, all edge populations exhibited less genetic diversity than neighbouring interior populations. Genetic differentiation was fairly consistent and moderate among all populations, and no directional signals of contemporary gene flow were detected between central and peripheral elevations. Elevation-driven gene flow (isolation by environment), but not isolation by distance, was found across the species range. These findings were the same towards high- and low-elevation range limits and were inconsistent with two common centre-edge hypotheses invoked for the formation of species range limits: (i) decreasing habitat quality and population size; (ii) swamping gene flow from large, central populations. This pattern demonstrates that climate, but not centre-edge dynamics, is an important range-wide factor structuring M. laciniatus populations. To our knowledge, this is the first empirical study to relate environmental patterns of gene flow to range limits hypotheses. Similar investigations across a wide variety of taxa and life histories are needed.

Subject(s)

Climate , Gene Flow , Genetic Variation , Genetics, Population , Mimulus/genetics , California , DNA, Plant/genetics , Models, Genetic , Population Density , Sequence Analysis, DNA

14.

The genomic signature of crop-wild introgression in maize.

Hufford, Matthew B; Lubinksy, Pesach; Pyhäjärvi, Tanja; Devengenzo, Michael T; Ellstrand, Norman C; Ross-Ibarra, Jeffrey.

PLoS Genet ; 9(5): e1003477, 2013 May.

Article in English | MEDLINE | ID: mdl-23671421

ABSTRACT

The evolutionary significance of hybridization and subsequent introgression has long been appreciated, but evaluation of the genome-wide effects of these phenomena has only recently become possible. Crop-wild study systems represent ideal opportunities to examine evolution through hybridization. For example, maize and the conspecific wild teosinte Zea mays ssp. mexicana (hereafter, mexicana) are known to hybridize in the fields of highland Mexico. Despite widespread evidence of gene flow, maize and mexicana maintain distinct morphologies and have done so in sympatry for thousands of years. Neither the genomic extent nor the evolutionary importance of introgression between these taxa is understood. In this study we assessed patterns of genome-wide introgression based on 39,029 single nucleotide polymorphisms genotyped in 189 individuals from nine sympatric maize-mexicana populations and reference allopatric populations. While portions of the maize and mexicana genomes appeared resistant to introgression (notably near known cross-incompatibility and domestication loci), we detected widespread evidence for introgression in both directions of gene flow. Through further characterization of these genomic regions and preliminary growth chamber experiments, we found evidence suggestive of the incorporation of adaptive mexicana alleles into maize during its expansion to the highlands of central Mexico. In contrast, very little evidence was found for adaptive introgression from maize to mexicana. The methods we have applied here can be replicated widely, and such analyses have the potential to greatly inform our understanding of evolution through introgressive hybridization. Crop species, due to their exceptional genomic resources and frequent histories of spread into sympatry with relatives, should be particularly influential in these studies.

Subject(s)

Biological Evolution , Gene Flow , Genome-Wide Association Study , Zea mays/genetics , Alleles , Genetics, Population , Genome, Plant , Humans , Hybridization, Genetic , Mexico , Polymorphism, Single Nucleotide , Species Specificity , Sympatry/genetics

15.

Teosinte as a model system for population and ecological genomics.

Hufford, Matthew B; Bilinski, Paul; Pyhäjärvi, Tanja; Ross-Ibarra, Jeffrey.

Trends Genet ; 28(12): 606-15, 2012 Dec.

Article in English | MEDLINE | ID: mdl-23021022

ABSTRACT

As the cost of next-generation sequencing diminishes and genomic resources improve, crop wild relatives are well positioned to make major contributions to the field of ecological genomics via full-genome resequencing and reference-assisted de novo assembly of genomes of plants from natural populations. The wild relatives of maize, collectively known as teosinte, are a more varied and representative study system than many other model flowering plants. In this review of the population and ecological genomics of the teosintes we highlight recent advances in the study of maize domestication, introgressive hybridization, and local adaptation, and discuss future prospects for applying the genomic resources of maize to this intriguing group of species. The maize/teosinte study system is an excellent example of how crops and their wild relatives can bridge the model/non-model gap.

Subject(s)

Ecology , Genetics, Population , Zea mays/genetics , Adaptation, Physiological/genetics , Central America , Crops, Agricultural/genetics , Genome, Plant , Hybridization, Genetic , Phylogeography , Zea mays/classification

16.

Historical genomics of North American maize.

van Heerwaarden, Joost; Hufford, Matthew B; Ross-Ibarra, Jeffrey.

Proc Natl Acad Sci U S A ; 109(31): 12420-5, 2012 Jul 31.

Article in English | MEDLINE | ID: mdl-22802642

ABSTRACT

Since the advent of modern plant breeding in the 1930s, North American maize has undergone a dramatic adaptation to high-input agriculture. Despite the importance of genetic contributions to historical yield increases, little is known about the underlying genomic changes. Here we use high-density SNP genotyping to characterize a set of North American maize lines spanning the history of modern breeding. We provide a unique analysis of genome-wide developments in genetic diversity, ancestry, and selection. The genomic history of maize is marked by a steady increase in genetic differentiation and linkage disequilibrium, whereas allele frequencies in the total population have remained relatively constant. These changes are associated with increasing genetic separation of breeding pools and decreased diversity in the ancestry of individual lines. We confirm that modern heterotic groups are the product of ongoing divergence from a relatively homogeneous landrace population, but show that differential landrace ancestry remains evident. Using a recent association approach, we characterize signals of directional selection throughout the genome, identifying a number of candidate genes of potential agronomic relevance. However, overall we find that selection has had limited impact on genome-wide patterns of diversity and ancestry, with little evidence for individual lines contributing disproportionately to the accumulation of favorable alleles in today's elite germplasm. Our data suggest breeding progress has mainly involved selection and recombination of relatively common alleles, contributed by a representative but limited set of ancestral lines.

Subject(s)

Alleles , Genome, Plant/physiology , Linkage Disequilibrium , Polymorphism, Single Nucleotide , Zea mays/genetics , Breeding , Genome-Wide Association Study/methods , Genomics/methods , North America

17.

Reshaping of the maize transcriptome by domestication.

Swanson-Wagner, Ruth; Briskine, Roman; Schaefer, Robert; Hufford, Matthew B; Ross-Ibarra, Jeffrey; Myers, Chad L; Tiffin, Peter; Springer, Nathan M.

Proc Natl Acad Sci U S A ; 109(29): 11878-83, 2012 Jul 17.

Article in English | MEDLINE | ID: mdl-22753482

ABSTRACT

Through domestication, humans have substantially altered the morphology of Zea mays ssp. parviglumis (teosinte) into the currently recognizable maize. This system serves as a model for studying adaptation, genome evolution, and the genetics and evolution of complex traits. To examine how domestication has reshaped the transcriptome of maize seedlings, we used expression profiling of 18,242 genes for 38 diverse maize genotypes and 24 teosinte genotypes. We detected evidence for more than 600 genes having significantly different expression levels in maize compared with teosinte. Moreover, more than 1,100 genes showed significantly altered coexpression profiles, reflective of substantial rewiring of the transcriptome since domestication. The genes with altered expression show a significant enrichment for genes previously identified through population genetic analyses as likely targets of selection during maize domestication and improvement; 46 genes previously identified as putative targets of selection also exhibit altered expression levels and coexpression relationships. We also identified 45 genes with altered, primarily higher, expression in inbred relative to outcrossed teosinte. These genes are enriched for functions related to biotic stress and may reflect responses to the effects of inbreeding. This study not only documents alterations in the maize transcriptome following domestication, identifying several genes that may have contributed to the evolution of maize, but highlights the complementary information that can be gained by combining gene expression with population genetic analyses.

Subject(s)

Crops, Agricultural/genetics , Evolution, Molecular , Gene Regulatory Networks/genetics , Selection, Genetic/genetics , Transcriptome/genetics , Zea mays/genetics , Gene Expression Profiling , Genes, Plant/genetics , Genetics, Population , Genotype , Microarray Analysis , Molecular Sequence Annotation

18.

The Ga1 locus of the genus Zea is associated with novel genome structures derived from multiple, independent nonhomologous recombination events.

Bapat, Amruta R; Moran Lauter, Adrienne N; Hufford, Matthew B; Boerman, Nicholas A; Scott, M Paul.

G3 (Bethesda) ; 13(11)2023 11 01.

Article in English | MEDLINE | ID: mdl-37652030

ABSTRACT

The Ga1 locus controls cross-incompatibility between field corn and popcorn. The Ga1-S haplotype contains 2 types of pectin methylesterase (PME) genes, ZmPme3 and several copies of ZmGa1P that are expressed in silk and pollen, respectively. The ga1 haplotype contains nonfunctional tandem repeat sequences related to ZmPme3 and ZmGa1P. This haplotype can cross-pollinate freely and is widely present in field corn. The primary objective of this study is to characterize the repeat sequences from a diverse collection of maize and teosinte lines and use this information to understand the evolution of the Ga1 locus. First, we characterized the complexity of the Ga1 genome region in high-quality maize genome assemblies that led to their categorization into 5 groups based on the number and type of PME-like sequences found at this region. Second, we studied duplication events that led to the ga1 and Ga1-S repeats using maximum likelihood phylogenetic reconstruction. Divergence estimates of the ga1 haplotype suggest that the duplication events occurred more than 600 KYA whereas those in Ga1-S occurred at 3 time points, i.e. >600, â¼260, and â¼100 KYA. These estimates suggest that the ga1 and Ga1-S tandem duplication events occurred independently. Finally, analysis of ZmPme3 and ZmGa1P homologs in Zea and Tripsacum genomes suggests that ga1 and Ga1-S repeats originated from an ancestral pair of PME genes that duplicated and diverged through 2 evolutionary branches prior to the domestication of maize.

Subject(s)

Poaceae , Zea mays , Zea mays/genetics , Phylogeny , Poaceae/genetics , Tandem Repeat Sequences , Recombination, Genetic

19.

A complete telomere-to-telomere assembly of the maize genome.

Chen, Jian; Wang, Zijian; Tan, Kaiwen; Huang, Wei; Shi, Junpeng; Li, Tong; Hu, Jiang; Wang, Kai; Wang, Chao; Xin, Beibei; Zhao, Haiming; Song, Weibin; Hufford, Matthew B; Schnable, James C; Jin, Weiwei; Lai, Jinsheng.

Nat Genet ; 55(7): 1221-1231, 2023 07.

Article in English | MEDLINE | ID: mdl-37322109

ABSTRACT

A complete telomere-to-telomere (T2T) finished genome has been the long pursuit of genomic research. Through generating deep coverage ultralong Oxford Nanopore Technology (ONT) and PacBio HiFi reads, we report here a complete genome assembly of maize with each chromosome entirely traversed in a single contig. The 2,178.6 Mb T2T Mo17 genome with a base accuracy of over 99.99% unveiled the structural features of all repetitive regions of the genome. There were several super-long simple-sequence-repeat arrays having consecutive thymine-adenine-guanine (TAG) tri-nucleotide repeats up to 235 kb. The assembly of the entire nucleolar organizer region of the 26.8 Mb array with 2,974 45S rDNA copies revealed the enormously complex patterns of rDNA duplications and transposon insertions. Additionally, complete assemblies of all ten centromeres enabled us to precisely dissect the repeat compositions of both CentC-rich and CentC-poor centromeres. The complete Mo17 genome represents a major step forward in understanding the complexity of the highly recalcitrant repetitive regions of higher plant genomes.

Subject(s)

Genomics , Zea mays , Zea mays/genetics , Repetitive Sequences, Nucleic Acid/genetics , Genome, Plant , Telomere/genetics , Sequence Analysis, DNA , High-Throughput Nucleotide Sequencing

20.

A happy accident: a novel turfgrass reference genome.

Phillips, Alyssa R; Seetharam, Arun S; Albert, Patrice S; AuBuchon-Elder, Taylor; Birchler, James A; Buckler, Edward S; Gillespie, Lynn J; Hufford, Matthew B; Llaca, Victor; Romay, Maria Cinta; Soreng, Robert J; Kellogg, Elizabeth A; Ross-Ibarra, Jeffrey.

G3 (Bethesda) ; 13(6)2023 06 01.

Article in English | MEDLINE | ID: mdl-37002915

ABSTRACT

Poa pratensis, commonly known as Kentucky bluegrass, is a popular cool-season grass species used as turf in lawns and recreation areas globally. Despite its substantial economic value, a reference genome had not previously been assembled due to the genome's relatively large size and biological complexity that includes apomixis, polyploidy, and interspecific hybridization. We report here a fortuitous de novo assembly and annotation of a P. pratensis genome. Instead of sequencing the genome of a C4 grass, we accidentally sampled and sequenced tissue from a weedy P. pratensis whose stolon was intertwined with that of the C4 grass. The draft assembly consists of 6.09 Gbp with an N50 scaffold length of 65.1 Mbp, and a total of 118 scaffolds, generated using PacBio long reads and Bionano optical map technology. We annotated 256K gene models and found 58% of the genome to be composed of transposable elements. To demonstrate the applicability of the reference genome, we evaluated population structure and estimated genetic diversity in P. pratensis collected from three North American prairies, two in Manitoba, Canada and one in Colorado, USA. Our results support previous studies that found high genetic diversity and population structure within the species. The reference genome and annotation will be an important resource for turfgrass breeding and study of bluegrasses.

Subject(s)

Plant Breeding , Poa , Genome , Poa/genetics , Plant Weeds/genetics , Base Sequence , Molecular Sequence Annotation

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL