RESUMEN
Association mapping panels represent foundational resources for understanding the genetic basis of phenotypic diversity and serve to advance plant breeding by exploring genetic variation across diverse accessions. We report the whole-genome sequencing (WGS) of 400 sorghum (Sorghum bicolor (L.) Moench) accessions from the Sorghum Association Panel (SAP) at an average coverage of 38× (25-72×), enabling the development of a high-density genomic marker set of 43 983 694 variants including single-nucleotide polymorphisms (approximately 38 million), insertions/deletions (indels) (approximately 5 million), and copy number variants (CNVs) (approximately 170 000). We observe slightly more deletions among indels and a much higher prevalence of deletions among CNVs compared to insertions. This new marker set enabled the identification of several novel putative genomic associations for plant height and tannin content, which were not identified when using previous lower-density marker sets. WGS identified and scored variants in 5-kb bins where available genotyping-by-sequencing (GBS) data captured no variants, with half of all bins in the genome falling into this category. The predictive ability of genomic best unbiased linear predictor (GBLUP) models was increased by an average of 30% by using WGS markers rather than GBS markers. We identified 18 selection peaks across subpopulations that formed due to evolutionary divergence during domestication, and we found six Fst peaks resulting from comparisons between converted lines and breeding lines within the SAP that were distinct from the peaks associated with historic selection. This population has served and continues to serve as a significant public resource for sorghum research and demonstrates the value of improving upon existing genomic resources.
Asunto(s)
Sorghum , Grano Comestible/genética , Genoma , Estudio de Asociación del Genoma Completo , Genómica/métodos , Fitomejoramiento/métodos , Polimorfismo de Nucleótido Simple/genética , Sorghum/genéticaRESUMEN
With the recent development of genomic resources and high-throughput phenotyping platforms, the 21st century is primed for major breakthroughs in the discovery, understanding and utilization of plant genetic variation. Significant advances in agriculture remain at the forefront to increase crop production and quality to satisfy the global food demand in a changing climate all while reducing the environmental impacts of the world's food production. Sorghum, a resilient C4 grain and grass important for food and energy production, is being extensively dissected genetically and phenomically to help connect the relationship between genetic and phenotypic variation. Unlike genetically modified crops such as corn or soybean, sorghum improvement has relied heavily on public research; thus, many of the genetic resources serve a dual purpose for both academic and commercial pursuits. Genetic and genomic resources not only provide the foundation to identify and understand the genes underlying variation, but also serve as novel sources of genetic and phenotypic diversity in plant breeding programs. To better disseminate the collective information of this community, we discuss: (i) the genomic resources of sorghum that are at the disposal of the research community; (ii) the suite of sorghum traits as potential targets for increasing productivity in contrasting environments; and (iii) the prospective approaches and technologies that will help to dissect the genotype-phenotype relationship as well as those that will apply foundational knowledge for sorghum improvement.
Asunto(s)
Estudios de Asociación Genética , Genoma de Planta/genética , Genómica , Sorghum/genética , Agricultura , Productos Agrícolas , Ambiente , Genotipo , Fenotipo , FitomejoramientoRESUMEN
BACKGROUND: The process of crop domestication often consists of two stages: initial domestication, where the wild species is first cultivated by humans, followed by diversification, when the domesticated species are subsequently adapted to more environments and specialized uses. Selective pressure to increase sugar accumulation in certain varieties of the cereal crop Sorghum bicolor is an excellent example of the latter; this has resulted in pronounced phenotypic divergence between sweet and grain-type sorghums, but the genetic mechanisms underlying these differences remain poorly understood. RESULTS: Here we present a new reference genome based on an archetypal sweet sorghum line and compare it to the current grain sorghum reference, revealing a high rate of nonsynonymous and potential loss of function mutations, but few changes in gene content or overall genome structure. We also use comparative transcriptomics to highlight changes in gene expression correlated with high stalk sugar content and show that changes in the activity and possibly localization of transporters, along with the timing of sugar metabolism play a critical role in the sweet phenotype. CONCLUSIONS: The high level of genomic similarity between sweet and grain sorghum reflects their historical relatedness, rather than their current phenotypic differences, but we find key changes in signaling molecules and transcriptional regulators that represent new candidates for understanding and improving sugar metabolism in this important crop.
Asunto(s)
Genoma de Planta , Sorghum/genética , Azúcares/metabolismo , ADN de Plantas/química , Perfilación de la Expresión Génica , Genómica/normas , Genotipo , Estándares de Referencia , Homología de Secuencia de Ácido Nucleico , Sorghum/metabolismoRESUMEN
BACKGROUND: Sorghum [Sorghum bicolor (L.) Moench] is an important cereal crop for dryland areas in the United States and for small-holder farmers in Africa. Natural variation of sorghum grain composition (protein, fat, and starch) between accessions can be used for crop improvement, but the genetic controls are still unresolved. The goals of this study were to quantify natural variation of sorghum grain composition and to identify single-nucleotide polymorphisms (SNPs) associated with variation in grain composition concentrations. RESULTS: In this study, we quantified protein, fat, and starch in a global sorghum diversity panel using near-infrared spectroscopy (NIRS). Protein content ranged from 8.1 to 18.8%, fat content ranged from 1.0 to 4.3%, and starch content ranged from 61.7 to 71.1%. Durra and bicolor-durra sorghum from Ethiopia and India had the highest protein and fat and the lowest starch content, while kafir sorghum from USA, India, and South Africa had the lowest protein and the highest starch content. Genome-wide association studies (GWAS) identified quantitative trait loci (QTL) for sorghum protein, fat, and starch. Previously published RNAseq data was used to identify candidate genes within a GWAS QTL region. A putative alpha-amylase 3 gene, which has previously been shown to be associated with grain composition traits, was identified as a strong candidate for protein and fat variation. CONCLUSIONS: We identified promising sources of genetic material for manipulation of grain composition traits, and several loci and candidate genes that may control sorghum grain composition. This survey of grain composition in sorghum germplasm and identification of protein, fat, and starch QTL contributes to our understanding of the genetic basis of natural variation in sorghum grain nutritional traits.
Asunto(s)
Estudios de Asociación Genética , Carácter Cuantitativo Heredable , Semillas/química , Semillas/genética , Sorghum/genética , Grano Comestible/química , Grano Comestible/genética , Estudio de Asociación del Genoma Completo , Genómica/métodos , Fenotipo , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Sorghum/química , Almidón/químicaRESUMEN
Seedling establishment and seed nutritional quality require the sequestration of sufficient element nutrients. The identification of genes and alleles that modify element content in the grains of cereals, including sorghum (Sorghum bicolor), is fundamental to developing breeding and selection methods aimed at increasing bioavailable element content and improving crop growth. We have developed a high-throughput work flow for the simultaneous measurement of multiple elements in sorghum seeds. We measured seed element levels in the genotyped Sorghum Association Panel, representing all major cultivated sorghum races from diverse geographic and climatic regions, and mapped alleles contributing to seed element variation across three environments by genome-wide association. We observed significant phenotypic and genetic correlation between several elements across multiple years and diverse environments. The power of combining high-precision measurements with genome-wide association was demonstrated by implementing rank transformation and a multilocus mixed model to map alleles controlling 20 element traits, identifying 255 loci affecting the sorghum seed ionome. Sequence similarity to genes characterized in previous studies identified likely causative genes for the accumulation of zinc, manganese, nickel, calcium, and cadmium in sorghum seeds. In addition to strong candidates for these five elements, we provide a list of candidate loci for several other elements. Our approach enabled the identification of single-nucleotide polymorphisms in strong linkage disequilibrium with causative polymorphisms that can be evaluated in targeted selection strategies for plant breeding and improvement.
Asunto(s)
Ambiente , Variación Genética , Semillas/genética , Sorghum/genética , Estudio de Asociación del Genoma Completo , Patrón de Herencia/genética , Modelos Biológicos , Fenotipo , Polimorfismo de Nucleótido Simple/genética , Carácter Cuantitativo HeredableRESUMEN
KEY MESSAGE: Coordinated association and linkage mapping identified 25 grain quality QTLs in multiple environments, and fine mapping of the Wx locus supports the use of high-density genetic markers in linkage mapping. There is a wide range of end-use products made from cereal grains, and these products often demand different grain characteristics. Fortunately, cereal crop species including sorghum [Sorghum bicolor (L.) Moench] contain high phenotypic variation for traits influencing grain quality. Identifying genetic variants underlying this phenotypic variation allows plant breeders to develop genotypes with grain attributes optimized for their intended usage. Multiple sorghum mapping populations were rigorously phenotyped across two environments (SC Coastal Plain and Central TX) in 2 years for five major grain quality traits: amylose, starch, crude protein, crude fat, and gross energy. Coordinated association and linkage mapping revealed several robust QTLs that make prime targets to improve grain quality for food, feed, and fuel products. Although the amylose QTL interval spanned many megabases, the marker with greatest significance was located just 12 kb from waxy (Wx), the primary gene regulating amylose production in cereal grains. This suggests higher resolution mapping in recombinant inbred line (RIL) populations can be obtained when genotyped at a high marker density. The major QTL for crude fat content, identified in both a RIL population and grain sorghum diversity panel, encompassed the DGAT1 locus, a critical gene involved in maize lipid biosynthesis. Another QTL on chromosome 1 was consistently mapped in both RIL populations for multiple grain quality traits including starch, crude protein, and gross energy. Collectively, these genetic regions offer excellent opportunities to manipulate grain composition and set up future studies for gene validation.
Asunto(s)
Mapeo Cromosómico , Marcadores Genéticos , Sitios de Carácter Cuantitativo , Sorghum/genética , Amilosa/química , Grano Comestible/química , Grano Comestible/genética , Grasas/química , Estudios de Asociación Genética , Ligamiento Genético , Genética de Población , Genotipo , Valor Nutritivo , Fenotipo , Proteínas de Plantas/química , Sorghum/química , Almidón/química , TexasRESUMEN
Molecular characterization of diverse germplasm can contribute to breeding programs by increasing genetic gain for sorghum [Sorghum bicolor (L.) Moench] improvement. Identifying novel marker-trait associations and candidate genes enriches the existing genomic resources and can improve bioenergy-related traits using genomic-assisted breeding. In the current scenario, identifying the genetic loci underlying biomass and carbon partitioning is vital for ongoing efforts to maximize each carbon sink's yield for bioenergy production. Here, we have processed a high-density genomic marker (22 466 550) data based on whole-genome sequencing (WGS) using a set of 365 accessions from the bioenergy association panel (BAP), which includes ~19.7 million (19 744 726) single nucleotide polymorphism (SNPs) and 2.7 million (~2 721 824) insertion deletions (indels). A set of high-quality filtered SNP (~5.48 million) derived markers facilitated the assessment of population structure, genetic diversity, and genome-wide association studies (GWAS) for various traits related to biomass and its composition using the BAP. The phenotypic traits for GWAS included seed color (SC), plant height (PH), days to harvest (DTH), fresh weight (FW), dry weight (DW), brix content % (BRX), neutral detergent fiber (NDF), acid detergent fiber (ADF), non-fibrous carbohydrate (NFC), and lignin content. Several novel loci and candidate genes were identified for bioenergy-related traits, and some well-characterized genes for plant height (Dw1 and Dw2) and the YELLOW SEED1 locus (Y1) were validated. We further performed a multi-variate adaptive shrinkage analysis to identify pleiotropic QTL, which resulted in several shared marker-trait associations among bioenergy and compositional traits. Significant marker-trait associations with pleiotropic effects can be used to develop molecular markers for trait improvement using a marker-assisted breeding approach. Significant nucleotide diversity and heterozygosity were observed between photoperiod-sensitive and insensitive individuals of the panel. This diverse bioenergy panel with genomic resources will provide an excellent opportunity for further genetic studies, including selecting parental lines for superior hybrid development to improve biomass-related traits in sorghum.
RESUMEN
Hybrid breeding in sorghum [Sorghum bicolor (L.) Moench] utilizes the cytoplasmic-nuclear male sterility (CMS) system for seed production and subsequently harnesses heterosis. Since the cost of developing and evaluating inbred and hybrid lines in the CMS system is costly and time-consuming, genomic prediction of parental lines and hybrids is based on genetic data genotype. We generated 602 hybrids by crossing two female (A) lines with 301 diverse and elite male (R) lines from the sorghum association panel and collected phenotypic data for agronomic traits over two years. We genotyped the inbred parents using whole genome resequencing and used 2,687,342 high quality (minor allele frequency > 2%) single nucleotide polymorphisms for genomic prediction. For grain yield, the experimental hybrids exhibited an average mid-parent heterosis of 40%. Genomic best linear unbiased prediction (GBLUP) for hybrid performance yielded an average prediction accuracy of 0.76-0.93 under the prediction scenario where both parental lines in validation sets were included in the training sets (T2). However, when only female tester was shared between training and validation sets (T1F), prediction accuracies declined by 12-90%, with plant height showing the greatest decline. Mean accuracies for predicting the general combining ability of male parents ranged from 0.33 to 0.62 for all traits. Our results showed hybrid performance for agronomic traits can be predicted with high accuracy, and optimizing genomic relationship is essential for optimal training population design for genomic selection in sorghum breeding.
Asunto(s)
Sorghum , Sorghum/genética , Hibridación Genética , Fitomejoramiento/métodos , Fenotipo , Genotipo , Grano Comestible , Vigor Híbrido/genética , Genómica/métodos , Polimorfismo de Nucleótido Simple , Modelos GenéticosRESUMEN
Multiparent advanced eneration inter-cross (MAGIC) populations improve the precision of quantitative trait loci (QTL) mapping over biparental populations by incorporating increased diversity and opportunities to reduce linkage disequilibrium among variants. Here, we describe the development of a MAGIC B-Line (MBL) population from an inter-cross among 4 diverse founders of grain sorghum [Sorghum bicolor (L.) Moench] across different races (kafir, guinea, durra, and caudatum). These founders were selected based on genetic uniqueness and several distinct qualitative features including panicle architecture, plant color, seed color, endosperm texture, and awns. A whole set of MBL (708 F6) recombinant inbred lines along with their founders were genotyped using Diversity Arrays Technology (DArTseq) and 5,683 single-nucleotide polymorphisms (SNPs) were generated. A genetic linkage map was constructed using a set of polymorphic, quality-filtered markers (2,728 SNPs) for QTL interval-mapping. For population validation, 3 traits (seed color, plant color, and awns) were used for QTL mapping and genome-wide association study (GWAS). QTL mapping and GWAS identified 4 major genomic regions located across 3 chromosomes (Chr1, Chr3, and Chr6) that correspond to known genetic loci for the targeted traits. Founders of this population consist of the fertility maintainer (A/B line) gene pool and derived MBL lines could serve as female/seed parents in the cytoplasmic male sterility breeding system. The MBL population will serve as a unique genetic and genomic resource to better characterize the genetics of complex traits and potentially identify superior alleles for crop improvement efforts to enrich the seed parent gene pool.
Asunto(s)
Sorghum , Sorghum/genética , Estudio de Asociación del Genoma Completo , Pool de Genes , Fitomejoramiento , Fenotipo , Grano Comestible/genética , Semillas/genética , Polimorfismo de Nucleótido SimpleRESUMEN
Multi-parent populations contain valuable genetic material for dissecting complex, quantitative traits and provide a unique opportunity to capture multi-allelic variation compared to the biparental populations. A multi-parent advanced generation inter-cross (MAGIC) B-line (MBL) population composed of 708 F6 recombinant inbred lines (RILs), was recently developed from four diverse founders. These selected founders strategically represented the four most prevalent botanical races (kafir, guinea, durra, and caudatum) to capture a significant source of genetic variation to study the quantitative traits in grain sorghum [Sorghum bicolor (L.) Moench]. MBL was phenotyped at two field locations for seven yield-influencing traits: panicle type (PT), days to anthesis (DTA), plant height (PH), grain yield (GY), 1000-grain weight (TGW), tiller number per meter (TN) and yield per panicle (YPP). High phenotypic variation was observed for all the quantitative traits, with broad-sense heritabilities ranging from 0.34 (TN) to 0.84 (PH). The entire population was genotyped using Diversity Arrays Technology (DArTseq), and 8,800 single nucleotide polymorphisms (SNPs) were generated. A set of polymorphic, quality-filtered markers (3,751 SNPs) and phenotypic data were used for genome-wide association studies (GWAS). We identified 52 marker-trait associations (MTAs) for the seven traits using BLUPs generated from replicated plots in two locations. We also identified desirable allelic combinations based on the plant height loci (Dw1, Dw2, and Dw3), which influences yield related traits. Additionally, two novel MTAs were identified each on Chr1 and Chr7 for yield traits independent of dwarfing genes. We further performed a multi-variate adaptive shrinkage analysis and 15 MTAs with pleiotropic effect were identified. The five best performing MBL progenies were selected carrying desirable allelic combinations. Since the MBL population was designed to capture significant diversity for maintainer line (B-line) accessions, these progenies can serve as valuable resources to develop superior sorghum hybrids after validation of their general combining abilities via crossing with elite pollinators. Further, newly identified desirable allelic combinations can be used to enrich the maintainer germplasm lines through marker-assisted backcross breeding.
RESUMEN
Introduction: Sorghum (Sorghum bicolor (L.) Moench) is an agriculturally and economically important staple crop that has immense potential as a bioenergy feedstock due to its relatively high productivity on marginal lands. To capitalize on and further improve sorghum as a potential source of sustainable biofuel, it is essential to understand the genomic mechanisms underlying complex traits related to yield, composition, and environmental adaptations. Methods: Expanding on a recently developed mapping population, we generated de novo genome assemblies for 10 parental genotypes from this population and identified a comprehensive set of over 24 thousand large structural variants (SVs) and over 10.5 million single nucleotide polymorphisms (SNPs). Results: We show that SVs and nonsynonymous SNPs are enriched in different gene categories, emphasizing the need for long read sequencing in crop species to identify novel variation. Furthermore, we highlight SVs and SNPs occurring in genes and pathways with known associations to critical bioenergy-related phenotypes and characterize the landscape of genetic differences between sweet and cellulosic genotypes. Discussion: These resources can be integrated into both ongoing and future mapping and trait discovery for sorghum and its myriad uses including food, feed, bioenergy, and increasingly as a carbon dioxide removal mechanism.
RESUMEN
Genomic structural mutations, especially deletions, are an important source of variation in many species and can play key roles in phenotypic diversification and evolution. Previous work in many plant species has identified multiple instances of structural variations (SVs) occurring in or near genes related to stress response and disease resistance, suggesting a possible role for SVs in local adaptation. Sorghum [Sorghum bicolor (L.) Moench] is one of the most widely grown cereal crops in the world. It has been adapted to an array of different climates as well as bred for multiple purposes, resulting in a striking phenotypic diversity. In this study, we identified genome-wide SVs in the Biomass Association Panel, a collection of 347 diverse sorghum genotypes collected from multiple countries and continents. Using Illumina-based, short-read whole-genome resequencing data from every genotype, we found a total of 24,648 SVs, including 22,359 deletions. The global site frequency spectrum of deletions and other types of SVs fit a model of neutral evolution, suggesting that the majority of these mutations were not under any types of selection. Clustering results based on single nucleotide polymorphisms separated the genotypes into eight clusters which largely corresponded with geographic origins, with many of the large deletions we uncovered being unique to a single cluster. Even though most deletions appeared to be neutral, a handful of cluster-specific deletions were found in genes related to biotic and abiotic stress responses, supporting the possibility that at least some of these deletions contribute to local adaptation in sorghum.
Asunto(s)
Sorghum , Sorghum/genética , Fitomejoramiento , Genotipo , Genómica/métodos , Genoma de PlantaRESUMEN
Sorghum bicolor, a photosynthetically efficient C4 grass, represents an important source of grain, forage, fermentable sugars, and cellulosic fibers that can be utilized in myriad applications ranging from bioenergy to bioindustrial feedstocks. Sorghum's efficient fixation of carbon per unit time per unit area per unit input has led to its classification as a preferred biomass crop highlighted by its designation as an advanced biofuel by the U.S. Department of Energy. Due to its extensive genetic diversity and worldwide colonization, sorghum has considerable diversity for a range of phenotypes influencing productivity, composition, and sink/source dynamics. To dissect the genetic basis of these key traits, we present a sorghum carbon-partitioning nested association mapping (NAM) population generated by crossing 11 diverse founder lines with Grassl as the single recurrent female. By exploiting existing variation among cellulosic, forage, sweet, and grain sorghum carbon partitioning regimes, the sorghum carbon-partitioning NAM population will allow the identification of important biomass-associated traits, elucidate the genetic architecture underlying carbon partitioning and improve our understanding of the genetic determinants affecting unique phenotypes within Poaceae. We contrast this NAM population with an existing grain population generated using Tx430 as the recurrent female. Genotypic data are assessed for quality by examining variant density, nucleotide diversity, linkage decay, and are validated using pericarp and testa phenotypes to map known genes affecting these phenotypes. We release the 11-family NAM population along with corresponding genomic data for use in genetic, genomic, and agronomic studies with a focus on carbon-partitioning regimes.
Asunto(s)
Sorghum , Carbono , Ligamiento Genético , Genotipo , Fenotipo , Polimorfismo de Nucleótido Simple , Sorghum/genéticaRESUMEN
Simple sugars are the essential foundation to plant life, and thus, their production, utilization, and storage are highly regulated processes with many complex genetic controls. Despite their importance, many of the genetic and biochemical mechanisms remain unknown or uncharacterized. Sorghum, a highly productive, diverse C4 grass important for both industrial and subsistence agricultural systems, has considerable phenotypic diversity in the accumulation of nonstructural sugars in the stem. We use this crop species to examine the genetic controls of high levels of sugar accumulation, identify genetic mechanisms for the accumulation of nonstructural sugars, and link carbon allocation with iron transport. We identify a species-specific tandem duplication event controlling sugar accumulation using genome-wide association analysis, characterize multiple allelic variants causing increased sugar content, and provide further evidence of a putative neofunctionalization event conferring adaptability in Sorghum bicolor Comparative genomics indicate that this event is unique to sorghum which may further elucidate evolutionary mechanisms for adaptation and divergence within the Poaceae. Furthermore, the identification and characterization of this event was only possible with the continued advancement and improvement of the reference genome. The characterization of this region and the process in which it was discovered serve as a reminder that any reference genome is imperfect and is in need of continual improvement.
Asunto(s)
Sorghum , Carbohidratos , Genoma de Planta , Estudio de Asociación del Genoma Completo , Poaceae/genética , Sorghum/genéticaRESUMEN
Grain yield and its primary determinants, grain number and weight, are important traits in cereal crops that have been well studied; however, the genetic basis of and interactions between these traits remain poorly understood. Characterization of grain yield per primary panicle (YPP), grain number per primary panicle (GNP), and 1000-grain weight (TGW) in sorghum [ (L.) Moench], a hardy C cereal with a genome size of â¼730 Mb, was implemented in a diversity panel containing 390 accessions. These accessions were genotyped to obtain 268,830 single-nucleotide polymorphisms (SNPs). Genome-wide association studies (GWAS) were performed to identify loci associated with each grain yield component and understand the genetic interactions between these traits. Genome-wide association studies identified associations across the genome with YPP, GNP, and TGW that were located within previously mapped sorghum QTL for panicle weight, grain yield, and seed size, respectively. There were no significant associations between GNP and TGW that were within 100 kb, much greater than the average linkage disequilibrium (LD) in sorghum. The identification of nonoverlapping loci for grain number and weight suggests these traits may be manipulated independently to increase the grain yield of sorghum. Following GWAS, genomic regions surrounding each associated SNP were mined for candidate genes. Previously published expression data indicated several TGW candidate genes, including an ethylene receptor homolog, were primarily expressed within developing seed tissues to support GWAS. Furthermore, maize ( L.) homologs of identified TGW candidates were differentially expressed within the seed between small- and large-kernel lines from a segregating maize population.
Asunto(s)
Estudio de Asociación del Genoma Completo , Sorghum/genética , Genotipo , Desequilibrio de Ligamiento , Fenotipo , Polimorfismo de Nucleótido Simple , Semillas/genéticaRESUMEN
With high productivity and stress tolerance, numerous grass genera of the Andropogoneae have emerged as candidates for bioenergy production. To optimize these candidates, research examining the genetic architecture of yield, carbon partitioning, and composition is required to advance breeding objectives. Significant progress has been made developing genetic and genomic resources for Andropogoneae, and advances in comparative and computational genomics have enabled research examining the genetic basis of photosynthesis, carbon partitioning, composition, and sink strength. To provide a pivotal resource aimed at developing a comparative understanding of key bioenergy traits in the Andropogoneae, we have established and characterized an association panel of 390 racially, geographically, and phenotypically diverse Sorghum bicolor accessions with 232,303 genetic markers. Sorghum bicolor was selected because of its genomic simplicity, phenotypic diversity, significant genomic tools, and its agricultural productivity and resilience. We have demonstrated the value of sorghum as a functional model for candidate gene discovery for bioenergy Andropogoneae by performing genome-wide association analysis for two contrasting phenotypes representing key components of structural and non-structural carbohydrates. We identified potential genes, including a cellulase enzyme and a vacuolar transporter, associated with increased non-structural carbohydrates that could lead to bioenergy sorghum improvement. Although our analysis identified genes with potentially clear functions, other candidates did not have assigned functions, suggesting novel molecular mechanisms for carbon partitioning traits. These results, combined with our characterization of phenotypic and genetic diversity and the public accessibility of each accession and genomic data, demonstrate the value of this resource and provide a foundation for future improvement of sorghum and related grasses for bioenergy production.
Asunto(s)
Biocombustibles , Sorghum/genética , Agricultura/métodos , Andropogon/genética , Andropogon/metabolismo , Carbohidratos/genética , Mapeo Cromosómico , Grano Comestible/genética , Marcadores Genéticos/genética , Variación Genética , Genoma de Planta , Estudio de Asociación del Genoma Completo , Modelos Genéticos , Fitomejoramiento , Poaceae/genética , Sorghum/metabolismoRESUMEN
Improving environmental adaptation in crops is essential for food security under global change, but phenotyping adaptive traits remains a major bottleneck. If associations between single-nucleotide polymorphism (SNP) alleles and environment of origin in crop landraces reflect adaptation, then these could be used to predict phenotypic variation for adaptive traits. We tested this proposition in the global food crop Sorghum bicolor, characterizing 1943 georeferenced landraces at 404,627 SNPs and quantifying allelic associations with bioclimatic and soil gradients. Environment explained a substantial portion of SNP variation, independent of geographical distance, and genic SNPs were enriched for environmental associations. Further, environment-associated SNPs predicted genotype-by-environment interactions under experimental drought stress and aluminum toxicity. Our results suggest that genomic signatures of environmental adaptation may be useful for crop improvement, enhancing germplasm identification and marker-assisted selection. Together, genome-environment associations and phenotypic analyses may reveal the basis of environmental adaptation.
RESUMEN
Genome-wide association studies are a powerful method to dissect the genetic basis of traits, although in practice the effects of complex genetic architecture and population structure remain poorly understood. To compare mapping strategies we dissected the genetic control of flavonoid pigmentation traits in the cereal grass sorghum by using high-resolution genotyping-by-sequencing single-nucleotide polymorphism markers. Studying the grain tannin trait, we find that general linear models (GLMs) are not able to precisely map tan1-a, a known loss-of-function allele of the Tannin1 gene, with either a small panel (n = 142) or large association panel (n = 336), and that indirect associations limit the mapping of the Tannin1 locus to Mb-resolution. A GLM that accounts for population structure (Q) or standard mixed linear model that accounts for kinship (K) can identify tan1-a, whereas a compressed mixed linear model performs worse than the naive GLM. Interestingly, a simple loss-of-function genome scan, for genotype-phenotype covariation only in the putative loss-of-function allele, is able to precisely identify the Tannin1 gene without considering relatedness. We also find that the tan1-a allele can be mapped with gene resolution in a biparental recombinant inbred line family (n = 263) using genotyping-by-sequencing markers but lower precision in the mapping of vegetative pigmentation traits suggest that consistent gene-level resolution will likely require larger families or multiple recombinant inbred lines. These findings highlight that complex association signals can emerge from even the simplest traits given epistasis and structured alleles, but that gene-resolution mapping of these traits is possible with high marker density and appropriate models.