RESUMO
Crop wild relatives represent valuable sources of alleles for crop improvement, including adaptation to climate change and emerging diseases. However, introgressions from wild relatives might have deleterious effects on desirable traits, including yield, due to linkage drag. Here, we analyzed the genomic and phenotypic impacts of wild introgressions in inbred lines of cultivated sunflower to estimate the impacts of linkage drag. First, we generated reference sequences for seven cultivated and one wild sunflower genotype, as well as improved assemblies for two additional cultivars. Next, relying on previously generated sequences from wild donor species, we identified introgressions in the cultivated reference sequences, as well as the sequence and structural variants they contain. We then used a ridge-regression best linear unbiased prediction (BLUP) model to test the effects of the introgressions on phenotypic traits in the cultivated sunflower association mapping population. We found that introgression has introduced substantial sequence and structural variation into the cultivated sunflower gene pool, including >3,000 new genes. While introgressions reduced genetic load at protein-coding sequences, they mostly had negative impacts on yield and quality traits. Introgressions found at high frequency in the cultivated gene pool had larger effects than low-frequency introgressions, suggesting that the former likely were targeted by artificial selection. Also, introgressions from more distantly related species were more likely to be maladaptive than those from the wild progenitor of cultivated sunflower. Thus, breeding efforts should focus, as far as possible, on closely related and fully compatible wild relatives.
Assuntos
Helianthus , Helianthus/genética , Genoma de Planta/genética , Melhoramento Vegetal , Genótipo , GenômicaRESUMO
Grasses derive from a family of monocotyledonous plants that includes crops of major economic importance such as wheat, rice, sorghum and barley, sharing a common ancestor some 100 million years ago. The genomic attributes of plant adaptation remain obscure and the consequences of recurrent whole genome duplications (WGD) or polyploidization events, a major force in plant evolution, remain largely speculative. We conducted a comparative analysis of omics data from ten grass species to unveil structural (inversions, fusions, fissions, duplications, substitutions) and regulatory (expression and methylation) basis of genome plasticity, as possible attributes of plant long lasting evolution and adaptation. The present study demonstrates that diverged polyploid lineages sharing a common WGD event often present the same patterns of structural changes and evolutionary dynamics, but these patterns are difficult to generalize across independent WGD events as a result of non-WGD factors such as selection and domestication of crops. Polyploidy is unequivocally linked to the evolutionary success of grasses during the past 100 million years, although it remains difficult to attribute this success to particular genomic consequences of polyploidization, suggesting that polyploids harness the potential of genome duplication, at least partially, in lineage-specific ways. Overall, the present study clearly demonstrates that post-polyploidization reprogramming is more complex than traditionally reported in investigating single species and calls for a critical and comprehensive comparison across independently polyploidized lineages.
Assuntos
Genoma de Planta , Poaceae , Poaceae/genética , Genoma de Planta/genética , Filogenia , Evolução Molecular , Grão Comestível/genética , Poliploidia , Duplicação GênicaRESUMO
Climate change is expected to intensify the occurrence of abiotic stress in plants, such as hypoxia and salt stresses, leading to the production of reactive oxygen species (ROS), which need to be effectively managed by various oxido-reductases encoded by the so-called ROS gene network. Here, we studied six oxido-reductases families in three Brassicaceae species, Arabidopsis thaliana as well as Nasturtium officinale and Eutrema salsugineum, which are adapted to hypoxia and salt stress, respectively. Using available and new genomic data, we performed a phylogenomic analysis and compared RNA-seq data to study genomic and transcriptomic adaptations. This comprehensive approach allowed for the gaining of insights into the impact of the adaptation to saline or hypoxia conditions on genome organization (gene gains and losses) and transcriptional regulation. Notably, the comparison of the N. officinale and E. salsugineum genomes to that of A. thaliana highlighted changes in the distribution of ohnologs and homologs, particularly affecting class III peroxidase genes (CIII Prxs). These changes were specific to each gene, to gene families subjected to duplication events and to each species, suggesting distinct evolutionary responses. The analysis of transcriptomic data has allowed for the identification of genes related to stress responses in A. thaliana, and, conversely, to adaptation in N. officinale and E. salsugineum.
Assuntos
Arabidopsis , Brassicaceae , Brassicaceae/genética , Arabidopsis/genética , Espécies Reativas de Oxigênio , Redes Reguladoras de Genes , Oxirredutases/genética , Hipóxia , Regulação da Expressão Gênica de Plantas , Estresse FisiológicoRESUMO
MADS-box transcription factors are important regulators of floral organ identity through their binding to specific motifs, termed CArG, in the promoter of their target genes. Petal initiation and development depend on class A and B genes, but MADS-box genes of the APETALA3 (AP3) clade are key regulators of this process. In the early diverging eudicot Nigella damascena, an apetalous [T] morph is characterized by the lack of expression of the NdAP3-3 gene, with its expression being petal-specific in the wild-type [P] morph. All [T] morph plants are homozygous for an NdAP3-3 allele with a Miniature Inverted-repeat Transposable Element (MITE) insertion in the second intron of the gene. Here, we investigated to which extent the MITE insertion impairs regulation of the NdAP3-3 gene. We found that expression of NdAP3-3 is initiated in the [T] morph, but the MITE insertion prevents its positive self-maintenance by affecting the correct splicing of the mRNA. We also found specific CArG features in the promoter of the NdAP3-3 genes with petal-specific expression. However, they are not sufficient to drive expression only in petals of transgenic Arabidopsis, highlighting the existence of Nigella-specific cis/trans-acting factors in regulating AP3 paralogs.
Assuntos
Proteínas de Arabidopsis , Arabidopsis , Nigella damascena , Nigella damascena/metabolismo , Elementos de DNA Transponíveis/genética , Proteínas de Plantas/genética , Proteínas de Plantas/metabolismo , Proteínas de Domínio MADS/genética , Proteínas de Arabidopsis/metabolismo , Flores , Arabidopsis/genética , Regulação da Expressão Gênica de PlantasRESUMO
The domesticated sunflower, Helianthus annuus L., is a global oil crop that has promise for climate change adaptation, because it can maintain stable yields across a wide variety of environmental conditions, including drought. Even greater resilience is achievable through the mining of resistance alleles from compatible wild sunflower relatives, including numerous extremophile species. Here we report a high-quality reference for the sunflower genome (3.6 gigabases), together with extensive transcriptomic data from vegetative and floral organs. The genome mostly consists of highly similar, related sequences and required single-molecule real-time sequencing technologies for successful assembly. Genome analyses enabled the reconstruction of the evolutionary history of the Asterids, further establishing the existence of a whole-genome triplication at the base of the Asterids II clade and a sunflower-specific whole-genome duplication around 29 million years ago. An integrative approach combining quantitative genetics, expression and diversity data permitted development of comprehensive gene networks for two major breeding traits, flowering time and oil metabolism, and revealed new candidate genes in these networks. We found that the genomic architecture of flowering time has been shaped by the most recent whole-genome duplication, which suggests that ancient paralogues can remain in the same regulatory networks for dozens of millions of years. This genome represents a cornerstone for future research programs aiming to exploit genetic diversity to improve biotic and abiotic stress resistance and oil production, while also considering agricultural constraints and human nutritional needs.
Assuntos
Evolução Molecular , Flores/genética , Flores/fisiologia , Genoma de Planta/genética , Helianthus/genética , Helianthus/metabolismo , Óleos de Plantas/metabolismo , Aclimatação/genética , Duplicação Gênica/genética , Regulação da Expressão Gênica de Plantas , Variação Genética , Genômica , Helianthus/classificação , Análise de Sequência de DNA , Estresse Fisiológico/genética , Óleo de Girassol , Transcriptoma/genéticaRESUMO
Cas9-assisted targeting of DNA fragments in complex genomes is viewed as an essential strategy to obtain high-quality and continuous sequence data. However, the purity of target loci selected by pulsed-field gel electrophoresis (PFGE) has so far been insufficient to assemble the sequence in one contig. Here, we describe the µLAS technology to capture and purify high molecular weight DNA. First, the technology is optimized to perform high sensitivity DNA profiling with a limit of detection of 20 fg/µl for 50 kb fragments and an analytical time of 50 min. Then, µLAS is operated to isolate a 31.5 kb locus cleaved by Cas9 in the genome of the plant Medicago truncatula. Target purification is validated on a Bacterial Artificial Chromosome plasmid, and subsequently carried out in whole genome with µLAS, PFGE or by combining these techniques. PacBio sequencing shows an enrichment factor of the target sequence of 84 with PFGE alone versus 892 by association of PFGE with µLAS. These performances allow us to sequence and assemble one contig of 29 441 bp with 99% sequence identity to the reference sequence.
Assuntos
Sistemas CRISPR-Cas , DNA de Plantas/genética , Genoma de Planta/genética , Medicago truncatula/genética , Análise de Sequência de DNA/métodos , Cromossomos Artificiais Bacterianos , Biologia Computacional/métodos , DNA de Plantas/isolamento & purificação , Eletroforese em Gel de Campo Pulsado/métodos , Reprodutibilidade dos TestesRESUMO
Legumes (Fabaceae or Leguminosae) are unique among cultivated plants for their ability to carry out endosymbiotic nitrogen fixation with rhizobial bacteria, a process that takes place in a specialized structure known as the nodule. Legumes belong to one of the two main groups of eurosids, the Fabidae, which includes most species capable of endosymbiotic nitrogen fixation. Legumes comprise several evolutionary lineages derived from a common ancestor 60 million years ago (Myr ago). Papilionoids are the largest clade, dating nearly to the origin of legumes and containing most cultivated species. Medicago truncatula is a long-established model for the study of legume biology. Here we describe the draft sequence of the M. truncatula euchromatin based on a recently completed BAC assembly supplemented with Illumina shotgun sequence, together capturing â¼94% of all M. truncatula genes. A whole-genome duplication (WGD) approximately 58 Myr ago had a major role in shaping the M. truncatula genome and thereby contributed to the evolution of endosymbiotic nitrogen fixation. Subsequent to the WGD, the M. truncatula genome experienced higher levels of rearrangement than two other sequenced legumes, Glycine max and Lotus japonicus. M. truncatula is a close relative of alfalfa (Medicago sativa), a widely cultivated crop with limited genomics tools and complex autotetraploid genetics. As such, the M. truncatula genome sequence provides significant opportunities to expand alfalfa's genomic toolbox.
Assuntos
Evolução Biológica , Genoma de Planta , Medicago truncatula/genética , Medicago truncatula/microbiologia , Rhizobium/fisiologia , Simbiose , Dados de Sequência Molecular , Fixação de Nitrogênio/genética , Glycine max/genética , Sintenia , Vitis/genéticaRESUMO
Dispersal is a central component of life history evolution. An increasing number of studies suggest that spatiotemporally variable environments may promote the evolution of "dispersal syndromes," consisting of covariation patterns between dispersal and morphological, physiological, behavioral, and life history traits. At the interspecific scale, the "colonizer syndrome" appears to be one of the most frequently recorded associations between dispersal and life history traits, linking a high dispersal rate, high fecundity, and a short lifespan as systematically combined adaptations in spatiotemporally varying environments. However, few studies have highlighted the existence of a "colonizer syndrome" at the intraspecific scale, and none have investigated how different degrees of habitat stochasticity might shape covariation patterns between dispersal and life history traits. In this study, we examined this issue in free-ranging metapopulations of the yellow-bellied toad (Bombina variegata) using capture-recapture data. Combining the results of this study with another recent study, we found that a high dispersal rate, high fecundity, and a short lifespan are associated in metapopulations experiencing unpredictable environments. In contrast, a very low dispersal rate (close to zero), low fecundity and a long lifespan are associated in metapopulations occupying predictable environments. We discuss these results as well as their demographic and evolutionary consequences.
Assuntos
Anuros , Ecossistema , Animais , Evolução Biológica , Demografia , Fertilidade , Dinâmica PopulacionalRESUMO
Self-incompatibility has been considered by geneticists a model system for reproductive biology and balancing selection, but our understanding of the genetic basis and evolution of this molecular lock-and-key system has remained limited by the extreme level of sequence divergence among haplotypes, resulting in a lack of appropriate genomic sequences. In this study, we report and analyze the full sequence of eleven distinct haplotypes of the self-incompatibility locus (S-locus) in two closely related Arabidopsis species, obtained from individual BAC libraries. We use this extensive dataset to highlight sharply contrasted patterns of molecular evolution of each of the two genes controlling self-incompatibility themselves, as well as of the genomic region surrounding them. We find strong collinearity of the flanking regions among haplotypes on each side of the S-locus together with high levels of sequence similarity. In contrast, the S-locus region itself shows spectacularly deep gene genealogies, high variability in size and gene organization, as well as complete absence of sequence similarity in intergenic sequences and striking accumulation of transposable elements. Of particular interest, we demonstrate that dominant and recessive S-haplotypes experience sharply contrasted patterns of molecular evolution. Indeed, dominant haplotypes exhibit larger size and a much higher density of transposable elements, being matched only by that in the centromere. Overall, these properties highlight that the S-locus presents many striking similarities with other regions involved in the determination of mating-types, such as sex chromosomes in animals or in plants, or the mating-type locus in fungi and green algae.
Assuntos
Proteínas de Arabidopsis/genética , Arabidopsis/genética , Evolução Molecular , Haplótipos , Autoincompatibilidade em Angiospermas/genética , Elementos de DNA Transponíveis/genética , Rearranjo Gênico , Genes Dominantes , Genes Recessivos , Filogenia , Recombinação Genética , Análise de Sequência de DNARESUMO
BACKGROUND: The passion fruit (Passiflora edulis) is a tropical crop of economic importance both for juice production and consumption as fresh fruit. The juice is also used in concentrate blends that are consumed worldwide. However, very little is known about the genome of the species. Therefore, improving our understanding of passion fruit genomics is essential and to some degree a pre-requisite if its genetic resources are to be used more efficiently. In this study, we have constructed a large-insert BAC library and provided the first view on the structure and content of the passion fruit genome, using BAC-end sequence (BES) data as a major resource. RESULTS: The library consisted of 82,944 clones and its levels of organellar DNA were very low. The library represents six haploid genome equivalents, and the average insert size was 108 kb. To check its utility for gene isolation, successful macroarray screening experiments were carried out with probes complementary to eight Passiflora gene sequences available in public databases. BACs harbouring those genes were used in fluorescent in situ hybridizations and unique signals were detected for four BACs in three chromosomes (n=9). Then, we explored 10,000 BES and we identified reads likely to contain repetitive mobile elements (19.6% of all BES), simple sequence repeats and putative proteins, and to estimate the GC content (~42%) of the reads. Around 9.6% of all BES were found to have high levels of similarity to plant genes and ontological terms were assigned to more than half of the sequences analysed (940). The vast majority of the top-hits made by our sequences were to Populus trichocarpa (24.8% of the total occurrences), Theobroma cacao (21.6%), Ricinus communis (14.3%), Vitis vinifera (6.5%) and Prunus persica (3.8%). CONCLUSIONS: We generated the first large-insert library for a member of Passifloraceae. This BAC library provides a new resource for genetic and genomic studies, as well as it represents a valuable tool for future whole genome study. Remarkably, a number of BAC-end pair sequences could be mapped to intervals of the sequenced Arabidopsis thaliana, V. vinifera and P. trichocarpa chromosomes, and putative collinear microsyntenic regions were identified.
Assuntos
Cromossomos Artificiais Bacterianos/genética , Genômica/métodos , Passiflora/genética , Mapeamento Cromossômico , Elementos de DNA Transponíveis/genética , Genoma de Planta/genética , Repetições de Microssatélites/genética , Fases de Leitura Aberta/genética , Retroelementos/genéticaRESUMO
In flowering plants, outcrossing is commonly ensured by self-incompatibility (SI) systems. These can be homomorphic (typically with many different allelic specificities) or can accompany flower heteromorphism (mostly with just two specificities and corresponding floral types). The SI system of the Oleaceae family is unusual, with the long-term maintenance of only two specificities but often without flower morphology differences. To elucidate the genomic architecture and molecular basis of this SI system, we obtained chromosome-scale genome assemblies of Phillyrea angustifolia individuals and related them to a genetic map. The S-locus region proved to have a segregating 543-kb indel unique to one specificity, suggesting a hemizygous region, as observed in all distylous systems so far studied at the genomic level. Only one of the predicted genes in this indel region is found in the olive tree, Olea europaea, genome, also within a segregating indel. We describe complete association between the presence/absence of this gene and the SI types determined for individuals of seven distantly related Oleaceae species. This gene is predicted to be involved in catabolism of the gibberellic acid (GA) hormone, and experimental manipulation of GA levels in developing buds modified the male and female SI responses of the two specificities in different ways. Our results provide a unique example of a homomorphic SI system, where a single conserved gibberellin-related gene in a hemizygous indel underlies the long-term maintenance of two groups of reproductive compatibility.
Assuntos
Giberelinas , Giberelinas/metabolismo , Oleaceae/genética , Oleaceae/metabolismo , Oleaceae/crescimento & desenvolvimento , Autoincompatibilidade em Angiospermas/genética , Genoma de Planta , Flores/genética , Flores/crescimento & desenvolvimento , Proteínas de Plantas/genética , Proteínas de Plantas/metabolismoRESUMO
BACKGROUND: Freezing provokes severe yield losses to different fall-sown annual legumes. Understanding the molecular bases of freezing tolerance is of great interest for breeding programs. Medicago truncatula Gaertn. is an annual temperate forage legume that has been chosen as a model species for agronomically and economically important legume crops. The present study aimed to identify positional candidate genes for a major freezing tolerance quantitative trait locus that was previously mapped to M. truncatula chromosome 6 (Mt-FTQTL6) using the LR3 population derived from a cross between the freezing-tolerant accession F83005-5 and the freezing-sensitive accession DZA045-5. RESULTS: The confidence interval of Mt-FTQTL6 was narrowed down to the region comprised between markers MTIC153 and NT6054 using recombinant F7 and F8 lines. A bacterial-artificial chromosome (BAC) clone contig map was constructed in an attempt to close the residual assembly gap existing therein. Twenty positional candidate genes including twelve C-repeat binding factor (CBF)/dehydration-responsive element binding factor 1 (DREB1) genes were identified from BAC-derived sequences and whole-genome shotgun sequences (WGS). CBF/DREB1 genes are organized in a tandem array within an approximately 296-Kb region. Eleven CBF/DREB1 genes were isolated and sequenced from F83005-5 and DZA045-5 which revealed high polymorphism among these accessions. Unique features characterizing CBF/DREB1 genes from M. truncatula, such as alternative splicing and large tandem duplication, are elucidated for the first time. CONCLUSIONS: Overall, twenty genes were identified as potential candidates to explain Mt-FTQTL6 effect. Their future functional characterization will uncover the gene(s) involved in freezing tolerance difference observed between F83005-5 and DZA045-5. Knowledge transfer for breeding improvement of crop legumes is expected. Furthermore, CBF/DREB1 related data will certainly have a large impact on research studies targeting this group of transcriptional activators in M. truncatula and other legume species.
Assuntos
Proteínas de Arabidopsis/genética , Congelamento , Medicago truncatula/genética , Fatores de Transcrição/genética , Aclimatação/genética , Arabidopsis/genética , Arabidopsis/metabolismo , Proteínas de Arabidopsis/fisiologia , Sequência de Bases , Cromossomos de Plantas/genética , Desidratação , Regulação da Expressão Gênica de Plantas , Medicago truncatula/crescimento & desenvolvimento , Fenótipo , Locos de Características Quantitativas/genética , Fatores de Transcrição/fisiologiaRESUMO
Ralstonia solanacearum is a major soilborne pathogen that attacks > 200 plant species, including major crops. To characterize MtQRRS1, a major quantitative trait locus (QTL) for resistance towards this bacterium in the model legume Medicago truncatula, genetic and functional approaches were combined. QTL analyses together with disease scoring of heterogeneous inbred families were used to define the locus. The candidate region was studied by physical mapping using a bacterial artificial chromosome (BAC) library of the resistant line, and sequencing. In planta bacterial growth measurements, grafting experiments and gene expression analysis were performed to investigate the mechanisms by which this locus confers resistance to R. solanacearum. The MtQRRS1 locus was localized to the same position in two recombinant inbred line populations and was narrowed down to a 64 kb region. Comparison of parental line sequences revealed 15 candidate genes with sequence polymorphisms, but no evidence of differential gene expression upon infection. A role for the hypocotyl in resistance establishment was shown. These data indicate that the quantitative resistance to bacterial wilt conferred by MtQRRS1, which contains a cluster of seven R genes, is shared by different accessions and may act through intralocus interactions to promote resistance.
Assuntos
Resistência à Doença/genética , Medicago truncatula/genética , Medicago truncatula/microbiologia , Doenças das Plantas/imunologia , Doenças das Plantas/microbiologia , Locos de Características Quantitativas/genética , Ralstonia solanacearum/fisiologia , Cromossomos de Plantas/genética , Análise por Conglomerados , Cruzamentos Genéticos , Perfilação da Expressão Gênica , Regulação da Expressão Gênica de Plantas , Estudos de Associação Genética , Genótipo , Hipocótilo/imunologia , Hipocótilo/microbiologia , Endogamia , Medicago truncatula/imunologia , Anotação de Sequência Molecular , Dados de Sequência Molecular , Fenótipo , Mapeamento Físico do Cromossomo , Doenças das Plantas/genética , Polimorfismo de Nucleotídeo Único/genética , Reprodutibilidade dos TestesRESUMO
Bread wheat (Triticum aestivum L.) is one of the most important crops globally and a high priority for genetic improvement, but its large and complex genome has been seen as intractable to whole genome sequencing. Isolation of individual wheat chromosome arms has facilitated large-scale sequence analyses. However, so far there is no such survey of sequences from the A genome of wheat. Greater understanding of an A chromosome could facilitate wheat improvement and future sequencing of the entire genome. We have constructed BAC library from the long arm of T. aestivum chromosome 1A (1AL) and obtained BAC end sequences from 7,470 clones encompassing the arm. We obtained 13,445 (89.99%) useful sequences with a cumulative length of 7.57 Mb, representing 1.43% of 1AL and about 0.14% of the entire A genome. The GC content of the sequences was 44.7%, and 90% of the chromosome was estimated to comprise repeat sequences, while just over 1% encoded expressed genes. From the sequence data, we identified a large number of sites suitable for development of molecular markers (362 SSR and 6,948 ISBP) which will have utility for mapping this chromosome and for marker assisted breeding. From 44 putative ISBP markers tested 23 (52.3%) were found to be useful. The BAC end sequence data also enabled the identification of genes and syntenic blocks specific to chromosome 1AL, suggesting regions of particular functional interest and targets for future research.
Assuntos
Cromossomos de Plantas/genética , Triticum/genética , Cromossomos Artificiais Bacterianos/genética , Clonagem Molecular , Marcadores Genéticos , Genoma de Planta , Repetições de Microssatélites , Anotação de Sequência Molecular , Proteínas de Plantas/genética , Polimorfismo Genético , Análise de Sequência de DNA , SinteniaRESUMO
To improve our understanding of the organization and regulation of the wheat (Triticum aestivum) gene space, we established a transcription map of a wheat chromosome (3B) by hybridizing a newly developed wheat expression microarray with bacterial artificial chromosome pools from a new version of the 3B physical map as well as with cDNA probes derived from 15 RNA samples. Mapping data for almost 3,000 genes showed that the gene space spans the whole chromosome 3B with a 2-fold increase of gene density toward the telomeres due to an increase in the number of genes in islands. Comparative analyses with rice (Oryza sativa) and Brachypodium distachyon revealed that these gene islands are composed mainly of genes likely originating from interchromosomal gene duplications. Gene Ontology and expression profile analyses for the 3,000 genes located along the chromosome revealed that the gene islands are enriched significantly in genes sharing the same function or expression profile, thereby suggesting that genes in islands acquired shared regulation during evolution. Only a small fraction of these clusters of cofunctional and coexpressed genes was conserved with rice and B. distachyon, indicating a recent origin. Finally, genes with the same expression profiles in remote islands (coregulation islands) were identified suggesting long-distance regulation of gene expression along the chromosomes in wheat.
Assuntos
Cromossomos Artificiais Bacterianos/genética , DNA Intergênico/genética , Genoma de Planta/genética , Ilhas Genômicas/fisiologia , Mapeamento Físico do Cromossomo/métodos , Triticum/genética , Sequência de Bases , Brachypodium/genética , Centrômero/genética , Cromossomos de Plantas/genética , DNA de Plantas/química , DNA de Plantas/genética , Evolução Molecular , Duplicação Gênica , Perfilação da Expressão Gênica , Regulação da Expressão Gênica de Plantas/genética , Genes de Plantas/genética , Ilhas Genômicas/genética , Dados de Sequência Molecular , Família Multigênica , Análise de Sequência com Séries de Oligonucleotídeos , Oryza/genética , Poliploidia , Análise de Sequência de DNA , Telômero/genética , TranscriptomaRESUMO
BACKGROUND: The sequencing of the wheat (Triticum aestivum) genome has been a methodological challenge for many years owing to its large size (15.5 Gb), repeat content, and hexaploidy. Many initiatives aiming at obtaining a reference genome of cultivar Chinese Spring have been launched in the past years and it was achieved in 2018 as the result of a huge effort to combine short-read sequencing with many other resources. Reference-quality genome assemblies were then produced for other accessions, but the rapid evolution of sequencing technologies offers opportunities to reach high-quality standards at lower cost. RESULTS: Here, we report on an optimized procedure based on long reads produced on the Oxford Nanopore Technology PromethION device to assemble the genome of the French bread wheat cultivar Renan. CONCLUSIONS: We provide the most contiguous chromosome-scale assembly of a bread wheat genome to date. Coupled with an annotation based on RNA-sequencing data, this resource will be valuable for the crop community and will facilitate the rapid selection of agronomically important traits. We also provide a framework to generate high-quality assemblies of complex genomes using ONT.
Assuntos
Genoma , Triticum , Cruzamento , Cromossomos , Análise de Sequência de DNA/métodos , Triticum/genéticaRESUMO
Recent technological advances in next-generation sequencing (NGS) technologies have dramatically reduced the cost of DNA sequencing, allowing species with large and complex genomes to be sequenced. Although bread wheat (Triticum aestivum L.) is one of the world's most important food crops, efficient exploitation of molecular marker-assisted breeding approaches has lagged behind that achieved in other crop species, due to its large polyploid genome. However, an international public-private effort spanning 9 years reported over 65% draft genome of bread wheat in 2014, and finally, after more than a decade culminated in the release of a gold-standard, fully annotated reference wheat-genome assembly in 2018. Shortly thereafter, in 2020, the genome of assemblies of additional 15 global wheat accessions was released. As a result, wheat has now entered into the pan-genomic era, where basic resources can be efficiently exploited. Wheat genotyping with a few hundred markers has been replaced by genotyping arrays, capable of characterizing hundreds of wheat lines, using thousands of markers, providing fast, relatively inexpensive, and reliable data for exploitation in wheat breeding. These advances have opened up new opportunities for marker-assisted selection (MAS) and genomic selection (GS) in wheat. Herein, we review the advances and perspectives in wheat genetics and genomics, with a focus on key traits, including grain yield, yield-related traits, end-use quality, and resistance to biotic and abiotic stresses. We also focus on reported candidate genes cloned and linked to traits of interest. Furthermore, we report on the improvement in the aforementioned quantitative traits, through the use of (i) clustered regularly interspaced short-palindromic repeats/CRISPR-associated protein 9 (CRISPR/Cas9)-mediated gene-editing and (ii) positional cloning methods, and of genomic selection. Finally, we examine the utilization of genomics for the next-generation wheat breeding, providing a practical example of using in silico bioinformatics tools that are based on the wheat reference-genome sequence.
RESUMO
All organisms on Earth are exposed to low doses of natural radioactivity but some habitats are more radioactive than others. Yet, documenting the influence of natural radioactivity on the evolution of biodiversity is challenging. Here, we addressed whether organisms living in naturally more radioactive habitats accumulate more mutations across generations using 14 species of waterlice living in subterranean habitats with contrasted levels of radioactivity. We found that the mitochondrial and nuclear mutation rates across a waterlouse species' genome increased on average by 60% and 30%, respectively, when radioactivity increased by a factor of three. We also found a positive correlation between the level of radioactivity and the probability of G to T (and complementary C to A) mutations, a hallmark of oxidative stress. We conclude that even low doses of natural bedrock radioactivity influence the mutation rate possibly through the accumulation of oxidative damage, in particular in the mitochondrial genome.
Assuntos
Evolução Molecular , Fenômenos Geológicos , Isópodes/genética , Isópodes/efeitos da radiação , Raios Ultravioleta , Animais , Genes Mitocondriais/genética , Mutação , FilogeniaRESUMO
Sugarcane (Saccharum spp.) is highly polyploid and aneuploid. Modern cultivars are derived from hybridization between S. officinarum and S. spontaneum. This combination results in a genome exhibiting variable ploidy among different loci, a huge genome size (~10 Gb) and a high content of repetitive regions. An approach using genomic, transcriptomic, and genetic mapping can improve our knowledge of the behavior of genetics in sugarcane. The hypothetical HP600 and Centromere Protein C (CENP-C) genes from sugarcane were used to elucidate the allelic expression and genomic and genetic behaviors of this complex polyploid. The physically linked side-by-side genes HP600 and CENP-C were found in two different homeologous chromosome groups with ploidies of eight and ten. The first region (Region01) was a Sorghum bicolor ortholog region with all haplotypes of HP600 and CENP-C expressed, but HP600 exhibited an unbalanced haplotype expression. The second region (Region02) was a scrambled sugarcane sequence formed from different noncollinear genes containing partial duplications of HP600 and CENP-C (paralogs). This duplication resulted in a non-expressed HP600 pseudogene and a recombined fusion version of CENP-C and the orthologous gene Sobic.003G299500 with at least two chimeric gene haplotypes expressed. It was also determined that it occurred before Saccharum genus formation and after the separation of sorghum and sugarcane. A linkage map was constructed using markers from nonduplicated Region01 and for the duplication (Region01 and Region02). We compare the physical and linkage maps, demonstrating the possibility of mapping markers located in duplicated regions with markers in nonduplicated region. Our results contribute directly to the improvement of linkage mapping in complex polyploids and improve the integration of physical and genetic data for sugarcane breeding programs. Thus, we describe the complexity involved in sugarcane genetics and genomics and allelic dynamics, which can be useful for understanding complex polyploid genomes.
RESUMO
Sunflower is an important oilseed crop, as well as a model system for evolutionary studies, but its 3.6 gigabase genome has proven difficult to assemble, in part because of the high repeat content of its genome. Here we report on the sequencing, assembly, and analyses of 96 randomly chosen BACs from sunflower to provide additional information on the repeat content of the sunflower genome, assess how repetitive elements in the sunflower genome are organized relative to genes, and compare the genomic distribution of these repeats to that found in other food crops and model species. We also examine the expression of transposable element-related transcripts in EST databases for sunflower to determine the representation of repeats in the transcriptome and to measure their transcriptional activity. Our data confirm previous reports in suggesting that the sunflower genome is >78% repetitive. Sunflower repeats share very little similarity to other plant repeats such as those of Arabidopsis, rice, maize and wheat; overall 28% of repeats are "novel" to sunflower. The repetitive sequences appear to be randomly distributed within the sequenced BACs. Assuming the 96 BACs are representative of the genome as a whole, then approximately 5.2% of the sunflower genome comprises non TE-related genic sequence, with an average gene density of 18kbp/gene. Expression levels of these transposable elements indicate tissue specificity and differential expression in vegetative and reproductive tissues, suggesting that expressed TEs might contribute to sunflower development. The assembled BACs will also be useful for assessing the quality of several different draft assemblies of the sunflower genome and for annotating the reference sequence.