RESUMEN
Aegilops umbellulata serve as an important reservoir for novel biotic and abiotic stress tolerance for wheat improvement. However, chromosomal rearrangements and evolutionary trajectory of this species remain to be elucidated. Here, we present a comprehensive investigation into Ae. umbellulata genome by generating a high-quality near telomere-to-telomere genome assembly of PI 554389 and resequencing 20 additional Ae. umbellulata genomes representing diverse geographical and phenotypic variations. Our analysis unveils complex chromosomal rearrangements, most prominently in 4U and 6U chromosomes, delineating a distinct evolutionary trajectory of Ae. umbellulata from wheat and its relatives. Furthermore, our data rectified the erroneous naming of chromosomes 4U and 6U in the past and highlighted multiple major evolutionary events that led to the present-day U-genome. Resequencing of diverse Ae. umbellulata accessions revealed high genetic diversity within the species, partitioning into three distinct evolutionary sub-populations and supported by extensive phenotypic variability in resistance against several races/pathotypes of five major wheat diseases. Disease evaluations indicated the presence of several novel resistance genes in the resequenced lines for future studies. Resequencing also resulted in the identification of six new haplotypes for Lr9, the first resistance gene cloned from Ae. umbellulata. The extensive genomic and phenotypic resources presented in this study will expedite the future genetic exploration of Ae. umbellulata, facilitating efforts aimed at enhancing resiliency and productivity in wheat.
RESUMEN
Until recently, achieving a reference-quality genome sequence for bread wheat was long thought beyond the limits of genome sequencing and assembly technology, primarily due to the large genome size and >â 80% repetitive sequence content. The release of the chromosome scale 14.5-Gb IWGSC RefSeq v1.0 genome sequence of bread wheat cv. Chinese Spring (CS) was, therefore, a milestone. Here, we used a direct label and stain (DLS) optical map of the CS genome together with a prior nick, label, repair and stain (NLRS) optical map, and sequence contigs assembled with Pacific Biosciences long reads, to refine the v1.0 assembly. Inconsistencies between the sequence and maps were reconciled and gaps were closed. Gap filling and anchoring of 279 unplaced scaffolds increased the total length of pseudomolecules by 168 Mb (excluding Ns). Positions and orientations were corrected for 233 and 354 scaffolds, respectively, representing 10% of the genome sequence. The accuracy of the remaining 90% of the assembly was validated. As a result of the increased contiguity, the numbers of transposable elements (TEs) and intact TEs have increased in IWGSC RefSeq v2.1 compared with v1.0. In total, 98% of the gene models identified in v1.0 were mapped onto this new assembly through development of a dedicated approach implemented in the MAGAAT pipeline. The numbers of high-confidence genes on pseudomolecules have increased from 105 319 to 105 534. The reconciled assembly enhances the utility of the sequence for genetic mapping, comparative genomics, gene annotation and isolation, and more general studies on the biology of wheat.
Asunto(s)
Mapeo Cromosómico/métodos , Genoma de Planta , Triticum/genética , Cromosomas Artificiales Bacterianos , Cromosomas de las Plantas/química , Elementos Transponibles de ADN , Anotación de Secuencia MolecularRESUMEN
Grain development of Triticum aestivum is being studied extensively using individual OMICS tools. However, integrated transcriptome and proteome studies are limited mainly due to complexity of genome. Current study focused to unravel the transcriptome-proteome coordination of key mechanisms underlying carbohydrate metabolism during whole wheat grain development. Wheat grains were manually dissected to obtain grain tissues for proteomics and transcriptomics analyses. Differentially expressed proteins and transcripts at the 11 stages of grain development were compared. Computational workflow for integration of two datasets related to carbohydrate metabolism was designed. For CM proteins, output peptide sequences of proteomic analyses (via LC-MS/MS) were used as source to search corresponding transcripts. The transcript that turned out with higher number of peptides was selected as bona fide ribonucleotide sequence for respective protein synthesis. More than 90% of hits resulted in successful identification of respective transcripts. Comparative analysis of protein and transcript expression profiles resulted in overall 32% concordance between these two series of data. However, during grain development correlation of two datasets gradually increased up to ~ tenfold from 152 to 655 °Cd and then dropped down. Proteins involved in carbohydrate metabolism were divided in five categories in accordance with their functions. Enzymes involved in starch and sucrose biosynthesis showed the highest correlations between proteome-transcriptome profiles. High percentage of identification and validation of protein-transcript hits highlighted the power of omics data integration approach over existing gene functional annotation tools. We found that correlation of two datasets is highly influenced by stage of grain development. Further, gene regulatory networks would be helpful in unraveling the mechanisms underlying the complex and significant traits such as grain weight and yield.
Asunto(s)
Metabolismo de los Hidratos de Carbono/fisiología , Triticum/genética , Triticum/metabolismo , Carbohidratos/genética , Cromatografía Liquida/métodos , Grano Comestible/genética , Perfilación de la Expresión Génica/métodos , Regulación de la Expresión Génica de las Plantas/genética , Proteínas de Plantas/genética , Proteoma/genética , Proteómica/métodos , Espectrometría de Masas en Tándem/métodos , Transcriptoma/genéticaRESUMEN
BACKGROUND: Fusarium crown rot (FCR) is a major cereal disease in semi-arid areas worldwide. Of the various QTL reported, the one on chromosome arm 3BL (Qcrs.cpi-3B) has the largest effect that can be consistently detected in different genetic backgrounds. Nine sets of near isogenic lines (NILs) for this locus were made available in a previous study. To identify markers that could be reliably used in tagging the Qcrs.cpi-3B locus, a NIL-derived population consisting of 774 F10 lines were generated and exploited to assess markers selected from the existing linkage map and generated from sequences of the 3B pseudomolecule. RESULTS: This is the first report on fine mapping a QTL conferring FCR resistance in wheat. By three rounds of linkage mapping using the NILs and the NIL-derived population, the Qcrs.cpi-3B locus was mapped to an interval of 0.7 cM covering a physical distance of about 1.5 Mb. Seven markers co-segregating with the locus were developed. This interval contains a total of 63 gene-coding sequences based on the 3B pseudomolecule, and six of them were known to encode disease resistance proteins. Several of the genes in this interval were among those responsive to FCR infection detected in an earlier study. CONCLUSIONS: The accurate localization of the Qcrs.cpi-3B locus and the development of the markers co-segregating with it should facilitate the incorporation of this large-effect QTL conferring FCR resistance into breeding programs as well as the cloning of the gene(s) underlying the QTL.
Asunto(s)
Cromosomas de las Plantas/genética , Resistencia a la Enfermedad/genética , Enfermedades de las Plantas/genética , Triticum/genética , Mapeo Cromosómico , Fusarium/genética , Fusarium/patogenicidad , Enfermedades de las Plantas/microbiología , Sitios de Carácter Cuantitativo/genética , Triticum/microbiologíaRESUMEN
Meiosis, the basis of sex, evolved through iterative gene duplications. To understand whether subsequent duplications have further enriched the core meiotic "tool-kit," we investigated the fate of meiotic gene duplicates following whole genome duplication (WGD), a common occurrence in eukaryotes. We show that meiotic genes return to a single copy more rapidly than genome-wide average in angiosperms, one of the lineages in which WGD is most vividly exemplified. The rate at which duplicates are lost decreases through time, a tendency that is also observed genome-wide and may thus prove to be a general trend post-WGD. The sharpest decline is observed for the subset of genes mediating meiotic recombination; however, we found no evidence that the presence of these duplicates is counterselected in two recent polyploid crops selected for fertility. We therefore propose that their loss is passive, highlighting how quickly WGDs are resolved in the absence of selective duplicate retention.
Asunto(s)
Magnoliopsida/genética , Meiosis , Evolución Molecular , Duplicación de Gen , Genoma de Planta , Recombinación HomólogaRESUMEN
The physical map of the hexaploid wheat chromosome 3B was screened using centromeric DNA probes. A 1.1-Mb region showing the highest number of positive bacterial artificial chromosome (BAC) clones was fully sequenced and annotated, revealing that 96% of the DNA consisted of transposable elements, mainly long terminal repeat (LTR) retrotransposons (88%). Estimation of the insertion time of the transposable elements revealed that CRW (also called Cereba) and Quinta are the youngest elements at the centromeres of common wheat (Triticum spp.) and its diploid ancestors, with Quinta being younger than CRW in both diploid and hexaploid wheats. Chromatin immunoprecipitation experiments revealed that both CRW and Quinta families are targeted by the centromere-specific histone H3 variant CENH3. Immuno colocalization of retroelements and CENH3 antibody indicated that a higher proportion of Quinta than CRWs was associated with CENH3, although CRWs were more abundant. Long arrays of satellite repeats were also identified in the wheat centromere regions, but they lost the ability to bind with CENH3. In addition to transposons, two functional genes and one pseudogene were identified. The gene density in the centromere appeared to be between three and four times lower than the average gene density of chromosome 3B. Comparisons with related grasses also indicated a loss of microcollinearity in this region. Finally, comparison of centromeric sequences of Aegilops tauschii (DD), Triticum boeoticum (AA) and hexaploid wheat revealed that the centromeres in both the polyploids and diploids are still undergoing dynamic changes, and that the new CRWs and Quintas may have undertaken the core role in kinetochore formation.
Asunto(s)
Centrómero/genética , Cromosomas de las Plantas , Retroelementos/genética , Triticum/genética , Cromosomas Artificiales Bacterianos , Elementos Transponibles de ADN , Grano Comestible/genética , Histonas/genética , Filogenia , Mapeo Físico de Cromosoma , Poaceae/genética , Poliploidía , Seudogenes , SinteníaRESUMEN
All six arms of the group 1 chromosomes of hexaploid wheat (Triticum aestivum) were sequenced with Roche/454 to 1.3- to 2.2-fold coverage and compared with similar data sets from the homoeologous chromosome 1H of barley (Hordeum vulgare). Six to ten thousand gene sequences were sampled per chromosome. These were classified into genes that have their closest homologs in the Triticeae group 1 syntenic region in Brachypodium, rice (Oryza sativa), and/or sorghum (Sorghum bicolor) and genes that have their homologs elsewhere in these model grass genomes. Although the number of syntenic genes was similar between the homologous groups, the amount of nonsyntenic genes was found to be extremely diverse between wheat and barley and even between wheat subgenomes. Besides a small core group of genes that are nonsyntenic in other grasses but conserved among Triticeae, we found thousands of genic sequences that are specific to chromosomes of one single species or subgenome. By examining in detail 50 genes from chromosome 1H for which BAC sequences were available, we found that many represent pseudogenes that resulted from transposable element activity and double-strand break repair. Thus, Triticeae seem to accumulate nonsyntenic genes frequently. Since many of them are likely to be pseudogenes, total gene numbers in Triticeae are prone to pronounced overestimates.
Asunto(s)
Genes de Plantas/genética , Genoma de Planta/genética , Hordeum/genética , Poaceae/genética , Seudogenes/genética , Triticum/genética , Secuencia de Bases , Brachypodium/genética , Mapeo Cromosómico , Cromosomas de las Plantas/genética , Evolución Molecular , Frecuencia de los Genes , Secuenciación de Nucleótidos de Alto Rendimiento , Datos de Secuencia Molecular , Oryza/genética , Sorghum/genética , Sintenía/genéticaRESUMEN
Stagonospora nodorum glume blotch (SNG), caused by the necrotrophic fungus Stagonospora nodorum, is one of the economically important diseases of bread wheat (Triticum aestivum L.). Resistance to SNG is known to be quantitative and previous studies of a recombinant inbred line (RIL) population identified a major quantitative trait locus (QTL) for resistance to SNG on the short arm of chromosome 3B. To localize this QTL (QSng.sfr-3BS) with high resolution, we constructed a genetic map for the QTL target region using information from sequenced flow-sorted chromosomes 3B of the two parental cultivars 'Arina' and 'Forno', the physical map of chromosome 3B of cultivar 'Chinese Spring' and BAC-clone sequences. The mapping population of near-isogenic lines (NIL) was evaluated for SNG resistance in field infection tests. NILs segregated for disease resistance as well as for plant height; additionally, we observed a high environmental influence on the trait. Our analysis detected a strong negative correlation of SNG resistance and plant height. Further analysis of the target region identified two linked loci associated with SNG resistance. One of them was also associated with plant height, revealing an effect of QSng.sfr-3BS on plant height that was hidden in the RIL population. This result demonstrates an unexpectedly high genetic complexity of resistance controlled by QSng.sfr-3BS and shows the importance of the study of QTL in mendelized form in NILs.
Asunto(s)
Ascomicetos , Resistencia a la Enfermedad/genética , Genes de Plantas , Sitios de Carácter Cuantitativo , Triticum/genética , Mapeo Cromosómico , Cromosomas de las Plantas/genética , Sitios Genéticos , Marcadores Genéticos , Fenotipo , Enfermedades de las Plantas/microbiología , Triticum/microbiologíaRESUMEN
To improve our understanding of the organization and evolution of the wheat (Triticum aestivum) genome, we sequenced and annotated 13-Mb contigs (18.2 Mb) originating from different regions of its largest chromosome, 3B (1 Gb), and produced a 2x chromosome survey by shotgun Illumina/Solexa sequencing. All regions carried genes irrespective of their chromosomal location. However, gene distribution was not random, with 75% of them clustered into small islands containing three genes on average. A twofold increase of gene density was observed toward the telomeres likely due to high tandem and interchromosomal duplication events. A total of 3222 transposable elements were identified, including 800 new families. Most of them are complete but showed a highly nested structure spread over distances as large as 200 kb. A succession of amplification waves involving different transposable element families led to contrasted sequence compositions between the proximal and distal regions. Finally, with an estimate of 50,000 genes per diploid genome, our data suggest that wheat may have a higher gene number than other cereals. Indeed, comparisons with rice (Oryza sativa) and Brachypodium revealed that a high number of additional noncollinear genes are interspersed within a highly conserved ancestral grass gene backbone, supporting the idea of an accelerated evolution in the Triticeae lineages.
Asunto(s)
Elementos Transponibles de ADN , Evolución Molecular , Genoma de Planta , Triticum/genética , Cromosomas Artificiales Bacterianos , Cromosomas de las Plantas , Mapeo Contig , ADN de Plantas/genética , Duplicación de Gen , Genes de Plantas , Datos de Secuencia Molecular , Familia de Multigenes , Análisis de Secuencia de ADN , Telómero/genéticaRESUMEN
Bread wheat (Triticum aestivum L.) is a major crop and its genome is one of the largest ever assembled at reference-quality level. It is 15 Gb, hexaploid, with 85% of transposable elements (TEs). Wheat genetic diversity was mainly focused on genes and little is known about the extent of genomic variability affecting TEs, transposition rate, and the impact of polyploidy. Multiple chromosome-scale assemblies are now available for bread wheat and for its tetraploid and diploid wild relatives. In this study, we computed base pair-resolved, gene-anchored, whole genome alignments of A, B, and D lineages at different ploidy levels in order to estimate the variability that affects the TE space. We used assembled genomes of 13 T. aestivum cultivars (6x = AABBDD) and a single genome for Triticum durum (4x = AABB), Triticum dicoccoides (4x = AABB), Triticum urartu (2x = AA), and Aegilops tauschii (2x = DD). We show that 5%-34% of the TE fraction is variable, depending on the species divergence. Between 400 and 13,000 novel TE insertions per subgenome were detected. We found lineage-specific insertions for nearly all TE families in di-, tetra-, and hexaploids. No burst of transposition was observed and polyploidization did not trigger any boost of transposition. This study challenges the prevailing idea of wheat TE dynamics and is more in agreement with an equilibrium model of evolution.
Asunto(s)
Elementos Transponibles de ADN , Triticum , Triticum/genética , Genoma de Planta , Poliploidía , Evolución MolecularRESUMEN
Introduction: Meiotic recombination (or crossover, CO) is essential for gamete fertility as well as for alleles and genes reshuffling that is at the heart of plant breeding. However, CO remains a limited event, which strongly hampers the rapid production of original and improved cultivars. RecQ4 is a gene encoding a helicase protein that, when mutated, contributes to improve recombination rate in all species where it has been evaluated so far. Methods: In this study, we developed wheat (Triticum aestivum L.) triple mutant (TM) for the three homoeologous copies of TaRecQ4 as well as mutants for two copies and heterozygous for the last one (Htz-A, Htz-B, Htz-D). Results: Phenotypic observation revealed a significant reduction of fertility and pollen viability in TM and Htz-B plants compared to wild type plants suggesting major defects during meiosis. Cytogenetic analyses of these plants showed that complete absence of TaRecQ4 as observed in TM plants, leads to chromosome fragmentation during the pachytene stage, resulting in problems in the segregation of chromosomes during meiosis. Htz-A and Htz-D mutants had an almost normal meiotic progression indicating that both TaRecQ4-A and TaRecQ4-D copies are functional and that there is no dosage effect for TaRecQ4 in bread wheat. On the contrary, the TaRecQ4-B copy seems knocked-out, probably because of a SNP leading to a Threonine>Alanine change at position 539 (T539A) of the protein, that occurs in the crucial helicase ATP bind/DEAD/ResIII domain which unwinds nucleic acids. Occurrence of numerous multivalents in TM plants suggests that TaRecQ4 could also play a role in the control of homoeologous recombination. Discussion: These findings provide a foundation for further molecular investigations into wheat meiosis regulation to fully understand the underlying mechanisms of how TaRecQ4 affects chiasma formation, as well as to identify ways to mitigate these defects and enhance both homologous and homoeologous recombination efficiency in wheat.
RESUMEN
As genome resources for wheat (Triticum L.) expand at a rapid pace, it is important to update targeted sequencing tools to incorporate improved sequence assemblies and regions of previously unknown significance. Here, we developed an updated regulatory region enrichment capture for wheat and other Triticeae species. The core target space includes sequences from 2-Kbp upstream of each gene predicted in the Chinese Spring wheat genome (IWGSC RefSeq Annotation v1.0) and regions of open chromatin identified with an assay for transposase-accessible chromatin using sequencing from wheat leaf and root samples. To improve specificity, we aggressively filtered candidate repetitive sequences using a combination of nucleotide basic local alignment search tool (BLASTN) searches to the Triticeae Repetitive Sequence Database (TREP), identification of regions with read over-coverage from previous target enrichment experiments, and k-mer frequency analyses. The final design comprises 216.5 Mbp of predicted hybridization space in hexaploid wheat and showed increased specificity and coverage of targeted sequences relative to previous protocols. Test captures on hexaploid and tetraploid wheat and other diploid cereals show that the assay has broad potential utility for cost-effective promoter and open chromatin resequencing and general-purpose genotyping of various Triticeae species.
Asunto(s)
Genoma de Planta , Triticum , Triticum/genética , Análisis Costo-Beneficio , Poliploidía , Regiones Promotoras Genéticas , CromatinaRESUMEN
BACKGROUND: Sequencing projects using a clone-by-clone approach require the availability of a robust physical map. The SNaPshot technology, based on pair-wise comparisons of restriction fragments sizes, has been used recently to build the first physical map of a wheat chromosome and to complete the maize physical map. However, restriction fragments sizes shared randomly between two non-overlapping BACs often lead to chimerical contigs and mis-assembled BACs in such large and repetitive genomes. Whole Genome Profiling (WGP™) was developed recently as a new sequence-based physical mapping technology and has the potential to limit this problem. RESULTS: A subset of the wheat 3B chromosome BAC library covering 230 Mb was used to establish a WGP physical map and to compare it to a map obtained with the SNaPshot technology. We first adapted the WGP-based assembly methodology to cope with the complexity of the wheat genome. Then, the results showed that the WGP map covers the same length than the SNaPshot map but with 30% less contigs and, more importantly with 3.5 times less mis-assembled BACs. Finally, we evaluated the benefit of integrating WGP tags in different sequence assemblies obtained after Roche/454 sequencing of BAC pools. We showed that while WGP tag integration improves assemblies performed with unpaired reads and with paired-end reads at low coverage, it does not significantly improve sequence assemblies performed at high coverage (25x) with paired-end reads. CONCLUSIONS: Our results demonstrate that, with a suitable assembly methodology, WGP builds more robust physical maps than the SNaPshot technology in wheat and that WGP can be adapted to any genome. Moreover, WGP tag integration in sequence assemblies improves low quality assembly. However, to achieve a high quality draft sequence assembly, a sequencing depth of 25x paired-end reads is required, at which point WGP tag integration does not provide additional scaffolding value. Finally, we suggest that WGP tags can support the efficient sequencing of BAC pools by enabling reliable assignment of sequence scaffolds to their BAC of origin, a feature that is of great interest when using BAC pooling strategies to reduce the cost of sequencing large genomes.
Asunto(s)
Genoma de Planta , Mapeo Físico de Cromosoma , Análisis de Secuencia de ADN/métodos , Triticum/genética , Cromosomas Artificiales Bacterianos , Cromosomas de las Plantas , Mapeo Contig , Elementos Transponibles de ADN , Alineación de SecuenciaRESUMEN
In bread wheat (Triticum aestivum L.), initial studies using deletion lines indicated that crossover (CO) events occur mainly in the telomeric regions of the chromosomes with a possible correlation with the presence of genes. However, little is known about the distribution of COs at the sequence level. To investigate this, we studied in detail the pattern of COs along a contig of 3.110 Mb using two F2 segregating populations (Chinese Spring × Renan (F2-CsRe) and Chinese Spring × Courtot (F2-CsCt)) each containing ~2,000 individuals. The availability of the sequence of the contig from Cs enabled the development of 318 markers among which 23 co-dominant polymorphic markers (11 SSRs and 12 SNPs) were selected for CO distribution analyses. The distribution of CO events was not homogeneous throughout the contig, ranging from 0.05 to 2.77 cM/Mb, but was conserved between the two populations despite very different contig recombination rate averages (0.82 cM/Mb in F2-CsRe vs 0.35 cM/Mb in F2-CsCt). The CO frequency was correlated with the percentage of coding sequence in Cs and with the polymorphism rate between Cs and Re or Ct in both populations, indicating an impact of these two factors on CO distribution. At a finer scale, COs were found in a region covering 2.38 kb, spanning a gene coding for a glycosyl transferase (Hga3), suggesting the presence of a CO hotspot. A non-crossover event covering at least 453 bp was also identified in the same interval. From these results, we can conclude that gene content could be one of the factors driving recombination in bread wheat.
Asunto(s)
Intercambio Genético , Meiosis , Recombinación Genética , Triticum/genética , Mapeo Cromosómico , Cromosomas de las Plantas/genética , Polimorfismo de Nucleótido SimpleRESUMEN
BACKGROUND: Polyploidization is considered one of the main mechanisms of plant genome evolution. The presence of multiple copies of the same gene reduces selection pressure and permits sub-functionalization and neo-functionalization leading to plant diversification, adaptation and speciation. In bread wheat, polyploidization and the prevalence of transposable elements resulted in massive gene duplication and movement. As a result, the number of genes which are non-collinear to genomes of related species seems markedly increased in wheat. RESULTS: We used new-generation sequencing (NGS) to generate sequence of a Mb-sized region from wheat chromosome arm 3DS. Sequence assembly of 24 BAC clones resulted in two scaffolds of 1,264,820 and 333,768 bases. The sequence was annotated and compared to the homoeologous region on wheat chromosome 3B and orthologous loci of Brachypodium distachyon and rice. Among 39 coding sequences in the 3DS scaffolds, 32 have a homoeolog on chromosome 3B. In contrast, only fifteen and fourteen orthologs were identified in the corresponding regions in rice and Brachypodium, respectively. Interestingly, five pseudogenes were identified among the non-collinear coding sequences at the 3B locus, while none was found at the 3DS locus. CONCLUSION: Direct comparison of two Mb-sized regions of the B and D genomes of bread wheat revealed similar rates of non-collinear gene insertion in both genomes with a majority of gene duplications occurring before their divergence. Relatively low proportion of pseudogenes was identified among non-collinear coding sequences. Our data suggest that the pseudogenes did not originate from insertion of non-functional copies, but were formed later during the evolution of hexaploid wheat. Some evidence was found for gene erosion along the B genome locus.
Asunto(s)
Cromosomas de las Plantas/genética , Evolución Molecular , Genoma de Planta/genética , Triticum/genética , Brachypodium/genética , Cromosomas Artificiales Bacterianos , Mapeo Contig , ADN de Plantas/genética , Duplicación de Gen , Sitios Genéticos/genética , Mutagénesis Insercional , Oryza/genética , Filogenia , Poliploidía , Seudogenes/genética , Análisis de Secuencia de ADNRESUMEN
To improve our understanding of the organization and regulation of the wheat (Triticum aestivum) gene space, we established a transcription map of a wheat chromosome (3B) by hybridizing a newly developed wheat expression microarray with bacterial artificial chromosome pools from a new version of the 3B physical map as well as with cDNA probes derived from 15 RNA samples. Mapping data for almost 3,000 genes showed that the gene space spans the whole chromosome 3B with a 2-fold increase of gene density toward the telomeres due to an increase in the number of genes in islands. Comparative analyses with rice (Oryza sativa) and Brachypodium distachyon revealed that these gene islands are composed mainly of genes likely originating from interchromosomal gene duplications. Gene Ontology and expression profile analyses for the 3,000 genes located along the chromosome revealed that the gene islands are enriched significantly in genes sharing the same function or expression profile, thereby suggesting that genes in islands acquired shared regulation during evolution. Only a small fraction of these clusters of cofunctional and coexpressed genes was conserved with rice and B. distachyon, indicating a recent origin. Finally, genes with the same expression profiles in remote islands (coregulation islands) were identified suggesting long-distance regulation of gene expression along the chromosomes in wheat.
Asunto(s)
Cromosomas Artificiales Bacterianos/genética , ADN Intergénico/genética , Genoma de Planta/genética , Islas Genómicas/fisiología , Mapeo Físico de Cromosoma/métodos , Triticum/genética , Secuencia de Bases , Brachypodium/genética , Centrómero/genética , Cromosomas de las Plantas/genética , ADN de Plantas/química , ADN de Plantas/genética , Evolución Molecular , Duplicación de Gen , Perfilación de la Expresión Génica , Regulación de la Expresión Génica de las Plantas/genética , Genes de Plantas/genética , Islas Genómicas/genética , Datos de Secuencia Molecular , Familia de Multigenes , Análisis de Secuencia por Matrices de Oligonucleótidos , Oryza/genética , Poliploidía , Análisis de Secuencia de ADN , Telómero/genética , TranscriptomaRESUMEN
Understanding meiotic crossover (CO) variation in crops like bread wheat (Triticum aestivum L.) is necessary as COs are essential to create new, original and powerful combinations of genes for traits of agronomical interest. We cytogenetically characterized a set of wheat aneuploid lines missing part or all of chromosome 3B to identify the most influential regions for chiasma formation located on this chromosome. We showed that deletion of the short arm did not change the total number of chiasmata genome-wide, whereas this latter was reduced by ~35% while deleting the long arm. Contrary to what was hypothesized in a previous study, deletion of the long arm does not disturb the initiation of the synaptonemal complex (SC) in early meiotic stages. However, progression of the SC is abnormal, and we never observed its completion when the long arm is deleted. By studying six different deletion lines (missing different parts of the long arm), we revealed that at least two genes located in both the proximal (C-3BL2-0.22) and distal (3BL7-0.63-1.00) deletion bins are involved in the control of chiasmata, each deletion reducing the number of chiasmata by ~15%. We combined sequence analyses of deletion bins with RNA-Seq data derived from meiotic tissues and identified a set of genes for which at least the homoeologous copy on chromosome 3B is expressed and which are involved in DNA processing. Among these genes, eight (CAP-E1/E2, DUO1, MLH1, MPK4, MUS81, RTEL1, SYN4, ZIP4) are known to be involved in the recombination pathway.
RESUMEN
BACKGROUND: The sequencing of the wheat (Triticum aestivum) genome has been a methodological challenge for many years owing to its large size (15.5 Gb), repeat content, and hexaploidy. Many initiatives aiming at obtaining a reference genome of cultivar Chinese Spring have been launched in the past years and it was achieved in 2018 as the result of a huge effort to combine short-read sequencing with many other resources. Reference-quality genome assemblies were then produced for other accessions, but the rapid evolution of sequencing technologies offers opportunities to reach high-quality standards at lower cost. RESULTS: Here, we report on an optimized procedure based on long reads produced on the Oxford Nanopore Technology PromethION device to assemble the genome of the French bread wheat cultivar Renan. CONCLUSIONS: We provide the most contiguous chromosome-scale assembly of a bread wheat genome to date. Coupled with an annotation based on RNA-sequencing data, this resource will be valuable for the crop community and will facilitate the rapid selection of agronomically important traits. We also provide a framework to generate high-quality assemblies of complex genomes using ONT.
Asunto(s)
Genoma , Triticum , Cruzamiento , Cromosomas , Análisis de Secuencia de ADN/métodos , Triticum/genéticaRESUMEN
Bread wheat is an allohexaploid species originating from two successive and recent rounds of hybridization between three diploid species that were very similar in terms of chromosome number, genome size, TE content, gene content and synteny. As a result, it has long been considered that most of the genes were in three pairs of homoeologous copies. However, these so-called triads represent only one half of wheat genes, while the remaining half belong to homoeologous groups with various number of copies across subgenomes. In this study, we examined and compared the distribution, conservation, function, expression and epigenetic profiles of triads with homoeologous groups having undergone a deletion (dyads) or a duplication (tetrads) in one subgenome. We show that dyads and tetrads are mostly located in distal regions and have lower expression level and breadth than triads. Moreover, they are enriched in functions related to adaptation and more associated with the repressive H3K27me3 modification. Altogether, these results suggest that triads mainly correspond to housekeeping genes and are part of the core genome, while dyads and tetrads belong to the Triticeae dispensable genome. In addition, by comparing the different categories of dyads and tetrads, we hypothesize that, unlike most of the allopolyploid species, subgenome dominance and biased fractionation are absent in hexaploid wheat. Differences observed between the three subgenomes are more likely related to two successive and ongoing waves of post-polyploid diploidization, that had impacted A and B more significantly than D, as a result of the evolutionary history of hexaploid wheat.
Asunto(s)
Variaciones en el Número de Copia de ADN , Triticum , Genoma de Planta , Humanos , Poliploidía , Sintenía , Triticum/genéticaRESUMEN
BACKGROUND: Because of its size, allohexaploid nature and high repeat content, the wheat genome has always been perceived as too complex for efficient molecular studies. We recently constructed the first physical map of a wheat chromosome (3B). However gene mapping is still laborious in wheat because of high redundancy between the three homoeologous genomes. In contrast, in the closely related diploid species, barley, numerous gene-based markers have been developed. This study aims at combining the unique genomic resources developed in wheat and barley to decipher the organisation of gene space on wheat chromosome 3B. RESULTS: Three dimensional pools of the minimal tiling path of wheat chromosome 3B physical map were hybridised to a barley Agilent 15K expression microarray. This led to the fine mapping of 738 barley orthologous genes on wheat chromosome 3B. In addition, comparative analyses revealed that 68% of the genes identified were syntenic between the wheat chromosome 3B and barley chromosome 3 H and 59% between wheat chromosome 3B and rice chromosome 1, together with some wheat-specific rearrangements. Finally, it indicated an increasing gradient of gene density from the centromere to the telomeres positively correlated with the number of genes clustered in islands on wheat chromosome 3B. CONCLUSION: Our study shows that novel structural genomics resources now available in wheat and barley can be combined efficiently to overcome specific problems of genetic anchoring of physical contigs in wheat and to perform high-resolution comparative analyses with rice for deciphering the organisation of the wheat gene space.