ABSTRACT
Polyploidy is a persistent phenomenon in angiosperm genome evolution that is hypothesized to have contributed to the diversity of extant flowering plants. Brassica napus, one of the world's most important angiosperm oilseed species, originated from the interspecific hybridization of Brassica rapa (An) and Brassica oleracea (Cn). While the trends of genome dominance in transcriptomics are beginning to emerge, less is known about the epigenetic and small RNA landscapes in polyploids during reproductive development. The seed is the pivotal developmental transition into the new sporophytic generation, and experiences substantial epigenetic modifications over time. Here, we investigated the prevalence of bias in the contexts of DNA methylation and small interfering (si)RNA profiles in both subgenomes (An and Cn), as well as the ancestral fractionated genomes across B. napus seed development. We report ubiquitous Cn subgenome bias of siRNA expression and cytosine methylation, with DNA methylation being particularly abundant on gene promoters in the Cn subgenome. Further, we provide evidence that siRNA transcriptional patterns were conserved within the ancestral triplicated subgenomes of B. napus, but not across the An and Cn subgenomes. We discuss how methylation patterns in the B. napus seed relate to genes, promoter regions, siRNA loci and transposable elements through the lens of genome fractionation and polyploidization. Taken together, we provide evidence for epigenetic regulation selectively silencing the Cn subgenome during seed development, and explore the impact of genome fractionation on the epigenetic components of the B. napus seed.
Subject(s)
Brassica napus , Brassica napus/metabolism , DNA Methylation/genetics , Epigenesis, Genetic/genetics , Genome, Plant/genetics , Genomics , Polyploidy , RNA, Small Interfering/genetics , RNA, Small Interfering/metabolism , Seeds/genetics , Seeds/metabolism
ABSTRACT
We profiled the global gene expression landscape across the reproductive lifecycle of Brassica napus. Comparative analysis of this nascent amphidiploid revealed the contribution of each subgenome to plant reproduction. Whole-genome transcription factor networks identified BZIP11 as a transcriptional regulator of early B. napus seed development. Knockdown of BZIP11 using RNA interference resulted in a similar reduction in gene activity of predicted gene targets, and a reproductive-lethal phenotype. Global mRNA profiling revealed lower accumulation of Cn subgenome transcripts relative to the An subgenome. Subgenome-specific transcription factor networks identified distinct transcription factor families enriched in each of the An and Cn subgenomes early in seed development. Analysis of laser-microdissected seed subregions further revealed subgenome expression dynamics in the embryo, endosperm and seed coat of early stage seeds. Transcription factors predicted to be regulators encoded by the An subgenome were expressed primarily in the seed coat, whereas regulators encoded by the Cn subgenome were expressed primarily in the embryo. The data suggest that subgenome bias is a characteristic feature of the B. napus seed throughout development, and that such bias might not be universal across the embryo, endosperm and seed coat of the developing seed. Transcriptional networks spanning both the An and Cn genomes of the whole B. napus seed can identify valuable targets for seed development research, and -omics level approaches to studying gene regulation in B. napus can benefit from both broad and high-resolution analyses.
Subject(s)
Brassica napus/growth & development , Brassica napus/genetics , Plant Development/genetics , Seeds/growth & development , Seeds/genetics , Transcription Factors/genetics , Crops, Agricultural/genetics , Crops, Agricultural/growth & development , Gene Expression Regulation, Developmental , Gene Expression Regulation, Plant , Genes, Plant , Genome, Plant
ABSTRACT
Camelina neglecta is a diploid species from the genus Camelina, which includes the versatile oilseed Camelina sativa. These species are closely related to Arabidopsis thaliana and the economically important Brassica crop species, making this genus a useful platform to dissect traits of agronomic importance while providing a tool to study the evolution of polyploids. A highly contiguous chromosome-level genome sequence of C. neglecta with an N50 size of 29.1 Mb was generated utilizing Pacific Biosciences (PacBio, Menlo Park, CA) long-read sequencing followed by chromosome conformation phasing. Comparison of the genome with that of C. sativa shows remarkable coincidence with subgenome 1 of the hexaploid, with only one major chromosomal rearrangement separating the two. Synonymous substitution rate analysis of the predicted 34 061 genes suggested subgenome 1 of C. sativa directly descended from C. neglecta around 1.2 mya. Higher functional divergence of genes in the hexaploid as evidenced by the greater number of unique orthogroups, and differential composition of resistant gene analogs, might suggest an immediate adaptation strategy after genome merger. The absence of genome bias in gene fractionation among the subgenomes of C. sativa in comparison with C. neglecta, and the complete lack of fractionation of meiosis-specific genes attests to the neopolyploid status of C. sativa. The assembled genome will provide a tool to further study genome evolution processes in the Camelina genus and potentially allow for the identification and exploitation of novel variation for Camelina crop improvement.
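The N50 statistic quoted in the abstract above can be illustrated with a minimal sketch. It assumes the common definition (the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the assembly). The function name `n50` and the toy lengths are hypothetical and are not taken from the C. neglecta assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Sequences are visited from longest to shortest; the N50 is the length
    of the sequence at which the running total first reaches half of the
    summed length of all sequences.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("lengths must be non-empty")


# Hypothetical scaffold lengths in Mb (illustration only, not real data).
toy_lengths_mb = [40, 30, 20, 10]
toy_n50 = n50(toy_lengths_mb)
print(toy_n50)
```

With these toy values the two longest sequences (40 + 30 Mb) are the first to cover half of the 100 Mb total, so the sketch reports the second of them as the N50.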
Subject(s)
Arabidopsis , Brassica , Brassicaceae , Neglecta , Diploidy , Brassicaceae/genetics , Arabidopsis/genetics , Brassica/genetics , Genome, Plant
ABSTRACT
MAIN CONCLUSION: Genetic variation in seed protein composition, seed protein gene expression and predictions of seed protein physiochemical properties were documented in C. sativa and other Camelina species. Seed protein diversity was examined in six Camelina species (C. hispida, C. laxa, C. microcarpa, C. neglecta, C. rumelica and C. sativa). Differences were observed in seed protein electrophoretic profiles, total seed protein content and amino acid composition between the species. Genes encoding major seed proteins (cruciferins, napins, oleosins and vicilins) were catalogued for C. sativa and RNA-Seq analysis established the expression patterns of these and other genes in developing seed from anthesis through to maturation. Examination of 187 C. sativa accessions revealed limited variation in seed protein electrophoretic profiles, though sufficient to group the majority into classes based on high MW protein profiles corresponding to the cruciferin region. C. sativa possessed four distinct types of cruciferins, named CsCRA, CsCRB, CsCRC and CsCRD, which corresponded to orthologues in Arabidopsis thaliana with members of each type encoded by homeologous genes on the three C. sativa sub-genomes. Total protein content and amino acid composition varied only slightly; however, RNA-Seq analysis revealed that CsCRA and CsCRB genes contributed > 95% of the cruciferin transcripts in most lines, whereas CsCRC genes were the most highly expressed cruciferin genes in others, including the type cultivar DH55. This was confirmed by proteomics analyses. Cruciferin is the most abundant seed protein and contributes the most to functionality. Modelling of the C. sativa cruciferins indicated that each type possesses different physiochemical attributes that were predicted to impart unique functional properties. As such, opportunities exist to create C. sativa cultivars with seed protein profiles tailored to specific technical applications.
Subject(s)
Arabidopsis , Brassicaceae , Amino Acids/metabolism , Arabidopsis/genetics , Brassicaceae/genetics , Brassicaceae/metabolism , Genetic Variation , Seeds/genetics , Seeds/metabolism
ABSTRACT
In plants, race-specific defence against microbial pathogens is facilitated by resistance (R) genes which correspond to specific pathogen avirulence genes. This study reports the cloning of a blackleg R gene from Brassica napus (canola), Rlm9, which encodes a wall-associated kinase-like (WAKL) protein, a newly discovered class of race-specific plant RLK resistance genes. Rlm9 provides race-specific resistance against isolates of Leptosphaeria maculans carrying the corresponding avirulence gene AvrLm5-9, representing only the second WAKL-type R gene described to date. The Rlm9 protein is predicted to be cell membrane-bound and while not conclusive, our work did not indicate direct interaction with AvrLm5-9. Rlm9 forms part of a distinct evolutionary family of RLK proteins in B. napus, and while little is yet known about WAKL function, the Brassica-Leptosphaeria pathosystem may prove to be a model system by which the mechanism of fungal avirulence protein recognition by WAKL-type R genes can be determined.
Subject(s)
Brassica napus/genetics , Disease Resistance/genetics , Leptosphaeria/pathogenicity , Plant Diseases/immunology , Protein Kinases/metabolism , Brassica napus/immunology , Brassica napus/microbiology , Plant Diseases/microbiology , Plant Proteins/genetics , Plant Proteins/metabolism , Protein Kinases/genetics , Species Specificity , Virulence
ABSTRACT
Genome structural variation (SV) contributes strongly to trait variation in eukaryotic species and may have an even higher functional significance than single-nucleotide polymorphism (SNP). In recent years, a number of studies have associated large chromosomal-scale SV, ranging from hundreds of kilobases up to a few megabases, with key agronomic traits in plant genomes. However, there has been little or no effort towards cataloguing small- (30-10 000 bp) to mid-scale (10 000-30 000 bp) SV and their impact on evolution and adaptation-related traits in plants. This might be attributed to the complex and highly duplicated nature of plant genomes, which makes them difficult to assess using high-throughput genome screening methods. Here, we describe how long-read sequencing technologies can overcome this problem, revealing a surprisingly high level of widespread, small- to mid-scale SV in a major allopolyploid crop species, Brassica napus. We found that up to 10% of all genes were affected by small- to mid-scale SV events. Nearly half of these SV events ranged between 100 bp and 1000 bp, which makes them challenging to detect using short-read Illumina sequencing. Examples demonstrating the contribution of such SV towards eco-geographical adaptation and disease resistance in oilseed rape suggest that revisiting complex plant genomes using medium-coverage long-read sequencing might reveal unexpected levels of functional gene variation, with major implications for trait regulation and crop improvement.
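The size classes named in the abstract above can be expressed as a small sketch. The abstract lists 10 000 bp in both the small and the mid-scale range, so assigning exactly 10 000 bp to the mid-scale class is an assumption made here. The function name `classify_sv`, the labels for events outside the two named ranges, and the example lengths are all hypothetical.

```python
from collections import Counter


def classify_sv(length_bp):
    """Assign a structural variant to a size class by its length in bp.

    Ranges follow the abstract: small = 30 to 10 000 bp, mid = 10 000 to
    30 000 bp. Assumption: a length of exactly 10 000 bp counts as "mid".
    """
    if length_bp < 30:
        return "below range"
    if length_bp < 10_000:
        return "small"
    if length_bp <= 30_000:
        return "mid"
    return "large"


# Hypothetical SV lengths in bp (illustration only, not data from the study).
example_lengths = [45, 350, 800, 12_000, 29_000, 150_000]
counts = Counter(classify_sv(n) for n in example_lengths)
print(dict(counts))
```

A tally of this kind is one simple way to see how many events fall into the size window that the abstract describes as hard to detect with short reads.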
Subject(s)
Brassica napus , Polyploidy , Brassica napus/genetics , Disease Resistance/genetics , Genome, Plant/genetics , Humans , Polymorphism, Single Nucleotide/genetics
ABSTRACT
Ensuring faithful homologous recombination in allopolyploids is essential to maintain optimal fertility of the species. Variation in the ability to control aberrant pairing between homoeologous chromosomes in Brassica napus has been identified. The current study exploited the extremes of such variation to identify genetic factors that differentiate newly resynthesised B. napus, which is inherently unstable, and established B. napus, which has adapted to largely control homoeologous recombination. A segregating B. napus mapping population was analysed utilising both cytogenetic observations and high-throughput genotyping to quantify the levels of homoeologous recombination. Three quantitative trait loci (QTL) were identified that contributed to the control of homoeologous recombination in the important oilseed crop B. napus. One major QTL on BnaA9 contributed between 32 and 58% of the observed variation. This study is the first to assess homoeologous recombination and map associated QTLs resulting from deviations in normal pairing in allotetraploid B. napus. The identified QTL regions suggest candidate meiotic genes that could be manipulated in order to control this important trait and further allow the development of molecular markers to utilise this trait to exploit homoeologous recombination in a crop.
Subject(s)
Brassica napus , Brassica napus/genetics , Chromosomes, Plant/genetics , Genome, Plant , Polyploidy , Quantitative Trait Loci/genetics
ABSTRACT
BACKGROUND: Clubroot, caused by Plasmodiophora brassicae Woronin, is a very important disease of Brassica species. Management of clubroot relies heavily on genetic resistance. In a cross of Brassica nigra lines PI 219576 (highly resistant, R) × CR2748 (highly susceptible, S) to clubroot, all F1 plants were resistant to clubroot. There was a 1:1 ratio of R:S in the BC1 and 3R:1S in the F2, which indicated that a single dominant gene controlled clubroot resistance in PI 219576. This gene was designated Rcr6. Mapping of Rcr6 was performed using genome sequencing information from the A-genome of B. rapa and the B-genome of B. nigra through bulked segregant RNA sequencing (BSR-Seq) and further mapping with Kompetitive Allele Specific PCR (KASP) analysis. RESULTS: Reads of R and S bulks from BSR-Seq were initially aligned to B. rapa (A-genome; B. nigra has the B-genome), where Rcr6 was associated with chromosome A08. KASP analysis showed that Rcr6 was flanked by SNP markers homologous to the region of 14.8-15.4 Mb of chromosome A08. There were 190 genes annotated in this region, with five genes (Bra010552, Bra010588, Bra010589, Bra010590 and Bra010663) identified as encoding the toll-interleukin-1 receptor / nucleotide-binding site / leucine-rich-repeat (TIR-NBS-LRR; TNL) class of proteins. The reads from BSR-Seq were then aligned to a draft B-genome of B. nigra, where Rcr6 was mapped to chromosome B3. KASP analysis indicated that Rcr6 was located on chromosome B3 in a 0.5 Mb region from 6.1-6.6 Mb. Only one TNL gene homologous to the B. rapa gene Bra010663 was identified in the target region. This gene is a likely candidate for Rcr6. Subsequent analysis of the Rcr6 equivalent region based on a published B. nigra genome was performed. This gene is located on chromosome B7 of the published B-genome and is homologous to BniB015819. CONCLUSION: Rcr6 was the first gene identified and mapped in the B-genome of Brassica species. It resides in a genomic region homologous to chromosome A08 of the A-genome. Based on this finding, it could possibly be integrated into A08 of B. napus using marker-assisted selection with SNP markers tightly linked to Rcr6 developed in this study.
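The single-dominant-gene inference above rests on how well the observed R:S counts fit 1:1 (BC1) and 3:1 (F2) expectations. One conventional way to check such a fit is a chi-square goodness-of-fit test, sketched below. The abstract does not report plant counts or the test used, so the counts here are hypothetical, as is the function name `chi_square`. The critical value 3.841 is the standard chi-square threshold for 1 degree of freedom at a significance level of 0.05.

```python
def chi_square(observed, ratio):
    """Chi-square goodness-of-fit statistic for counts against a ratio.

    observed: counts per phenotype class, e.g. (resistant, susceptible)
    ratio:    expected Mendelian ratio in the same order, e.g. (3, 1)
    """
    total = sum(observed)
    ratio_total = sum(ratio)
    expected = [total * r / ratio_total for r in ratio]
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


# Chi-square critical value for 1 degree of freedom at alpha = 0.05.
CRITICAL_DF1_P05 = 3.841

# Hypothetical counts (resistant, susceptible); not taken from the study.
bc1_counts = (52, 48)
f2_counts = (148, 52)

bc1_stat = chi_square(bc1_counts, (1, 1))
f2_stat = chi_square(f2_counts, (3, 1))

# A statistic below the critical value means the ratio is not rejected.
print(bc1_stat < CRITICAL_DF1_P05, f2_stat < CRITICAL_DF1_P05)
```

If both statistics fall below the critical value, the data are consistent with the 1:1 and 3:1 ratios expected for a single dominant gene, which is the pattern the abstract describes.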
Subject(s)
Mustard Plant/genetics , Mustard Plant/microbiology , Plant Diseases/parasitology , Plant Proteins/genetics , Plasmodiophorida/physiology , Polymorphism, Single Nucleotide , Chromosome Mapping , Disease Resistance/genetics , Genetic Markers , Plant Proteins/metabolism , RNA, Plant/analysis , Sequence Analysis, RNA
ABSTRACT
BACKGROUND: CRISPR/Cas9 gene editing has become a revolutionary technique for crop improvement as it can facilitate fast and efficient genetic changes without the retention of transgene components in the final plant line. Lack of robust bioinformatics tools to facilitate the design of highly specific functional guide RNAs (gRNAs) and prediction of off-target sites in wheat is currently an obstacle to effective application of CRISPR technology to wheat improvement. DESCRIPTION: We have developed a web-based bioinformatics tool to design specific gRNAs for genome editing and transcriptional regulation of gene expression in wheat. A collaborative study between the Broad Institute and Microsoft Research used large-scale empirical evidence to devise algorithms (Doench et al., 2016, Nature Biotechnology 34, 184-191) for predicting the on-target activity and off-target potential of CRISPR/SpCas9 (Streptococcus pyogenes Cas9). We applied these prediction models to determine on-target specificity and potential off-target activity for individual gRNAs targeting specific loci in the wheat genome. The genome-wide gRNA mappings and the corresponding Doench scores predictive of the on-target and off-target activities were used to create a gRNA database, which serves as the data source for the web application termed WheatCRISPR. CONCLUSION: The WheatCRISPR tool allows researchers to browse all possible gRNAs targeting a gene or sequence of interest and select effective gRNAs based on their predicted high on-target and low off-target activity scores, as well as other characteristics such as position within the targeted gene. It is publicly available at https://crispr.bioinfo.nrc.ca/WheatCrispr/ .
Subject(s)
CRISPR-Cas Systems , Gene Editing/methods , RNA, Guide, Kinetoplastida , Triticum/genetics , Computational Biology/methods , Databases, Genetic , Gene Targeting , Genome, Plant , Internet
ABSTRACT
KEY MESSAGE: Structural genome variation is a major determinant of useful trait diversity. We describe how genome analysis methods are enabling discovery of trait-associated structural variants and their potential impact on breeding. As our understanding of complex crop genomes continues to grow, there is growing evidence that structural genome variation plays a major role in determining traits important for breeding and agriculture. Identifying the extent and impact of structural variants in crop genomes is becoming increasingly feasible with ongoing advances in the sophistication of genome sequencing technologies, particularly as it becomes easier to generate accurate long sequence reads on a genome-wide scale. In this article, we discuss the origins of structural genome variation in crops from ancient and recent genome duplication and polyploidization events and review high-throughput methods to assay such variants in crop populations in order to find associations with phenotypic traits. There is increasing evidence from such studies that gene presence-absence and copy number variation resulting from segmental chromosome exchanges may be at the heart of adaptive variation of crops to counter abiotic and biotic stress factors. We present examples from major crops that demonstrate the potential of pangenomic diversity as a key resource for future plant breeding for resilience and sustainability.
Subject(s)
Crops, Agricultural/genetics , Genomic Structural Variation , Quantitative Trait, Heritable , Genome, Plant , Polymorphism, Single Nucleotide/genetics , Sequence Analysis, DNA
ABSTRACT
Homoeologous exchanges (HEs) have been shown to generate novel gene combinations and phenotypes in a range of polyploid species. Gene presence/absence variation (PAV) is also a major contributor to genetic diversity. In this study, we show that there is an association between these two events, particularly in recent Brassica napus synthetic accessions, and that these represent a novel source of genetic diversity, which can be captured for the improvement of this important crop species. By assembling the pangenome of B. napus, we show that 38% of the genes display PAV behaviour, with some of these variable genes predicted to be involved in important agronomic traits including flowering time, disease resistance, acyl lipid metabolism and glucosinolate metabolism. This study is a first and provides a detailed characterization of the association between HEs and PAVs in B. napus at the pangenome level.
Subject(s)
Brassica napus/genetics , Gene Conversion/genetics , Genes, Plant/genetics , Diploidy , Gene Deletion , Gene Duplication , Genetic Variation/genetics , Genome, Plant/genetics , Quantitative Trait, Heritable
ABSTRACT
Camelina sativa is currently being embraced as a viable industrial bio-platform crop due to a number of desirable agronomic attributes and the unique fatty acid profile of the seed oil that has applications for food, feed and biofuel. The recent completion of the reference genome sequence of C. sativa identified a young hexaploid genome. To complement this work, we have generated a genome-wide developmental transcriptome map by RNA sequencing of 12 different tissues covering major developmental stages during the life cycle of C. sativa. We have generated a digital atlas of this comprehensive transcriptome resource that enables interactive visualization of expression data through a searchable database of electronic fluorescent pictographs (eFP browser). An analysis of this dataset supported expression of 88% of the annotated genes in C. sativa and provided a global overview of the complex architecture of temporal and spatial gene expression patterns active during development. Conventional differential gene expression analysis combined with weighted gene expression network analysis uncovered similarities as well as differences in gene expression patterns between different tissues and identified tissue-specific genes and network modules. A high-quality census of transcription factors, analysis of alternative splicing and tissue-specific genome dominance provided insight into the transcriptional dynamics and sub-genome interplay among the well-preserved triplicated repertoire of homeologous loci. The comprehensive transcriptome atlas in combination with the reference genome sequence provides a powerful resource for genomics research which can be leveraged to identify functional associations between genes and understand the regulatory networks underlying developmental processes.
Subject(s)
Biofuels , Brassicaceae/metabolism , Plant Proteins/metabolism , Transcriptome/genetics , Brassicaceae/genetics , Gene Expression Regulation, Plant/genetics , Gene Expression Regulation, Plant/physiology , Plant Proteins/genetics , Polyploidy , Transcription Factors/genetics , Transcription Factors/metabolism
ABSTRACT
Polyploidy, the possession of multiple sets of chromosomes, has been a predominant factor in the evolution and success of the angiosperms. Although artificially formed allopolyploids show a high rate of genome rearrangement, the genomes of cultivars and germplasm used for crop breeding were assumed stable and genome structural variation under the artificial selection process of commercial breeding has remained little studied. Here, we show, using a repurposed visualization method based on transcriptome sequence data, that genome structural rearrangement occurs frequently in varieties of three polyploid crops (oilseed rape, mustard rape and bread wheat), meaning that the extent of genome structural variation present in commercial crops is much higher than expected. Exchanges were found to occur most frequently where homoeologous chromosome segments are collinear to telomeres and in material produced as doubled haploids. The new insights into genome structural evolution enable us to reinterpret the results of recent studies and implicate homoeologous exchanges, not deletions, as being responsible for variation controlling important seed quality traits in rapeseed. Having begun to identify the extent of genome structural variation in polyploid crops, we can envisage new strategies for the global challenge of broadening crop genetic diversity and accelerating adaptation, such as the molecular identification and selection of genome deletions or duplications encompassing genes with trait-controlling dosage effects.
Subject(s)
Crops, Agricultural/genetics , Genome, Plant , Polyploidy , Sequence Analysis, RNA/methods , Brassica napus/genetics , Brassica rapa/genetics , Computational Biology/methods , RNA, Messenger , Triticum/genetics
ABSTRACT
Genomic rearrangements arising during polyploidization are an important source of genetic and phenotypic variation in the recent allopolyploid crop Brassica napus. Exchanges among homoeologous chromosomes, due to interhomoeologue pairing, and deletions without compensating homoeologous duplications are observed in both natural B. napus and synthetic B. napus. Rearrangements of large or small chromosome segments induce gene copy number variation (CNV) and can potentially cause phenotypic changes. Unfortunately, complex genome restructuring is difficult to deal with in linkage mapping studies. Here, we demonstrate how high-density genetic mapping with codominant, physically anchored SNP markers can detect segmental homoeologous exchanges (HE) as well as deletions and accurately link these to QTL. We validated rearrangements detected in genetic mapping data by whole-genome resequencing of parental lines along with cytogenetic analysis using fluorescence in situ hybridization with bacterial artificial chromosome probes (BAC-FISH) coupled with PCR using primers specific to the rearranged region. Using a well-known QTL region influencing seed quality traits as an example, we confirmed that HE underlies the trait variation in a DH population involving a synthetic B. napus trait donor, and succeeded in narrowing the QTL to a small defined interval that enables delineation of key candidate genes.
Subject(s)
Brassica napus/genetics , Chromosome Mapping , Chromosomes, Plant/genetics , Phenotype , Quantitative Trait Loci/genetics , Chromosome Pairing , Chromosomes, Artificial, Bacterial/genetics , DNA Copy Number Variations , DNA, Plant/genetics , Diploidy , Gene Rearrangement , Genetic Linkage/genetics , Genome, Plant , Genotype , In Situ Hybridization, Fluorescence , Nucleic Acid Hybridization , Polymorphism, Single Nucleotide , Recombination, Genetic , Seeds/chemistry , Whole Genome Sequencing
ABSTRACT
The Brassicaceae (Cruciferae) family, owing to its remarkable species, genetic, and physiological diversity as well as its significant economic potential, has become a model for polyploidy and evolutionary studies. Utilizing extensive transcriptome pyrosequencing of diverse taxa, we established a resolved phylogeny of a subset of crucifer species. We elucidated the frequency, age, and phylogenetic position of polyploidy and lineage separation events that have marked the evolutionary history of the Brassicaceae. Besides the well-known ancient α (47 million years ago [Mya]) and β (124 Mya) paleopolyploidy events, several species were shown to have undergone a further more recent (~7 to 12 Mya) round of genome multiplication. We identified eight whole-genome duplications corresponding to at least five independent neo/mesopolyploidy events. Although the Brassicaceae family evolved from other eudicots at the beginning of the Cenozoic era of the Earth (60 Mya), major diversification occurred only during the Neogene period (0 to 23 Mya). Remarkably, the widespread species divergence, major polyploidy, and lineage separation events during Brassicaceae evolution are clustered in time around epoch transitions characterized by prolonged unstable climatic conditions. The synchronized diversification of Brassicaceae species suggests that polyploid events may have conferred higher adaptability and increased tolerance toward the drastically changing global environment, thus facilitating species radiation.
Subject(s)
Brassicaceae/genetics , Cleome/genetics , Evolution, Molecular , Genome, Plant/genetics , Base Sequence , Brassicaceae/classification , Cleome/classification , Gene Library , High-Throughput Nucleotide Sequencing , Molecular Sequence Annotation , Molecular Sequence Data , Phylogeny , Plant Leaves/classification , Plant Leaves/genetics , Polyploidy , RNA, Messenger/genetics , RNA, Plant/chemistry , RNA, Plant/genetics , Sequence Analysis, DNA , Time Factors , Transcriptome
ABSTRACT
The Brassica napus 60K Illumina Infinium™ SNP array has had huge international uptake in the rapeseed community due to the revolutionary speed of acquisition and ease of analysis of this high-throughput genotyping data, particularly when coupled with the newly available reference genome sequence. However, further utilization of this valuable resource can be optimized by better understanding the promises and pitfalls of SNP arrays. We outline how best to analyze Brassica SNP marker array data for diverse applications, including linkage and association mapping, genetic diversity and genomic introgression studies. We present data on which SNPs are locus-specific in winter, semi-winter and spring B. napus germplasm pools, rather than amplifying both an A-genome and a C-genome locus or multiple loci. Common issues that arise when analyzing array data are discussed, particularly those unique to SNP markers, along with how to deal with them in practical Brassica breeding applications.
Subject(s)
Brassica napus/genetics , Genotype , Oligonucleotide Array Sequence Analysis , Polymorphism, Single Nucleotide , Alleles , Chromosome Mapping , Evolution, Molecular , Gene Rearrangement , Genetic Linkage , Genetic Markers , Genetics, Population , Genome, Plant , Genotyping Techniques , Plant Breeding
ABSTRACT
BACKGROUND: There are three basic Brassica genomes (A, B, and C) and three parallel sets of subgenomes distinguished in the diploid Brassica (i.e.: B. rapa, A(r)A(r); B. nigra, B(ni)B(ni); B. oleracea, C(o)C(o)) and the derived allotetraploid species (i.e.: B. juncea, A(j)A(j)B(j)B(j); B. napus, A(n)A(n)C(n)C(n); B. carinata, B(c)B(c)C(c)C(c)). To understand subgenome differentiation in B. juncea in comparison to other A genome-carrying Brassica species (B. rapa and B. napus), we constructed a dense genetic linkage map of B. juncea, and conducted population genetic analysis on diverse lines of the three A-genome carrying Brassica species using a genotyping-by-sequencing approach (DArT-seq). RESULTS: A dense genetic linkage map of B. juncea was constructed using an F2 population derived from Sichuan Yellow/Purple Mustard. The map included 3329 DArT-seq markers on 18 linkage groups and covered 1579 cM with an average density of two markers per cM. Based on this map and the alignment of the marker sequences with the physical genome of Arabidopsis thaliana, we observed strong co-linearity of the ancestral blocks among the different A subgenomes but also considerable block variation. Comparative analyses at the level of genome sequences of B. rapa and B. napus, and marker sequences anchored on the genetic map of B. juncea, revealed a total of 30 potential inversion events across large segments and 20 potential translocation events among the three A subgenomes. Population genetic analysis on 26 accessions of the three A genome-carrying Brassica species showed that the genetic distance estimated between the A(j) and A(n) subgenomes was higher than that between the A(n)-A(r) and A(j)-A(r) subgenome pairs. CONCLUSIONS: The development of the dense genetic linkage map of B. juncea with informative DArT-seq marker sequences and the availability of the reference sequences of the A(r) and A(n)C(n) genomes allowed us to compare the A subgenome structure of B. juncea (A(j)). Our results suggest that strong co-linearity exists among the three A Brassica genomes (A(r), A(n) and A(j)) but with apparent subgenomic variation. Population genetic analysis on three A-genome carrying Brassica species supports the idea that B. juncea has distinct genomic diversity, and/or evolved from a different A genome progenitor than B. napus.
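The map density reported above follows directly from the figures given in the abstract (3329 markers, 18 linkage groups, 1579 cM). The short sketch below reproduces that arithmetic. The mean interval between adjacent markers is an added illustration that assumes every linkage group spans from its first to its last marker, so that the number of intervals equals markers minus linkage groups; the variable names are hypothetical.

```python
# Figures reported in the abstract.
markers = 3329
linkage_groups = 18
map_length_cm = 1579

# Average marker density across the whole map (markers per cM).
density_per_cm = markers / map_length_cm

# Assumption: each linkage group runs from its first to its last marker,
# so the number of marker intervals is markers minus linkage groups.
intervals = markers - linkage_groups
mean_interval_cm = map_length_cm / intervals

print(round(density_per_cm, 2), round(mean_interval_cm, 2))
```

The density comes out slightly above two markers per cM, which matches the rounded figure quoted in the abstract.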
Subject(s)
Brassica napus/genetics , Genome, Plant/genetics , Mustard Plant/genetics , Quantitative Trait Loci/genetics , Arabidopsis/genetics , Chromosome Mapping , Diploidy , Genetic Linkage , Genotype , Microsatellite Repeats/genetics , Sequence Analysis, DNA
ABSTRACT
BACKGROUND: The protist Plasmodiophora brassicae is a soil-borne pathogen of cruciferous species and the causal agent of clubroot disease of Brassicas including agriculturally important crops such as canola/rapeseed (Brassica napus). P. brassicae has remained an enigmatic plant pathogen and is a rare example of an obligate biotroph that resides entirely inside the host plant cell. The pathogen is the cause of severe yield losses and can render infested fields unsuitable for Brassica crop growth due to the persistence of resting spores in the soil for up to 20 years. RESULTS: To provide insight into the biology of the pathogen and its interaction with its primary host B. napus, we produced a draft genome of P. brassicae pathotypes 3 and 6 (Pb3 and Pb6) that differ in their host range. Pb3 is highly virulent on B. napus (but also infects other Brassica species) while Pb6 infects only vegetable Brassica crops. Both the Pb3 and Pb6 genomes are highly compact, each with a total size of 24.2 Mb, and contain less than 2 % repetitive DNA. Clustering of genome-wide single nucleotide polymorphisms (SNP) of Pb3, Pb6 and three additional re-sequenced pathotypes (Pb2, Pb5 and Pb8) shows a high degree of correlation of cluster grouping with host range. The Pb3 genome features significant reduction of intergenic space with multiple examples of overlapping untranslated regions (UTRs). Dependency on the host for essential nutrients is evident from the loss of genes for the biosynthesis of thiamine and some amino acids and the presence of a wide range of transport proteins, including some unique to P. brassicae. The annotated genes of Pb3 include those with a potential role in the regulation of the plant growth hormones cytokinin and auxin. The expression profile of Pb3 genes, including putative effectors, during infection and their potential role in manipulation of host defence is discussed. CONCLUSION: The P. brassicae genome sequence reveals a compact genome, a dependency of the pathogen on its host for some essential nutrients and a potential role in the regulation of host plant cytokinin and auxin. Genome annotation supported by RNA sequencing reveals significant reduction in intergenic space which, in addition to low repeat content, has likely contributed to the P. brassicae compact genome.
Subject(s)
Brassica/parasitology , Genome, Protozoan , Host-Parasite Interactions/genetics , Plasmodiophorida/genetics , Arabidopsis , Crops, Agricultural/parasitology , Cytokinins/metabolism , DNA, Protozoan/genetics , Host Specificity , Indoleacetic Acids/metabolism , Plant Diseases/parasitology , Sequence Analysis, RNA , Transcriptome
ABSTRACT
KEY MESSAGE: The Brassica napus Illumina array provides genome-wide markers linked to the available genome sequence, a significant tool for genetic analyses of the allotetraploid B. napus and its progenitor diploid genomes. A high-density single nucleotide polymorphism (SNP) Illumina Infinium array, containing 52,157 markers, was developed for the allotetraploid Brassica napus. A stringent selection process employing the short probe sequence for each SNP assay was used to limit the majority of the selected markers to those represented a minimum number of times across the highly replicated genome. As a result, approximately 60% of the SNP assays display genome specificity, resolving as three clearly separated clusters (AA, AB, and BB) when tested with a diverse range of B. napus material. This genome specificity was supported by the analysis of the diploid ancestors of B. napus, whereby 26,504 and 29,720 markers were scorable in B. oleracea and B. rapa, respectively. Forty-four percent of the assayed loci on the array were genetically mapped in a single doubled-haploid B. napus population, allowing alignment of their physical and genetic coordinates. Although strong conservation of the two positions was shown, at least 3% of the loci were genetically mapped to a homoeologous position compared to their presumed physical position in the respective genome, underscoring the importance of genetic corroboration of locus identity. In addition, the alignments identified multiple rearrangements between the diploid and tetraploid Brassica genomes. Although mostly attributed to genome assembly errors, some are likely evidence of rearrangements that occurred since the hybridisation of the progenitor genomes in the B. napus nucleus. Based on estimates for linkage disequilibrium decay, the array is a valuable tool for genetic fine mapping and genome-wide association studies in B. napus and its progenitor genomes.
Subject(s)
Brassica napus/genetics , Chromosome Mapping , Genome, Plant , Genotyping Techniques , Polymorphism, Single Nucleotide , DNA, Plant/genetics , Diploidy , Genetic Markers , Sequence Analysis, DNA , Tetraploidy
ABSTRACT
KEY MESSAGE: We provide evidence that the Brassica B genome chromosome B3 carries a blackleg resistance gene, and that B genome chromosomes were inherited over several generations along with B. napus chromosomes. Blackleg disease, caused by the fungus Leptosphaeria maculans, causes significant yield losses in Brassica napus. Brassica carinata possesses excellent resistance to this disease. To introgress blackleg resistance, crosses were made between B. napus cv. Westar and B. carinata. The interspecific hybrids were backcrossed twice to Westar and self-pollinated three times to produce BC2S3 families. Doubled haploid lines (DH1) were produced from one blackleg-resistant family. SSR markers were used to study the association between B genome chromosome(s) and blackleg resistance. The entire B3 chromosome of B. carinata was associated with blackleg resistance in DH1. A second DH population (DH2) was produced from F1s of resistant DH1 lines crossed to the blackleg-susceptible B. napus cv. Polo, in which resistance was found to be associated with SSR markers from the middle to the bottom of the B3 chromosome and the top of the B8 chromosome. The results demonstrated that the B3 chromosome carried gene(s) for blackleg resistance. Genomic in situ hybridization (GISH) and GISH-like analysis of the DH2 lines revealed that susceptible lines, in addition to B. napus chromosomes, possessed one pair of B genome chromosomes (2n = 40), while resistant lines had either one pair (2n = 40) or two pairs (2n = 42) of B chromosomes. The molecular and GISH data suggested that the B chromosome in the susceptible lines was B7, while it was difficult to confirm the identity of the B chromosomes in the resistant lines. In addition, B chromosomes were found to be inherited over several generations along with B. napus chromosomes.