Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 58
Filter
1.
Nature ; 615(7953): 652-659, 2023 03.
Article in English | MEDLINE | ID: mdl-36890232

ABSTRACT

Increasing the proportion of locally produced plant protein in currently meat-rich diets could substantially reduce greenhouse gas emissions and loss of biodiversity1. However, plant protein production is hampered by the lack of a cool-season legume equivalent to soybean in agronomic value2. Faba bean (Vicia faba L.) has a high yield potential and is well suited for cultivation in temperate regions, but genomic resources are scarce. Here, we report a high-quality chromosome-scale assembly of the faba bean genome and show that it has expanded to a massive 13 Gb in size through an imbalance between the rates of amplification and elimination of retrotransposons and satellite repeats. Genes and recombination events are evenly dispersed across chromosomes and the gene space is remarkably compact considering the genome size, although with substantial copy number variation driven by tandem duplication. Demonstrating practical application of the genome sequence, we develop a targeted genotyping assay and use high-resolution genome-wide association analysis to dissect the genetic basis of seed size and hilum colour. The resources presented constitute a genomics-based breeding platform for faba bean, enabling breeders and geneticists to accelerate the improvement of sustainable protein production across the Mediterranean, subtropical and northern temperate agroecological zones.


Subject(s)
Crops, Agricultural , Diploidy , Genetic Variation , Genome, Plant , Genomics , Plant Breeding , Plant Proteins , Vicia faba , Chromosomes, Plant/genetics , Crops, Agricultural/genetics , Crops, Agricultural/metabolism , DNA Copy Number Variations/genetics , DNA, Satellite/genetics , Gene Amplification/genetics , Genes, Plant/genetics , Genetic Variation/genetics , Genome, Plant/genetics , Genome-Wide Association Study , Geography , Plant Breeding/methods , Plant Proteins/genetics , Plant Proteins/metabolism , Recombination, Genetic , Retroelements/genetics , Seeds/anatomy & histology , Seeds/genetics , Vicia faba/anatomy & histology , Vicia faba/genetics , Vicia faba/metabolism
2.
Nature ; 606(7912): 113-119, 2022 06.
Article in English | MEDLINE | ID: mdl-35585233

ABSTRACT

Cultivated oat (Avena sativa L.) is an allohexaploid (AACCDD, 2n = 6x = 42) thought to have been domesticated more than 3,000 years ago while growing as a weed in wheat, emmer and barley fields in Anatolia1,2. Oat has a low carbon footprint, substantial health benefits and the potential to replace animal-based food products. However, the lack of a fully annotated reference genome has hampered efforts to deconvolute its complex evolutionary history and functional gene dynamics. Here we present a high-quality reference genome of A. sativa and close relatives of its diploid (Avena longiglumis, AA, 2n = 14) and tetraploid (Avena insularis, CCDD, 2n = 4x = 28) progenitors. We reveal the mosaic structure of the oat genome, trace large-scale genomic reorganizations in the polyploidization history of oat and illustrate a breeding barrier associated with the genome architecture of oat. We showcase detailed analyses of gene families implicated in human health and nutrition, which adds to the evidence supporting oat safety in gluten-free diets, and we perform mapping-by-sequencing of an agronomic trait related to water-use efficiency. This resource for the Avena genus will help to leverage knowledge from other cereal genomes, improve understanding of basic oat biology and accelerate genomics-assisted breeding and reanalysis of quantitative trait studies.


Subject(s)
Avena , Edible Grain , Genome, Plant , Avena/genetics , Diploidy , Edible Grain/genetics , Genome, Plant/genetics , Mosaicism , Plant Breeding , Tetraploidy
3.
Nature ; 588(7837): 284-289, 2020 12.
Article in English | MEDLINE | ID: mdl-33239781

ABSTRACT

Genetic diversity is key to crop improvement. Owing to pervasive genomic structural variation, a single reference genome assembly cannot capture the full complement of sequence diversity of a crop species (known as the 'pan-genome'1). Multiple high-quality sequence assemblies are an indispensable component of a pan-genome infrastructure. Barley (Hordeum vulgare L.) is an important cereal crop with a long history of cultivation that is adapted to a wide range of agro-climatic conditions2. Here we report the construction of chromosome-scale sequence assemblies for the genotypes of 20 varieties of barley-comprising landraces, cultivars and a wild barley-that were selected as representatives of global barley diversity. We catalogued genomic presence/absence variants and explored the use of structural variants for quantitative genetic analysis through whole-genome shotgun sequencing of 300 gene bank accessions. We discovered abundant large inversion polymorphisms and analysed in detail two inversions that are frequently found in current elite barley germplasm; one is probably the product of mutation breeding and the other is tightly linked to a locus that is involved in the expansion of geographical range. This first-generation barley pan-genome makes previously hidden genetic variation accessible to genetic studies and breeding.


Subject(s)
Chromosomes, Plant/genetics , Genome, Plant/genetics , Hordeum/genetics , Internationality , Mutation , Plant Breeding , Chromosome Inversion/genetics , Chromosome Mapping , Genetic Loci/genetics , Genotype , Hordeum/classification , Polymorphism, Genetic/genetics , Reference Standards , Seed Bank , Sequence Inversion , Whole Genome Sequencing
4.
Nature ; 588(7837): 277-283, 2020 12.
Article in English | MEDLINE | ID: mdl-33239791

ABSTRACT

Advances in genomics have expedited the improvement of several agriculturally important crops but similar efforts in wheat (Triticum spp.) have been more challenging. This is largely owing to the size and complexity of the wheat genome1, and the lack of genome-assembly data for multiple wheat lines2,3. Here we generated ten chromosome pseudomolecule and five scaffold assemblies of hexaploid wheat to explore the genomic diversity among wheat lines from global breeding programs. Comparative analysis revealed extensive structural rearrangements, introgressions from wild relatives and differences in gene content resulting from complex breeding histories aimed at improving adaptation to diverse environments, grain yield and quality, and resistance to stresses4,5. We provide examples outlining the utility of these genomes, including a detailed multi-genome-derived nucleotide-binding leucine-rich repeat protein repertoire involved in disease resistance and the characterization of Sm16, a gene associated with insect resistance. These genome assemblies will provide a basis for functional gene discovery and breeding to deliver the next generation of modern wheat cultivars.


Subject(s)
Genetic Variation , Genome, Plant/genetics , Genomics , Internationality , Plant Breeding/methods , Triticum/genetics , Acclimatization/genetics , Animals , Centromere/genetics , Centromere/metabolism , Chromosome Mapping , Cloning, Molecular , DNA Copy Number Variations/genetics , DNA Transposable Elements/genetics , Edible Grain/genetics , Edible Grain/growth & development , Genes, Plant/genetics , Genetic Introgression , Haplotypes , Insecta/pathogenicity , NLR Proteins/genetics , Plant Diseases/genetics , Plant Proteins/genetics , Polymorphism, Single Nucleotide/genetics , Polyploidy , Triticum/classification , Triticum/growth & development
5.
Plant Cell ; 33(6): 1888-1906, 2021 07 19.
Article in English | MEDLINE | ID: mdl-33710295

ABSTRACT

Sequence assembly of large and repeat-rich plant genomes has been challenging, requiring substantial computational resources and often several complementary sequence assembly and genome mapping approaches. The recent development of fast and accurate long-read sequencing by circular consensus sequencing (CCS) on the PacBio platform may greatly increase the scope of plant pan-genome projects. Here, we compare current long-read sequencing platforms regarding their ability to rapidly generate contiguous sequence assemblies in pan-genome studies of barley (Hordeum vulgare). Most long-read assemblies are clearly superior to the current barley reference sequence based on short-reads. Assemblies derived from accurate long reads excel in most metrics, but the CCS approach was the most cost-effective strategy for assembling tens of barley genomes. A downsampling analysis indicated that 20-fold CCS coverage can yield very good sequence assemblies, while even five-fold CCS data may capture the complete sequence of most genes. We present an updated reference genome assembly for barley with near-complete representation of the repeat-rich intergenic space. Long-read assembly can underpin the construction of accurate and complete sequences of multiple genomes of a species to build pan-genome infrastructures in Triticeae crops and their wild relatives.


Subject(s)
Genomics/methods , High-Throughput Nucleotide Sequencing/methods , Hordeum/genetics , Computational Biology/methods , DNA, Intergenic , Genome, Plant , Molecular Sequence Annotation , Retroelements , Sequence Analysis, DNA , Terminal Repeat Sequences
6.
Plant J ; 110(1): 179-192, 2022 04.
Article in English | MEDLINE | ID: mdl-34997796

ABSTRACT

Aegilops is a close relative of wheat (Triticum spp.), and Aegilops species in the section Sitopsis represent a rich reservoir of genetic diversity for the improvement of wheat. To understand their diversity and advance their utilization, we produced whole-genome assemblies of Aegilops longissima and Aegilops speltoides. Whole-genome comparative analysis, along with the recently sequenced Aegilops sharonensis genome, showed that the Ae. longissima and Ae. sharonensis genomes are highly similar and are most closely related to the wheat D subgenome. By contrast, the Ae. speltoides genome is more closely related to the B subgenome. Haplotype block analysis supported the idea that Ae. speltoides genome is closest to the wheat B subgenome, and highlighted variable and similar genomic regions between the three Aegilops species and wheat. Genome-wide analysis of nucleotide-binding leucine-rich repeat (NLR) genes revealed species-specific and lineage-specific NLR genes and variants, demonstrating the potential of Aegilops genomes for wheat improvement.


Subject(s)
Aegilops , Aegilops/genetics , Genome, Plant/genetics , Phylogeny , Poaceae/genetics , Triticum/genetics
7.
Plant Physiol ; 190(2): 1242-1259, 2022 09 28.
Article in English | MEDLINE | ID: mdl-35861439

ABSTRACT

Parasitism is a successful life strategy that has evolved independently in several families of vascular plants. The genera Cuscuta and Orobanche represent examples of the two profoundly different groups of parasites: one parasitizing host shoots and the other infecting host roots. In this study, we sequenced and described the overall repertoire of small RNAs from Cuscuta campestris and Orobanche aegyptiaca. We showed that C. campestris contains a number of novel microRNAs (miRNAs) in addition to a conspicuous retention of miRNAs that are typically lacking in other Solanales, while several typically conserved miRNAs seem to have become obsolete in the parasite. One new miRNA appears to be derived from a horizontal gene transfer event. The exploratory analysis of the miRNA population (exploratory due to the absence of a full genomic sequence for reference) from the root parasitic O. aegyptiaca also revealed a loss of a number of miRNAs compared to photosynthetic species from the same order. In summary, our study shows partly similar evolutionary signatures in the RNA silencing machinery in both parasites. Our data bear proof for the dynamism of this regulatory mechanism in parasitic plants.


Subject(s)
Cuscuta , MicroRNAs , Orobanche , Parasites , Animals , Cuscuta/genetics , MicroRNAs/genetics , Orobanche/genetics , RNA, Plant/genetics
8.
Nature ; 544(7651): 427-433, 2017 04 26.
Article in English | MEDLINE | ID: mdl-28447635

ABSTRACT

Cereal grasses of the Triticeae tribe have been the major food source in temperate regions since the dawn of agriculture. Their large genomes are characterized by a high content of repetitive elements and large pericentromeric regions that are virtually devoid of meiotic recombination. Here we present a high-quality reference genome assembly for barley (Hordeum vulgare L.). We use chromosome conformation capture mapping to derive the linear order of sequences across the pericentromeric space and to investigate the spatial organization of chromatin in the nucleus at megabase resolution. The composition of genes and repetitive elements differs between distal and proximal regions. Gene family analyses reveal lineage-specific duplications of genes involved in the transport of nutrients to developing seeds and the mobilization of carbohydrates in grains. We demonstrate the importance of the barley reference sequence for breeding by inspecting the genomic partitioning of sequence variation in modern elite germplasm, highlighting regions vulnerable to genetic erosion.


Subject(s)
Chromosomes, Plant/genetics , Genome, Plant/genetics , Hordeum/genetics , Cell Nucleus/genetics , Centromere/genetics , Chromatin/genetics , Chromatin/metabolism , Chromosome Mapping , Chromosomes, Artificial, Bacterial/genetics , Genetic Variation , Genomics , Haplotypes/genetics , Meiosis/genetics , Repetitive Sequences, Nucleic Acid/genetics , Seeds/genetics
9.
Plant J ; 97(1): 182-198, 2019 01.
Article in English | MEDLINE | ID: mdl-30500991

ABSTRACT

Recent advances in genomics technologies have greatly accelerated the progress in both fundamental plant science and applied breeding research. Concurrently, high-throughput plant phenotyping is becoming widely adopted in the plant community, promising to alleviate the phenotypic bottleneck. While these technological breakthroughs are significantly accelerating quantitative trait locus (QTL) and causal gene identification, challenges to enable even more sophisticated analyses remain. In particular, care needs to be taken to standardize, describe and conduct experiments robustly while relying on plant physiology expertise. In this article, we review the state of the art regarding genome assembly and the future potential of pangenomics in plant research. We also describe the necessity of standardizing and describing phenotypic studies using the Minimum Information About a Plant Phenotyping Experiment (MIAPPE) standard to enable the reuse and integration of phenotypic data. In addition, we show how deep phenotypic data might yield novel trait-trait correlations and review how to link phenotypic data to genomic data. Finally, we provide perspectives on the golden future of machine learning and their potential in linking phenotypes to genomic features.


Subject(s)
Genetic Association Studies , Genome, Plant/genetics , Genomics , Machine Learning , Phenomics , Plants/genetics , Phenotype , Quantitative Trait Loci/genetics
10.
Genome Res ; 27(5): 885-896, 2017 05.
Article in English | MEDLINE | ID: mdl-28420692

ABSTRACT

Advances in genome sequencing and assembly technologies are generating many high-quality genome sequences, but assemblies of large, repeat-rich polyploid genomes, such as that of bread wheat, remain fragmented and incomplete. We have generated a new wheat whole-genome shotgun sequence assembly using a combination of optimized data types and an assembly algorithm designed to deal with large and complex genomes. The new assembly represents >78% of the genome with a scaffold N50 of 88.8 kb that has a high fidelity to the input data. Our new annotation combines strand-specific Illumina RNA-seq and Pacific Biosciences (PacBio) full-length cDNAs to identify 104,091 high-confidence protein-coding genes and 10,156 noncoding RNA genes. We confirmed three known and identified one novel genome rearrangements. Our approach enables the rapid and scalable assembly of wheat genomes, the identification of structural variants, and the definition of complete gene models, all powerful resources for trait analysis and breeding of this key global crop.


Subject(s)
Contig Mapping/methods , Genome, Plant , Molecular Sequence Annotation/methods , Plant Proteins/genetics , Translocation, Genetic , Triticum/genetics , Algorithms , Contig Mapping/standards , Molecular Sequence Annotation/standards , Polymorphism, Genetic , Polyploidy
11.
Plant J ; 93(3): 502-514, 2018 02.
Article in English | MEDLINE | ID: mdl-29205595

ABSTRACT

Pseudogenes have a reputation of being 'evolutionary relics' or 'junk DNA'. While they are well characterized in mammals, studies in more complex plant genomes have so far been hampered by the absence of reference genome sequences. Barley is one of the economically most important cereals and has a genome size of 5.1 Gb. With the first high-quality genome reference assembly available for a Triticeae crop, we conducted a whole-genome assessment of pseudogenes on the barley genome. We identified, characterized and classified 89 440 gene fragments and pseudogenes scattered along the chromosomes, with occasional hotspots and higher densities at the chromosome ends. Full-length pseudogenes (11 015) have preferentially retained their exon-intron structure. Retrotransposition of processed mRNAs only plays a marginal role in their creation. However, the distribution of retroposed pseudogenes reflects the Rabl configuration of barley chromosomes and thus hints at founding mechanisms. While parent genes related to the defense-response were found to be under-represented in cultivated barley, we detected several defense-related pseudogenes in wild barley accessions. The percentage of transcriptionally active pseudogenes is 7.2%, and these may potentially adopt new regulatory roles.The barley genome is rich in pseudogenes and small gene fragments mainly located towards chromosome tips or as tandemly repeated units. Our results indicate non-random duplication and pseudogenization preferences and improve our understanding of the dynamics of gene birth and death in large plant genomes and the mechanisms that lead to evolutionary innovations.


Subject(s)
Genes, Plant , Hordeum/genetics , Pseudogenes , Chromosome Mapping , Chromosomes, Plant , Gene Duplication , Multigene Family , Selection, Genetic , Synteny
12.
Plant J ; 93(3): 515-533, 2018 02.
Article in English | MEDLINE | ID: mdl-29237241

ABSTRACT

The draft genome of the moss model, Physcomitrella patens, comprised approximately 2000 unordered scaffolds. In order to enable analyses of genome structure and evolution we generated a chromosome-scale genome assembly using genetic linkage as well as (end) sequencing of long DNA fragments. We find that 57% of the genome comprises transposable elements (TEs), some of which may be actively transposing during the life cycle. Unlike in flowering plant genomes, gene- and TE-rich regions show an overall even distribution along the chromosomes. However, the chromosomes are mono-centric with peaks of a class of Copia elements potentially coinciding with centromeres. Gene body methylation is evident in 5.7% of the protein-coding genes, typically coinciding with low GC and low expression. Some giant virus insertions are transcriptionally active and might protect gametes from viral infection via siRNA mediated silencing. Structure-based detection methods show that the genome evolved via two rounds of whole genome duplications (WGDs), apparently common in mosses but not in liverworts and hornworts. Several hundred genes are present in colinear regions conserved since the last common ancestor of plants. These syntenic regions are enriched for functions related to plant-specific cell growth and tissue organization. The P. patens genome lacks the TE-rich pericentromeric and gene-rich distal regions typical for most flowering plant genomes. More non-seed plant genomes are needed to unravel how plant genomes evolve, and to understand whether the P. patens genome structure is typical for mosses or bryophytes.


Subject(s)
Biological Evolution , Bryopsida/genetics , Chromosomes, Plant , Genome, Plant , Centromere , Chromatin/genetics , DNA Methylation , DNA Transposable Elements , Genetic Variation , Polymorphism, Single Nucleotide , Recombination, Genetic , Synteny
13.
Plant J ; 89(5): 853-869, 2017 Mar.
Article in English | MEDLINE | ID: mdl-27888547

ABSTRACT

We report on a whole-genome draft sequence of rye (Secale cereale L.). Rye is a diploid Triticeae species closely related to wheat and barley, and an important crop for food and feed in Central and Eastern Europe. Through whole-genome shotgun sequencing of the 7.9-Gbp genome of the winter rye inbred line Lo7 we obtained a de novo assembly represented by 1.29 million scaffolds covering a total length of 2.8 Gbp. Our reference sequence represents nearly the entire low-copy portion of the rye genome. This genome assembly was used to predict 27 784 rye gene models based on homology to sequenced grass genomes. Through resequencing of 10 rye inbred lines and one accession of the wild relative S. vavilovii, we discovered more than 90 million single nucleotide variants and short insertions/deletions in the rye genome. From these variants, we developed the high-density Rye600k genotyping array with 600 843 markers, which enabled anchoring the sequence contigs along a high-density genetic map and establishing a synteny-based virtual gene order. Genotyping data were used to characterize the diversity of rye breeding pools and genetic resources, and to obtain a genome-wide map of selection signals differentiating the divergent gene pools. This rye whole-genome sequence closes a gap in Triticeae genome research, and will be highly valuable for comparative genomics, functional studies and genome-based breeding in rye.


Subject(s)
Chromosomes, Plant/genetics , Secale/genetics , DNA, Plant/genetics , Genome, Plant/genetics , Genomics , Genotype , Synteny
14.
Nature ; 492(7429): 423-7, 2012 Dec 20.
Article in English | MEDLINE | ID: mdl-23257886

ABSTRACT

Polyploidy often confers emergent properties, such as the higher fibre productivity and quality of tetraploid cottons than diploid cottons bred for the same environments. Here we show that an abrupt five- to sixfold ploidy increase approximately 60 million years (Myr) ago, and allopolyploidy reuniting divergent Gossypium genomes approximately 1-2 Myr ago, conferred about 30-36-fold duplication of ancestral angiosperm (flowering plant) genes in elite cottons (Gossypium hirsutum and Gossypium barbadense), genetic complexity equalled only by Brassica among sequenced angiosperms. Nascent fibre evolution, before allopolyploidy, is elucidated by comparison of spinnable-fibred Gossypium herbaceum A and non-spinnable Gossypium longicalyx F genomes to one another and the outgroup D genome of non-spinnable Gossypium raimondii. The sequence of a G. hirsutum A(t)D(t) (in which 't' indicates tetraploid) cultivar reveals many non-reciprocal DNA exchanges between subgenomes that may have contributed to phenotypic innovation and/or other emergent properties such as ecological adaptation by polyploids. Most DNA-level novelty in G. hirsutum recombines alleles from the D-genome progenitor native to its New World habitat and the Old World A-genome progenitor in which spinnable fibre evolved. Coordinated expression changes in proximal groups of functionally distinct genes, including a nuclear mitochondrial DNA block, may account for clusters of cotton-fibre quantitative trait loci affecting diverse traits. Opportunities abound for dissecting emergent properties of other polyploids, particularly angiosperms, by comparison to diploid progenitors and outgroups.


Subject(s)
Biological Evolution , Cotton Fiber , Genome, Plant/genetics , Gossypium/genetics , Polyploidy , Alleles , Cacao/genetics , Chromosomes, Plant/genetics , Diploidy , Gene Duplication/genetics , Genes, Plant/genetics , Gossypium/classification , Molecular Sequence Annotation , Phylogeny , Vitis/genetics
15.
Nucleic Acids Res ; 44(D1): D1141-7, 2016 Jan 04.
Article in English | MEDLINE | ID: mdl-26527721

ABSTRACT

PGSB (Plant Genome and Systems Biology: formerly MIPS) PlantsDB (http://pgsb.helmholtz-muenchen.de/plant/index.jsp) is a database framework for the comparative analysis and visualization of plant genome data. The resource has been updated with new data sets and types as well as specialized tools and interfaces to address user demands for intuitive access to complex plant genome data. In its latest incarnation, we have re-worked both the layout and navigation structure and implemented new keyword search options and a new BLAST sequence search functionality. Actively involved in corresponding sequencing consortia, PlantsDB has dedicated special efforts to the integration and visualization of complex triticeae genome data, especially for barley, wheat and rye. We enhanced CrowsNest, a tool to visualize syntenic relationships between genomes, with data from the wheat sub-genome progenitor Aegilops tauschii and added functionality to the PGSB RNASeqExpressionBrowser. GenomeZipper results were integrated for the genomes of barley, rye, wheat and perennial ryegrass and interactive access is granted through PlantsDB interfaces. Data exchange and cross-linking between PlantsDB and other plant genome databases is stimulated by the transPLANT project (http://transplantdb.eu/).


Subject(s)
Databases, Genetic , Genome, Plant , Gene Expression , Genomics , Hordeum/genetics , Plants/genetics , Plants/metabolism , Secale/genetics , Software , Triticum/genetics
16.
PLoS Genet ; 11(10): e1005588, 2015 Oct.
Article in English | MEDLINE | ID: mdl-26492483

ABSTRACT

Plants integrate seasonal cues such as temperature and day length to optimally adjust their flowering time to the environment. Compared to the control of flowering before and after winter by the vernalization and day length pathways, mechanisms that delay or promote flowering during a transient cool or warm period, especially during spring, are less well understood. Due to global warming, understanding this ambient temperature pathway has gained increasing importance. In Arabidopsis thaliana, FLOWERING LOCUS M (FLM) is a critical flowering regulator of the ambient temperature pathway. FLM is alternatively spliced in a temperature-dependent manner and the two predominant splice variants, FLM-ß and FLM-δ, can repress and activate flowering in the genetic background of the A. thaliana reference accession Columbia-0. The relevance of this regulatory mechanism for the environmental adaptation across the entire range of the species is, however, unknown. Here, we identify insertion polymorphisms in the first intron of FLM as causative for accelerated flowering in many natural A. thaliana accessions, especially in cool (15°C) temperatures. We present evidence for a potential adaptive role of this structural variation and link it specifically to changes in the abundance of FLM-ß. Our results may allow predicting flowering in response to ambient temperatures in the Brassicaceae.


Subject(s)
Arabidopsis Proteins/genetics , Arabidopsis/genetics , Flowers/genetics , MADS Domain Proteins/genetics , Mutagenesis, Insertional/genetics , Alternative Splicing/genetics , Arabidopsis/growth & development , Arabidopsis Proteins/biosynthesis , Gene Expression Regulation, Plant , Global Warming , MADS Domain Proteins/biosynthesis , Polymorphism, Genetic , Seasons , Temperature
17.
Nature ; 480(7378): 520-4, 2011 Nov 16.
Article in English | MEDLINE | ID: mdl-22089132

ABSTRACT

Legumes (Fabaceae or Leguminosae) are unique among cultivated plants for their ability to carry out endosymbiotic nitrogen fixation with rhizobial bacteria, a process that takes place in a specialized structure known as the nodule. Legumes belong to one of the two main groups of eurosids, the Fabidae, which includes most species capable of endosymbiotic nitrogen fixation. Legumes comprise several evolutionary lineages derived from a common ancestor 60 million years ago (Myr ago). Papilionoids are the largest clade, dating nearly to the origin of legumes and containing most cultivated species. Medicago truncatula is a long-established model for the study of legume biology. Here we describe the draft sequence of the M. truncatula euchromatin based on a recently completed BAC assembly supplemented with Illumina shotgun sequence, together capturing ∼94% of all M. truncatula genes. A whole-genome duplication (WGD) approximately 58 Myr ago had a major role in shaping the M. truncatula genome and thereby contributed to the evolution of endosymbiotic nitrogen fixation. Subsequent to the WGD, the M. truncatula genome experienced higher levels of rearrangement than two other sequenced legumes, Glycine max and Lotus japonicus. M. truncatula is a close relative of alfalfa (Medicago sativa), a widely cultivated crop with limited genomics tools and complex autotetraploid genetics. As such, the M. truncatula genome sequence provides significant opportunities to expand alfalfa's genomic toolbox.


Subject(s)
Biological Evolution , Genome, Plant , Medicago truncatula/genetics , Medicago truncatula/microbiology , Rhizobium/physiology , Symbiosis , Molecular Sequence Data , Nitrogen Fixation/genetics , Glycine max/genetics , Synteny , Vitis/genetics
18.
Biochim Biophys Acta ; 1849(1): 64-70, 2015 Jan.
Article in English | MEDLINE | ID: mdl-25481283

ABSTRACT

BACKGROUND: B chromosomes are supernumerary dispensable parts of the karyotype which appear in some individuals of some populations in some species. Often, they have been considered as 'junk DNA' or genomic parasites without functional genes. SCOPE OF REVIEW: Due to recent advances in sequencing technologies, it became possible to investigate their DNA composition, transcriptional activity and effects on the host transcriptome profile in detail. Here, we review the most recent findings regarding the gene content of B chromosomes and their transcriptional activities and discuss these findings in the context of comparable biological phenomena, like sex chromosomes, aneuploidy and pseudogenes. MAJOR CONCLUSIONS: Recent data suggest that B chromosomes carry transcriptionally active genic sequences which could affect the transcriptome profile of their host genome. GENERAL SIGNIFICANCE: These findings are gradually changing our view that B chromosomes are solely genetically inert selfish elements without any functional genes. This at one side could partly explain the deleterious effects which are associated with their presence. On the other hand it makes B chromosome a nice model for studying regulatory mechanisms of duplicated genes and their evolutionary consequences.


Subject(s)
Chromosomes/genetics , DNA, Intergenic/genetics , Evolution, Molecular , Transcription, Genetic , Animals , Eukaryota/genetics , Gene Expression Regulation/genetics , Genome , Humans , In Situ Hybridization, Fluorescence , Pseudogenes/genetics
19.
Plant Physiol ; 164(1): 412-23, 2014 Jan.
Article in English | MEDLINE | ID: mdl-24243933

ABSTRACT

Barley (Hordeum vulgare) is an important cereal crop and a model species for Triticeae genomics. To lay the foundation for hierarchical map-based sequencing, a genome-wide physical map of its large and complex 5.1 billion-bp genome was constructed by high-information content fingerprinting of almost 600,000 bacterial artificial chromosomes representing 14-fold haploid genome coverage. The resultant physical map comprises 9,265 contigs with a cumulative size of 4.9 Gb representing 96% of the physical length of the barley genome. The reliability of the map was verified through extensive genetic marker information and the analysis of topological networks of clone overlaps. A minimum tiling path of 66,772 minimally overlapping clones was defined that will serve as a template for hierarchical clone-by-clone map-based shotgun sequencing. We integrated whole-genome shotgun sequence data from the individuals of two mapping populations with published bacterial artificial chromosome survey sequence information to genetically anchor the physical map. This novel approach in combination with the comprehensive whole-genome shotgun sequence data sets allowed us to independently validate and improve a previously reported physical and genetic framework. The resources developed in this study will underpin fine-mapping and cloning of agronomically important genes and the assembly of a draft genome sequence.


Subject(s)
Hordeum/genetics , Physical Chromosome Mapping , Polymorphism, Single Nucleotide , Chromosomes, Artificial, Bacterial , Contig Mapping , Reproducibility of Results , Sequence Analysis, DNA
20.
Nature ; 457(7229): 551-6, 2009 Jan 29.
Article in English | MEDLINE | ID: mdl-19189423

ABSTRACT

Sorghum, an African grass related to sugar cane and maize, is grown for food, feed, fibre and fuel. We present an initial analysis of the approximately 730-megabase Sorghum bicolor (L.) Moench genome, placing approximately 98% of genes in their chromosomal context using whole-genome shotgun sequence validated by genetic, physical and syntenic information. Genetic recombination is largely confined to about one-third of the sorghum genome with gene order and density similar to those of rice. Retrotransposon accumulation in recombinationally recalcitrant heterochromatin explains the approximately 75% larger genome size of sorghum compared with rice. Although gene and repetitive DNA distributions have been preserved since palaeopolyploidization approximately 70 million years ago, most duplicated gene sets lost one member before the sorghum-rice divergence. Concerted evolution makes one duplicated chromosomal segment appear to be only a few million years old. About 24% of genes are grass-specific and 7% are sorghum-specific. Recent gene and microRNA duplications may contribute to sorghum's drought tolerance.


Subject(s)
Evolution, Molecular , Genome, Plant/genetics , Poaceae/genetics , Sorghum/genetics , Arabidopsis/genetics , Chromosomes, Plant/genetics , Gene Duplication , Genes, Plant , Oryza/genetics , Populus/genetics , Recombination, Genetic/genetics , Sequence Alignment , Sequence Analysis, DNA , Sequence Deletion/genetics , Zea mays/genetics
SELECTION OF CITATIONS
SEARCH DETAIL