Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 47
Filter
Add more filters










Publication year range
1.
Plant Sci ; 341: 112018, 2024 Apr.
Article in English | MEDLINE | ID: mdl-38325660

ABSTRACT

Sustainable agriculture requires locally adapted varieties that produce nutritious food with limited agricultural inputs. Genome engineering represents a viable approach to develop cultivars that fulfill these criteria. For example, the red Hassawi rice, a native landrace of Saudi Arabia, tolerates local drought and high-salinity conditions and produces grain with diverse health-promoting phytochemicals. However, Hassawi has a long growth cycle, high cultivation costs, low productivity, and susceptibility to lodging. Here, to improve these undesirable traits via genome editing, we established efficient regeneration and Agrobacterium-mediated transformation protocols for Hassawi. In addition, we generated the first high-quality reference genome and targeted the key flowering repressor gene, Hd4, thus shortening the plant's lifecycle and height. Using CRISPR/Cas9 multiplexing, we simultaneously disrupted negative regulators of flowering time (Hd2, Hd4, and Hd5), grain size (GS3), grain number (GN1a), and plant height (Sd1). The resulting homozygous mutant lines flowered extremely early (∼56 days) and had shorter stems (approximately 107 cm), longer grains (by 5.1%), and more grains per plant (by 50.2%), thereby enhancing overall productivity. Furthermore, the awns of grains were 86.4% shorter compared to unedited plants. Moreover, the modified rice grain displayed improved nutritional attributes. As a result, the modified Hassawi rice combines several desirable traits that can incentivize large-scale cultivation and reduce malnutrition.


Subject(s)
Oryza , Oryza/genetics , Gene Editing , Phenotype , Agriculture , CRISPR-Cas Systems
2.
BMC Biol ; 22(1): 13, 2024 Jan 25.
Article in English | MEDLINE | ID: mdl-38273258

ABSTRACT

BACKGROUND: Single-nucleotide polymorphisms (SNPs) are the most widely used form of molecular genetic variation studies. As reference genomes and resequencing data sets expand exponentially, tools must be in place to call SNPs at a similar pace. The genome analysis toolkit (GATK) is one of the most widely used SNP calling software tools publicly available, but unfortunately, high-performance computing versions of this tool have yet to become widely available and affordable. RESULTS: Here we report an open-source high-performance computing genome variant calling workflow (HPC-GVCW) for GATK that can run on multiple computing platforms from supercomputers to desktop machines. We benchmarked HPC-GVCW on multiple crop species for performance and accuracy with comparable results with previously published reports (using GATK alone). Finally, we used HPC-GVCW in production mode to call SNPs on a "subpopulation aware" 16-genome rice reference panel with ~ 3000 resequenced rice accessions. The entire process took ~ 16 weeks and resulted in the identification of an average of 27.3 M SNPs/genome and the discovery of ~ 2.3 million novel SNPs that were not present in the flagship reference genome for rice (i.e., IRGSP RefSeq). CONCLUSIONS: This study developed an open-source pipeline (HPC-GVCW) to run GATK on HPC platforms, which significantly improved the speed at which SNPs can be called. The workflow is widely applicable as demonstrated successfully for four major crop species with genomes ranging in size from 400 Mb to 2.4 Gb. Using HPC-GVCW in production mode to call SNPs on a 25 multi-crop-reference genome data set produced over 1.1 billion SNPs that were publicly released for functional and breeding studies. For rice, many novel SNPs were identified and were found to reside within genes and open chromatin regions that are predicted to have functional consequences. Combined, our results demonstrate the usefulness of combining a high-performance SNP calling architecture solution with a subpopulation-aware reference genome panel for rapid SNP discovery and public deployment.


Subject(s)
Genome, Plant , Polymorphism, Single Nucleotide , Workflow , Plant Breeding , Software , High-Throughput Nucleotide Sequencing/methods
3.
Nat Food ; 4(5): 366-371, 2023 05.
Article in English | MEDLINE | ID: mdl-37169820

ABSTRACT

Pigmented rice (Oryza sativa L.) is a rich source of nutrients, but pigmented lines typically have long life cycles and limited productivity. Here we generated genome assemblies of 5 pigmented rice varieties and evaluated the genetic variation among 51 pigmented rice varieties by resequencing an additional 46 varieties. Phylogenetic analyses divided the pigmented varieties into four varietal groups: Geng-japonica, Xian-indica, circum-Aus and circum-Basmati. Metabolomics and ionomics profiling revealed that black rice varieties are rich in aromatic secondary metabolites. We established a regeneration and transformation system and used CRISPR-Cas9 to knock out three flowering time repressors (Hd2, Hd4 and Hd5) in the black Indonesian rice Cempo Ireng, resulting in an early maturing variety with shorter stature. Our study thus provides a multi-omics resource for understanding and improving Asian pigmented rice.


Subject(s)
Genetic Variation , Oryza , Oryza/genetics , Phylogeny , Multiomics , Sequence Analysis, DNA
4.
Nat Commun ; 14(1): 1567, 2023 03 21.
Article in English | MEDLINE | ID: mdl-36944612

ABSTRACT

Understanding and exploiting genetic diversity is a key factor for the productive and stable production of rice. Here, we utilize 73 high-quality genomes that encompass the subpopulation structure of Asian rice (Oryza sativa), plus the genomes of two wild relatives (O. rufipogon and O. punctata), to build a pan-genome inversion index of 1769 non-redundant inversions that span an average of ~29% of the O. sativa cv. Nipponbare reference genome sequence. Using this index, we estimate an inversion rate of ~700 inversions per million years in Asian rice, which is 16 to 50 times higher than previously estimated for plants. Detailed analyses of these inversions show evidence of their effects on gene expression, recombination rate, and linkage disequilibrium. Our study uncovers the prevalence and scale of large inversions (≥100 bp) across the pan-genome of Asian rice and hints at their largely unexplored role in functional biology and crop performance.


Subject(s)
Oryza , Oryza/genetics , Sequence Analysis, DNA , Genome, Plant/genetics , Biological Evolution , Phylogeny
5.
G3 (Bethesda) ; 13(3)2023 03 09.
Article in English | MEDLINE | ID: mdl-36611193

ABSTRACT

High-quality genome assemblies are characterized by high-sequence contiguity, completeness, and a low error rate, thus providing the basis for a wide array of studies focusing on natural species ecology, conservation, evolution, and population genomics. To provide this valuable resource for conservation projects and comparative genomics studies on gyrfalcon (Falco rusticolus), we sequenced and assembled the genome of this species using third-generation sequencing strategies and optical maps. Here, we describe a highly contiguous and complete genome assembly comprising 20 scaffolds and 13 contigs with a total size of 1.193 Gbp, including 8,064 complete Benchmarking Universal Single-Copy Orthologs (BUSCOs) of the total 8,338 BUSCO groups present in the library aves_odb10. Of these BUSCO genes, 96.7% were complete, 96.1% were present as a single copy, and 0.6% were duplicated. Furthermore, 0.8% of BUSCO genes were fragmented and 2.5% (210) were missing. A de novo search for transposable elements (TEs) identified 5,716 TEs that masked 7.61% of the F. rusticolus genome assembly when combined with publicly available TE collections. Long interspersed nuclear elements, in particular, the element Chicken-repeat 1 (CR1), were the most abundant TEs in the F. rusticolus genome. A de novo first-pass gene annotation was performed using 293,349 PacBio Iso-Seq transcripts and 496,195 transcripts derived from the assembly of 42,429,525 Illumina PE RNA-seq reads. In all, 19,602 putative genes, of which 59.31% were functionally characterized and associated with Gene Ontology terms, were annotated. A comparison of the gyrfalcon genome assembly with the publicly available assemblies of the domestic chicken (Gallus gallus), zebra finch (Taeniopygia guttata), and hummingbird (Calypte anna) revealed several genome rearrangements. In particular, nine putative chromosome fusions were identified in the gyrfalcon genome assembly compared with those in the G. gallus genome assembly. This genome assembly, its annotation for TEs and genes, and the comparative analyses presented, complement and strength the base of high-quality genome assemblies and associated resources available for comparative studies focusing on the evolution, ecology, and conservation of Aves.


Subject(s)
Chromosomes , Genomics , Molecular Sequence Annotation , DNA Transposable Elements
6.
J Exp Bot ; 74(5): 1331-1342, 2023 03 13.
Article in English | MEDLINE | ID: mdl-36527431

ABSTRACT

The wild relatives of rice hold unexplored genetic diversity that can be employed to feed an estimated population of 10 billion by 2050. The Oryza Map Alignment Project (OMAP) initiated in 2003 has provided comprehensive genomic resources for comparative, evolutionary, and functional characterization of the wild relatives of rice, facilitating the cloning of >600 rice genes, including those for grain width (GW5) and submergence tolerance (SUB1A). Following in the footsteps of the original project, the goal of 'IOMAP: the Americas' is to investigate the present and historic genetic diversity of wild Oryza species endemic to the Americas through the sequencing of herbaria and in situ specimens. The generation of a large diversity panel describing past and current genetic status and potential erosion of genetic variation in the populations will provide useful knowledge for the conservation of the biodiversity in these species. The wild relatives of rice in the Americas present a wide range of resistance traits useful for crop improvement and neodomestication approaches. In the race against time for a sustainable food future, the neodomestication of the first cereal species recently accomplished in O. alta opens the door to the potential neodomestication of the other wild Oryza species in Americas.


Subject(s)
Oryza , Oryza/genetics , Phenotype , Genomics , Edible Grain/genetics
7.
Plant J ; 113(4): 734-748, 2023 02.
Article in English | MEDLINE | ID: mdl-36573648

ABSTRACT

Transposable elements (TEs) are an important source of genome variability, playing many roles in the evolution of eukaryotic species. Besides well-known phenomena, TEs may undergo the exaptation process and generate the so-called exapted transposable element genes (ETEs). Here we present a genome-wide survey of ETEs in the large genome of sunflower (Helianthus annuus L.), in which the massive amount of TEs, provides a significant source for exaptation. A library of sunflower TEs was used to build TE-specific Hidden Markov Model profiles, to search for all available sunflower gene products. In doing so, 20 016 putative ETEs were identified and further investigated for the characteristics that distinguish TEs from genes, leading to the validation of 3530 ETEs. The analysis of ETEs transcription patterns under different stress conditions showed a differential regulation triggered by treatments mimicking biotic and abiotic stress; furthermore, the distribution of functional domains of differentially regulated ETEs revealed a relevant presence of domains involved in many aspects of cellular functions. A comparative genomic investigation was performed including species representative of Asterids and appropriate outgroups: the bulk of ETEs that resulted were specific to the sunflower, while few ETEs presented orthologues in the genome of all analyzed species, making the hypothesis of a conserved function. This study highlights the crucial role played by exaptation, actively contributing to species evolution.


Subject(s)
DNA Transposable Elements , Helianthus , DNA Transposable Elements/genetics , Helianthus/genetics , Genome, Plant/genetics , Evolution, Molecular , Genomics
8.
Cell ; 185(1): 204-217.e14, 2022 01 06.
Article in English | MEDLINE | ID: mdl-34965378

ABSTRACT

Conifers dominate the world's forest ecosystems and are the most widely planted tree species. Their giant and complex genomes present great challenges for assembling a complete reference genome for evolutionary and genomic studies. We present a 25.4-Gb chromosome-level assembly of Chinese pine (Pinus tabuliformis) and revealed that its genome size is mostly attributable to huge intergenic regions and long introns with high transposable element (TE) content. Large genes with long introns exhibited higher expressions levels. Despite a lack of recent whole-genome duplication, 91.2% of genes were duplicated through dispersed duplication, and expanded gene families are mainly related to stress responses, which may underpin conifers' adaptation, particularly in cold and/or arid conditions. The reproductive regulation network is distinct compared with angiosperms. Slow removal of TEs with high-level methylation may have contributed to genomic expansion. This study provides insights into conifer evolution and resources for advancing research on conifer adaptation and development.


Subject(s)
Epigenome , Evolution, Molecular , Gene Expression Regulation, Plant , Genes, Plant , Pinus/genetics , Acclimatization/genetics , Chromosomes, Plant/genetics , Cycadopsida/genetics , DNA Transposable Elements/genetics , Forests , Gene Regulatory Networks , Genome Size , Genomics/methods , Introns , Magnoliopsida/genetics
9.
Genome Biol Evol ; 13(10)2021 10 01.
Article in English | MEDLINE | ID: mdl-34599322

ABSTRACT

Genome sizes of eukaryotic organisms vary substantially, with whole-genome duplications (WGD) and transposable element expansion acting as main drivers for rapid genome size increase. The two North American mudminnows, Umbra limi and Umbra pygmaea, feature genomes about twice the size of their sister lineage Esocidae (e.g., pikes and pickerels). However, it is unknown whether all Umbra species share this genome expansion and which causal mechanisms drive this expansion. Using flow cytometry, we find that the genome of the European mudminnow is expanded similarly to both North American species, ranging between 4.5 and 5.4 pg per diploid nucleus. Observed blocks of interstitially located telomeric repeats in U. limi suggest frequent Robertsonian rearrangements in its history. Comparative analyses of transcriptome and genome assemblies show that the genome expansion in Umbra is driven by the expansion of DNA transposon and unclassified repeat sequences without WGD. Furthermore, we find a substantial ongoing expansion of repeat sequences in the Alaska blackfish Dallia pectoralis, the closest relative to the family Umbridae, which might mark the beginning of a similar genome expansion. Our study suggests that the genome expansion in mudminnows, driven mainly by transposon expansion, but not WGD, occurred before the separation into the American and European lineage.


Subject(s)
Umbridae , Animals , DNA Transposable Elements/genetics , Genome Size , Umbridae/genetics
10.
Mol Plant ; 14(10): 1757-1767, 2021 10 04.
Article in English | MEDLINE | ID: mdl-34171480

ABSTRACT

Rice (Oryza sativa), a major staple throughout the world and a model system for plant genomics and breeding, was the first crop genome sequenced almost two decades ago. However, reference genomes for all higher organisms to date contain gaps and missing sequences. Here, we report the assembly and analysis of gap-free reference genome sequences for two elite O. sativa xian/indica rice varieties, Zhenshan 97 and Minghui 63, which are being used as a model system for studying heterosis and yield. Gap-free reference genomes provide the opportunity for a global view of the structure and function of centromeres. We show that all rice centromeric regions share conserved centromere-specific satellite motifs with different copy numbers and structures. In addition, the similarity of CentO repeats in the same chromosome is higher than across chromosomes, supporting a model of local expansion and homogenization. Both genomes have over 395 non-TE genes located in centromere regions, of which ∼41% are actively transcribed. Two large structural variants at the end of chromosome 11 affect the copy number of resistance genes between the two genomes. The availability of the two gap-free genomes lays a solid foundation for further understanding genome structure and function in plants and breeding climate-resilient varieties.


Subject(s)
Centromere , Chromosomes, Plant , Genome, Plant , Oryza/genetics , Molecular Sequence Annotation , Species Specificity , Whole Genome Sequencing
11.
Cell ; 184(5): 1156-1170.e14, 2021 03 04.
Article in English | MEDLINE | ID: mdl-33539781

ABSTRACT

Cultivated rice varieties are all diploid, and polyploidization of rice has long been desired because of its advantages in genome buffering, vigorousness, and environmental robustness. However, a workable route remains elusive. Here, we describe a practical strategy, namely de novo domestication of wild allotetraploid rice. By screening allotetraploid wild rice inventory, we identified one genotype of Oryza alta (CCDD), polyploid rice 1 (PPR1), and established two important resources for its de novo domestication: (1) an efficient tissue culture, transformation, and genome editing system and (2) a high-quality genome assembly discriminated into two subgenomes of 12 chromosomes apiece. With these resources, we show that six agronomically important traits could be rapidly improved by editing O. alta homologs of the genes controlling these traits in diploid rice. Our results demonstrate the possibility that de novo domesticated allotetraploid rice can be developed into a new staple cereal to strengthen world food security.


Subject(s)
Crops, Agricultural/genetics , Domestication , Oryza/genetics , CRISPR-Cas Systems , Food Security , Gene Editing , Genetic Variation , Genome, Plant , Oryza/classification , Polyploidy
12.
Front Plant Sci ; 11: 579980, 2020.
Article in English | MEDLINE | ID: mdl-33072154

ABSTRACT

As the world's population expands from 7.6 billion to 10 billion over the next 30 years, scientists and farmers across the globe must explore every angle necessary to provide a safe, stable and sustainable food supply for generations to come. Rice, and its wild relatives in the genus Oryza, will play a significant role in helping to solve this 10 billion people question due to its place as a staple food for billions. The genus Oryza is composed of 27 species that span 15 million years of evolutionary diversification and have been shown to contain a plethora of untapped adaptive traits, e.g., biotic and abiotic resistances, which can be used to improve cultivated rice. Such traits can be introduced into cultivated rice, in some cases by conventional crossing, and others via genetic transformation and gene editing methods. In cases where traits are too complex to easily transfer to cultivated rice [e.g., quantitative trait loci (QTL)], an alternative strategy is to domesticate the wild relative that already contains the desired adaptive traits - i.e., "neodomestication". To utilize the Oryza genus for crop improvement and neodomestication, we first need a set of genomic resources that can be used to efficiently identify, capture, and guide molecular crop improvement. Here, we introduce the concept of platinum standard reference genome sequences (PSRefSeq) - a new standard by which contiguous near-gap free reference genomes can now be produced. By having a set of PSRefSeqs for every Oryza species we set a new bar for how crop wild relatives can be integrated into crop improvement programs.

13.
Plant J ; 103(4): 1420-1432, 2020 08.
Article in English | MEDLINE | ID: mdl-32391598

ABSTRACT

Sweet cherry (Prunus avium L.) trees are both economically important fruit crops but also important components of natural forest ecosystems in Europe, Asia and Africa. Wild and domesticated trees currently coexist in the same geographic areas with important questions arising on their historical relationships. Little is known about the effects of the domestication process on the evolution of the sweet cherry genome. We assembled and annotated the genome of the cultivated variety "Big Star*" and assessed the genetic diversity among 97 sweet cherry accessions representing three different stages in the domestication and breeding process (wild trees, landraces and modern varieties). The genetic diversity analysis revealed significant genome-wide losses of variation among the three stages and supports a clear distinction between wild and domesticated trees, with only limited gene flow being detected between wild trees and domesticated landraces. We identified 11 domestication sweeps and five breeding sweeps covering, respectively, 11.0 and 2.4 Mb of the P. avium genome. A considerable fraction of the domestication sweeps overlaps with those detected in the related species, Prunus persica (peach), indicating that artificial selection during domestication may have acted independently on the same regions and genes in the two species. We detected 104 candidate genes in sweep regions involved in different processes, such as the determination of fruit texture, the regulation of flowering and fruit ripening and the resistance to pathogens. The signatures of selection identified will enable future evolutionary studies and provide a valuable resource for genetic improvement and conservation programs in sweet cherry.


Subject(s)
Domestication , Genome, Plant/genetics , Prunus avium/genetics , Chromosome Mapping , Chromosomes, Plant/genetics , DNA, Satellite/genetics , Genes, Plant/genetics , Genetic Variation/genetics , Genetics, Population
14.
Sci Data ; 7(1): 113, 2020 04 07.
Article in English | MEDLINE | ID: mdl-32265447

ABSTRACT

As the human population grows from 7.8 billion to 10 billion over the next 30 years, breeders must do everything possible to create crops that are highly productive and nutritious, while simultaneously having less of an environmental footprint. Rice will play a critical role in meeting this demand and thus, knowledge of the full repertoire of genetic diversity that exists in germplasm banks across the globe is required. To meet this demand, we describe the generation, validation and preliminary analyses of transposable element and long-range structural variation content of 12 near-gap-free reference genome sequences (RefSeqs) from representatives of 12 of 15 subpopulations of cultivated Asian rice. When combined with 4 existing RefSeqs, that represent the 3 remaining rice subpopulations and the largest admixed population, this collection of 16 Platinum Standard RefSeqs (PSRefSeq) can be used as a template to map resequencing data to detect virtually all standing natural variation that exists in the pan-genome of cultivated Asian rice.


Subject(s)
Genome, Plant , Oryza/genetics , Crops, Agricultural/genetics , Genetic Variation , Genomics
15.
Plant J ; 102(3): 600-614, 2020 05.
Article in English | MEDLINE | ID: mdl-31808196

ABSTRACT

Due to DNA heterozygosity and repeat content, assembly of non-model plant genomes is challenging. Herein, we report a high-quality genome reference of one of the oldest known domesticated species, fig (Ficus carica L.), using Pacific Biosciences single-molecule, real-time sequencing. The fig genome is ~333 Mbp in size, of which 80% has been anchored to 13 chromosomes. Genome-wide analysis of N6 -methyladenine and N4 -methylcytosine revealed high methylation levels in both genes and transposable elements, and a prevalence of methylated over non-methylated genes. Furthermore, the characterization of N6 -methyladenine sites led to the identification of ANHGA, a species-specific motif, which is prevalent for both genes and transposable elements. Finally, exploiting the contiguity of the 13 pseudomolecules, we identified 13 putative centromeric regions. The high-quality reference genome and the characterization of methylation profiles, provides an important resource for both fig breeding and for fundamental research into the relationship between epigenetic changes and phenotype, using fig as a model species.


Subject(s)
Epigenesis, Genetic/genetics , Ficus/genetics , Genome, Plant/genetics , Adenosine/analogs & derivatives , Adenosine/metabolism , Haplotypes , Phenotype
17.
Proc Natl Acad Sci U S A ; 115(46): E10970-E10978, 2018 11 13.
Article in English | MEDLINE | ID: mdl-30373829

ABSTRACT

The Populus genus is one of the major plant model systems, but genomic resources have thus far primarily been available for poplar species, and primarily Populus trichocarpa (Torr. & Gray), which was the first tree with a whole-genome assembly. To further advance evolutionary and functional genomic analyses in Populus, we produced genome assemblies and population genetics resources of two aspen species, Populus tremula L. and Populus tremuloides Michx. The two aspen species have distributions spanning the Northern Hemisphere, where they are keystone species supporting a wide variety of dependent communities and produce a diverse array of secondary metabolites. Our analyses show that the two aspens share a similar genome structure and a highly conserved gene content with P. trichocarpa but display substantially higher levels of heterozygosity. Based on population resequencing data, we observed widespread positive and negative selection acting on both coding and noncoding regions. Furthermore, patterns of genetic diversity and molecular evolution in aspen are influenced by a number of features, such as expression level, coexpression network connectivity, and regulatory variation. To maximize the community utility of these resources, we have integrated all presented data within the PopGenIE web resource (PopGenIE.org).


Subject(s)
Populus/genetics , Biological Evolution , DNA, Plant/genetics , Evolution, Molecular , Genetic Variation , Genetics, Population/methods , Genome, Plant , Genomics , Linkage Disequilibrium/genetics , Phylogeny , Selection, Genetic/genetics , Sequence Analysis, DNA/methods , Trees/genetics
18.
Nat Genet ; 50(2): 285-296, 2018 02.
Article in English | MEDLINE | ID: mdl-29358651

ABSTRACT

The genus Oryza is a model system for the study of molecular evolution over time scales ranging from a few thousand to 15 million years. Using 13 reference genomes spanning the Oryza species tree, we show that despite few large-scale chromosomal rearrangements rapid species diversification is mirrored by lineage-specific emergence and turnover of many novel elements, including transposons, and potential new coding and noncoding genes. Our study resolves controversial areas of the Oryza phylogeny, showing a complex history of introgression among different chromosomes in the young 'AA' subclade containing the two domesticated species. This study highlights the prevalence of functionally coupled disease resistance genes and identifies many new haplotypes of potential use for future crop protection. Finally, this study marks a milestone in modern rice research with the release of a complete long-read assembly of IR 8 'Miracle Rice', which relieved famine and drove the Green Revolution in Asia 50 years ago.


Subject(s)
Crops, Agricultural/genetics , Evolution, Molecular , Genetic Variation , Oryza/classification , Oryza/genetics , Conserved Sequence , Domestication , Genetic Speciation , Genome, Plant , Phylogeny
19.
Genome Biol Evol ; 9(12): 3449-3462, 2017 12 01.
Article in English | MEDLINE | ID: mdl-29228262

ABSTRACT

The accumulation and removal of transposable elements (TEs) is a major driver of genome size evolution in eukaryotes. In plants, long terminal repeat (LTR) retrotransposons (LTR-RTs) represent the majority of TEs and form most of the nuclear DNA in large genomes. Unequal recombination (UR) between LTRs leads to removal of intervening sequence and formation of solo-LTRs. UR is a major mechanism of LTR-RT removal in many angiosperms, but our understanding of LTR-RT-associated recombination within the large, LTR-RT-rich genomes of conifers is quite limited. We employ a novel read-based methodology to estimate the relative rates of LTR-RT-associated UR within the genomes of four conifer and seven angiosperm species. We found the lowest rates of UR in the largest genomes studied, conifers and the angiosperm maize. Recombination may also resolve as gene conversion, which does not remove sequence, so we analyzed LTR-RT-associated gene conversion events (GCEs) in Norway spruce and six angiosperms. Opposite the trend for UR, we found the highest rates of GCEs in Norway spruce and maize. Unlike previous work in angiosperms, we found no evidence that rates of UR correlate with retroelement structural features in the conifers, suggesting that another process is suppressing UR in these species. Recent results from diverse eukaryotes indicate that heterochromatin affects the resolution of recombination, by favoring gene conversion over crossing-over, similar to our observation of opposed rates of UR and GCEs. Control of LTR-RT proliferation via formation of heterochromatin would be a likely step toward large genomes in eukaryotes carrying high LTR-RT content.


Subject(s)
Picea/genetics , Recombination, Genetic , Retroelements , Terminal Repeat Sequences , Zea mays/genetics , Computational Biology , Evolution, Molecular , Gene Conversion , Genome Size , Magnoliopsida/genetics
20.
Mol Genet Genomics ; 292(1): 157-171, 2017 Feb.
Article in English | MEDLINE | ID: mdl-27778102

ABSTRACT

Transposable elements (TEs) are the most abundant genetic material for almost all eukaryotic genomes. Their effects on the host genomes range from an extensive size variation to the regulation of gene expression, altering gene function and creating new genes. Because of TEs pivotal contribute to the host genome structure and regulation, their identification and characterization provide a wealth of useful data for gaining an in-depth understanding of host genome functioning. The giant reed (Arundo donax) is a perennial rhizomatous C3 grass, octadecaploid, with an estimated nuclear genome size of 2744 Mbp. It is a promising feedstock for second-generation biofuels and biomethane production. To identify and characterize the most repetitive TEs in the genomes of A. donax and its ancestral A. plinii species, we carried out low-coverage whole genome shotgun sequencing for both species. Using a de novo repeat identification approach, 33,041 and 28,237 non-redundant repetitive sequences were identified and characterized in A. donax and A. plinii genomes, representing 37.55 and 31.68% of each genome, respectively. Comparative phylogenetic analyses, including the major TE classes identified in A. donax and A. plinii, together with rice and maize TE paralogs, were carried out to understand the evolutionary relationship of the most abundant TE classes. Highly conserved copies of RIRE1-like Ty1-Copia elements were discovered in two Arundo spp. in which they represented nearly 3% of each genomic sequence. We identified and characterized the medium/highly repetitive TEs in two unexplored polyploid genomes, thus generating useful information for the study of the genomic structure, composition, and functioning of these two non-model species. We provided a valuable resource that could be exploited in any effort aimed at sequencing and assembling these two genomes.


Subject(s)
Biofuels , DNA Transposable Elements , Poaceae/classification , Poaceae/genetics , DNA, Plant/genetics , Evolution, Molecular , High-Throughput Nucleotide Sequencing , Sequence Analysis, DNA , Terminal Repeat Sequences
SELECTION OF CITATIONS
SEARCH DETAIL
...