Results 1 - 20 of 23
1.
Plant J ; 116(4): 1003-1017, 2023 Nov.
Article in English | MEDLINE | ID: mdl-37675609

ABSTRACT

Populus species play a foundational role in diverse ecosystems and are important renewable feedstocks for bioenergy and bioproducts. Hybrid aspen Populus tremula × P. alba INRA 717-1B4 is a widely used transformation model in tree functional genomics and biotechnology research. As an outcrossing interspecific hybrid, its genome is riddled with sequence polymorphisms that present a challenge for sequence-sensitive analyses. Here we report a telomere-to-telomere genome for this hybrid aspen with two chromosome-scale, haplotype-resolved assemblies. We performed a comprehensive analysis of the repetitive landscape and identified both tandem repeat array-based and array-less centromeres. Unexpectedly, the most abundant satellite repeats in both haplotypes lie outside of the centromeres, consist of a 147 bp monomer PtaM147, frequently span >1 megabase, and form heterochromatic knobs. PtaM147 repeats are detected exclusively in aspens (section Populus) but PtaM147-like sequences occur in LTR-retrotransposons of closely related species, suggesting their origin from the retrotransposons. The genomic resource generated for this transformation model genotype has greatly improved the design and analysis of genome editing experiments that are highly sensitive to sequence polymorphisms. The work should motivate future hypothesis-driven research to probe into the function of the abundant and aspen-specific PtaM147 satellite DNA.


Subject(s)
DNA, Satellite , Populus , DNA, Satellite/genetics , Haplotypes/genetics , Populus/genetics , Ecosystem , Retroelements , Centromere/genetics
2.
BMC Genomics ; 20(1): 905, 2019 Nov 27.
Article in English | MEDLINE | ID: mdl-31775618

ABSTRACT

BACKGROUND: The availability of thousands of complete rice genome sequences from diverse varieties and accessions has laid the foundation for in-depth exploration of the rice genome. One drawback to these collections is that most of these rice varieties have long life cycles, and/or low transformation efficiencies, which limits their usefulness as model organisms for functional genomics studies. In contrast, the rice variety Kitaake has a rapid life cycle (9 weeks seed to seed) and is easy to transform and propagate. For these reasons, Kitaake has emerged as a model for studies of diverse monocotyledonous species. RESULTS: Here, we report the de novo genome sequencing and analysis of Oryza sativa ssp. japonica variety KitaakeX, a Kitaake plant carrying the rice XA21 immune receptor. Our KitaakeX sequence assembly contains 377.6 Mb, consisting of 33 scaffolds (476 contigs) with a contig N50 of 1.4 Mb. Complementing the assembly are detailed gene annotations of 35,594 protein coding genes. We identified 331,335 genomic variations between KitaakeX and Nipponbare (ssp. japonica), and 2,785,991 variations between KitaakeX and Zhenshan97 (ssp. indica). We also compared Kitaake resequencing reads to the KitaakeX assembly and identified 219 small variations. The high-quality genome of the model rice plant KitaakeX will accelerate rice functional genomics. CONCLUSIONS: The high quality, de novo assembly of the KitaakeX genome will serve as a useful reference genome for rice and will accelerate functional genomics studies of rice and other species.


Subject(s)
Genome, Plant , Genomics , Oryza/genetics , Whole Genome Sequencing , Computational Biology/methods , Genetic Variation , Genomics/methods , Molecular Sequence Annotation , Oryza/classification , Phenotype
3.
Proc Natl Acad Sci U S A ; 113(35): E5163-71, 2016 08 30.
Article in English | MEDLINE | ID: mdl-27535938

ABSTRACT

Asian cultivated rice consists of two subspecies: Oryza sativa subsp. indica and O. sativa subsp. japonica. Despite the fact that indica rice accounts for over 70% of total rice production worldwide and is genetically much more diverse, a high-quality reference genome for indica rice has yet to be published. We conducted map-based sequencing of two indica rice lines, Zhenshan 97 (ZS97) and Minghui 63 (MH63), which represent the two major varietal groups of the indica subspecies and are the parents of an elite Chinese hybrid. The genome sequences were assembled into 237 (ZS97) and 181 (MH63) contigs, with an accuracy >99.99%, and covered 90.6% and 93.2% of their estimated genome sizes. Comparative analyses of these two indica genomes uncovered surprising structural differences, especially with respect to inversions, translocations, presence/absence variations, and segmental duplications. Approximately 42% of nontransposable element related genes were identical between the two genomes. Transcriptome analysis of three tissues showed that 1,059-2,217 more genes were expressed in the hybrid than in the parents and that the expressed genes in the hybrid were much more diverse due to their divergence between the parental genomes. The public availability of two high-quality reference genomes for the indica subspecies of rice will have wide-ranging implications for plant biology and crop genetic improvement.


Subject(s)
Chromosomes, Plant/genetics , Genetic Variation , Genome, Plant/genetics , Oryza/genetics , Chromosome Mapping/methods , Gene Expression Profiling , Gene Expression Regulation, Plant , Genes, Plant/genetics , INDEL Mutation , Oryza/classification , Polymorphism, Single Nucleotide , Species Specificity
4.
ACS Chem Biol ; 19(1): 185-192, 2024 Jan 19.
Article in English | MEDLINE | ID: mdl-38081799

ABSTRACT

Red algae or seaweeds produce highly distinctive halogenated terpenoid compounds, including the pentabromochlorinated monoterpene halomon that was once heralded as a promising anticancer agent. The first dedicated step in the biosynthesis of these natural product molecules is expected to be catalyzed by terpene synthase (TS) enzymes. Recent work has demonstrated an emerging class of type I TSs in red algal terpene biosynthesis. However, only one such enzyme from a notoriously haloterpenoid-producing red alga (Laurencia pacifica) has been functionally characterized and the product structure is not related to halogenated terpenoids. Herein, we report 10 new type I TSs from the red algae Portieria hornemannii, Plocamium pacificum, L. pacifica, and Laurencia subopposita that produce a diversity of halogenated mono- and sesquiterpenes. We used a combination of genome sequencing, terpenoid metabolomics, in vitro biochemistry, and bioinformatics to establish red algal TSs in all four species, including those associated with the selective production of key halogenated terpene precursors myrcene, trans-β-ocimene, and germacrene D-4-ol. These results expand on a small but growing number of characterized red algal TSs and offer insight into the biosynthesis of iconic halogenated algal compounds that are not without precedent elsewhere in biology.


Subject(s)
Alkyl and Aryl Transferases , Rhodophyta , Rhodophyta/chemistry , Terpenes/chemistry , Monoterpenes/chemistry
5.
Nat Plants ; 9(2): 238-254, 2023 02.
Article in English | MEDLINE | ID: mdl-36747050

ABSTRACT

Peatlands are crucial sinks for atmospheric carbon but are critically threatened due to warming climates. Sphagnum (peat moss) species are keystone members of peatland communities where they actively engineer hyperacidic conditions, which improves their competitive advantage and accelerates ecosystem-level carbon sequestration. To dissect the molecular and physiological sources of this unique biology, we generated chromosome-scale genomes of two Sphagnum species: S. divinum and S. angustifolium. Sphagnum genomes show no gene colinearity with any other reference genome to date, demonstrating that Sphagnum represents an unsampled lineage of land plant evolution. The genomes also revealed an average recombination rate an order of magnitude higher than vascular land plants and short putative U/V sex chromosomes. These newly described sex chromosomes interact with autosomal loci that significantly impact growth across diverse pH conditions. This discovery demonstrates that the ability of Sphagnum to sequester carbon in acidic peat bogs is mediated by interactions between sex, autosomes and environment.


Subject(s)
Ecosystem , Sphagnopsida , Carbon Sequestration , Sphagnopsida/physiology , Climate , Sex Chromosomes
6.
Plant J ; 63(3): 430-42, 2010 Aug.
Article in English | MEDLINE | ID: mdl-20487382

ABSTRACT

Despite knowledge that polyploidy is widespread and a major evolutionary force in flowering plant diversification, detailed comparative molecular studies on polyploidy have been confined to only a few species and families. The genus Oryza is composed of 23 species that are classified into ten distinct 'genome types' (six diploid and four polyploid), and is emerging as a powerful new model system to study polyploidy. Here we report the identification, sequence and comprehensive comparative annotation of eight homoeologous genomes from a single orthologous region (Adh1-Adh2) from four allopolyploid species representing each of the known Oryza genome types (BC, CD, HJ and KL). Detailed comparative phylogenomic analyses of these regions within and across species and ploidy levels provided several insights into the spatio-temporal dynamics of genome organization and evolution of this region in 'natural' polyploids of Oryza. The major findings of this study are that: (i) homoeologous genomic regions within the same nucleus experience both independent and parallel evolution, (ii) differential lineage-specific selection pressures do not occur between polyploids and their diploid progenitors, (iii) there have been no dramatic structural changes relative to the diploid ancestors, (iv) a variation in the molecular evolutionary rate exists between the two genomes in the BC complex species even though the BC and CD polyploid species appear to have arisen <2 million years ago, and (v) there are no clear distinctions in the patterns of genome evolution in the diploid versus polyploid species.


Subject(s)
Evolution, Molecular , Genome, Plant , Oryza/genetics , Tetraploidy , Chromosomes, Artificial, Bacterial , Genes, Plant , Molecular Sequence Data , Phylogeny , Retroelements
7.
Plant Genome ; 14(3): e20114, 2021 11.
Article in English | MEDLINE | ID: mdl-34275202

ABSTRACT

The stiff-stalk heterotic group in maize (Zea mays L.) is an important source of inbreds used in U.S. commercial hybrid production. Founder inbreds B14, B37, B73, and, to a lesser extent, B84, are found in the pedigrees of a majority of commercial seed parent inbred lines. We created high-quality genome assemblies of B84 and four expired Plant Variety Protection (ex-PVP) lines: LH145 representing B14, NKH8431 of mixed descent, PHB47 representing B37, and PHJ40, which is a Pioneer Hi-Bred International (PHI) early stiff-stalk type. Sequence was generated using long-read sequencing, achieving highly contiguous assemblies of 2.13-2.18 Gbp with N50 scaffold lengths >200 Mbp. Inbred-specific gene annotations were generated using a core five-tissue gene expression atlas, whereas transposable element (TE) annotation was conducted using de novo and homology-directed methodologies. Compared with the reference inbred B73, synteny analyses revealed extensive collinearity across the five stiff-stalk genomes, although unique components of the maize pangenome were detected. Comparison of this set of stiff-stalk inbreds with the original Iowa Stiff Stalk Synthetic breeding population revealed that these inbreds represent only a proportion of variation in the original stiff-stalk pool and there are highly conserved haplotypes in released public and ex-PVP inbreds. Despite the reduction in variation from the original stiff-stalk population, substantial genetic and genomic variation was identified, supporting the potential for continued breeding success in this pool. The assemblies described here represent stiff-stalk inbreds that have historical and commercial relevance and provide further insight into the emerging maize pangenome.


Subject(s)
Plant Breeding , Zea mays , Genomics , Haplotypes , Hybrid Vigor , Zea mays/genetics
8.
Mol Plant ; 14(10): 1757-1767, 2021 10 04.
Article in English | MEDLINE | ID: mdl-34171480

ABSTRACT

Rice (Oryza sativa), a major staple throughout the world and a model system for plant genomics and breeding, was the first crop genome sequenced almost two decades ago. However, reference genomes for all higher organisms to date contain gaps and missing sequences. Here, we report the assembly and analysis of gap-free reference genome sequences for two elite O. sativa xian/indica rice varieties, Zhenshan 97 and Minghui 63, which are being used as a model system for studying heterosis and yield. Gap-free reference genomes provide the opportunity for a global view of the structure and function of centromeres. We show that all rice centromeric regions share conserved centromere-specific satellite motifs with different copy numbers and structures. In addition, the similarity of CentO repeats in the same chromosome is higher than across chromosomes, supporting a model of local expansion and homogenization. Both genomes have over 395 non-TE genes located in centromere regions, of which ∼41% are actively transcribed. Two large structural variants at the end of chromosome 11 affect the copy number of resistance genes between the two genomes. The availability of the two gap-free genomes lays a solid foundation for further understanding genome structure and function in plants and breeding climate-resilient varieties.


Subject(s)
Centromere , Chromosomes, Plant , Genome, Plant , Oryza/genetics , Molecular Sequence Annotation , Species Specificity , Whole Genome Sequencing
9.
Genome Biol ; 21(1): 259, 2020 10 06.
Article in English | MEDLINE | ID: mdl-33023654

ABSTRACT

BACKGROUND: Plants can transmit somatic mutations and epimutations to offspring, which in turn can affect fitness. Knowledge of the rate at which these variations arise is necessary to understand how plant development contributes to local adaptation in an eco-evolutionary context, particularly in long-lived perennials. RESULTS: Here, we generate a new high-quality reference genome from the oldest branch of a wild Populus trichocarpa tree with two dominant stems which have been evolving independently for 330 years. By sampling multiple, age-estimated branches of this tree, we use a multi-omics approach to quantify age-related somatic changes at the genetic, epigenetic, and transcriptional level. We show that the per-year somatic mutation and epimutation rates are lower than in annuals and that transcriptional variation is mainly independent of age divergence and cytosine methylation. Furthermore, a detailed analysis of the somatic epimutation spectrum indicates that transgenerationally heritable epimutations originate mainly from DNA methylation maintenance errors during mitotic rather than during meiotic cell divisions. CONCLUSION: Taken together, our study provides unprecedented insights into the origin of nucleotide and functional variation in a long-lived perennial plant.


Subject(s)
Genome, Plant , Mutation Rate , Populus/genetics , Age Factors , DNA Methylation , Epigenesis, Genetic , Gene Expression , Molecular Sequence Annotation
10.
Nat Commun ; 10(1): 4680, 2019 10 15.
Article in English | MEDLINE | ID: mdl-31615981

ABSTRACT

Date palms (Phoenix dactylifera) are an important fruit crop of arid regions of the Middle East and North Africa. Despite its importance, few genomic resources exist for date palms, hampering evolutionary genomic studies of this perennial species. Here we report an improved long-read genome assembly for P. dactylifera that is 772.3 Mb in length, with contig N50 of 897.2 kb, and use this to perform genome-wide association studies (GWAS) of the sex determining region and 21 fruit traits. We find a fruit color GWAS peak at the R2R3-MYB transcription factor VIRESCENS gene and identify functional alleles that include a retrotransposon insertion and start codon mutation. We also find a GWAS peak for sugar composition spanning deletion polymorphisms in multiple linked invertase genes. MYB transcription factors and invertase are implicated in fruit color and sugar composition in other crops, demonstrating the importance of parallel evolution in the evolutionary diversification of domesticated species.


Subject(s)
Fruit/chemistry , Phoeniceae/genetics , Pigmentation/genetics , Sex Determination Processes/genetics , Alleles , Chromosome Mapping , Codon, Initiator , DNA, Plant/genetics , Fructose , Fruit/genetics , Genome, Plant/genetics , Genome-Wide Association Study , Glucose , Mutation , Phenotype , Polymorphism, Genetic , Retroelements , Sequence Analysis, DNA , Starch , Sucrose , beta-Fructofuranosidase/genetics
11.
BMC Genomics ; 9: 621, 2008 Dec 19.
Article in English | MEDLINE | ID: mdl-19099592

ABSTRACT

BACKGROUND: Many plant genomes are resistant to whole-genome assembly due to an abundance of repetitive sequence, leading to the development of gene-rich sequencing techniques. Two such techniques are hypomethylated partial restriction (HMPR) and methylation spanning linker libraries (MSLL). These libraries differ from other gene-rich datasets in having larger insert sizes, and the MSLL clones are designed to provide reads localized to "epigenetic boundaries" where methylation begins or ends. RESULTS: A large-scale study in maize generated 40,299 HMPR sequences and 80,723 MSLL sequences, including MSLL clones exceeding 100 kb. The paired end reads of MSLL and HMPR clones were shown to be effective in linking existing gene-rich sequences into scaffolds. In addition, it was shown that the MSLL clones can be used for anchoring these scaffolds to a BAC-based physical map. The MSLL end reads effectively identified epigenetic boundaries, as indicated by their preferential alignment to regions upstream and downstream from annotated genes. The ability to precisely map long stretches of fully methylated DNA sequence is a unique outcome of MSLL analysis, and was also shown to provide evidence for errors in gene identification. MSLL clones were observed to be significantly more repeat-rich in their interiors than in their end reads, confirming the correlation between methylation and retroelement content. Both MSLL and HMPR reads were found to be substantially gene-enriched, with the SalI MSLL libraries being the most highly enriched (31% align to an EST contig), while the HMPR clones exhibited exceptional depletion of repetitive DNA (to approximately 11%). These two techniques were compared with other gene-enrichment methods, and shown to be complementary. CONCLUSION: MSLL technology provides an unparalleled approach for mapping the epigenetic status of repetitive blocks and for identifying sequences mis-identified as genes. Although the types and natures of epigenetic boundaries are barely understood at this time, MSLL technology flags both approximate boundaries and methylated genes that deserve additional investigation. MSLL and HMPR sequences provide a valuable resource for maize genome annotation, and are a uniquely valuable complement to any plant genome sequencing project. In order to make these results fully accessible to the community, a web display was developed that shows the alignment of MSLL, HMPR, and other gene-rich sequences to the BACs; this display is continually updated with the latest ESTs and BAC sequences.


Subject(s)
Chromosome Mapping/methods , DNA Methylation , Genome, Plant , Zea mays/genetics , Chromosomes, Artificial, Bacterial , DNA, Plant/genetics , Epigenesis, Genetic , Gene Library , Genomics/methods , Sequence Alignment , Sequence Analysis, DNA/methods
13.
Nat Genet ; 50(2): 285-296, 2018 02.
Article in English | MEDLINE | ID: mdl-29358651

ABSTRACT

The genus Oryza is a model system for the study of molecular evolution over time scales ranging from a few thousand to 15 million years. Using 13 reference genomes spanning the Oryza species tree, we show that, despite few large-scale chromosomal rearrangements, rapid species diversification is mirrored by lineage-specific emergence and turnover of many novel elements, including transposons, and potential new coding and noncoding genes. Our study resolves controversial areas of the Oryza phylogeny, showing a complex history of introgression among different chromosomes in the young 'AA' subclade containing the two domesticated species. This study highlights the prevalence of functionally coupled disease resistance genes and identifies many new haplotypes of potential use for future crop protection. Finally, this study marks a milestone in modern rice research with the release of a complete long-read assembly of IR 8 'Miracle Rice', which relieved famine and drove the Green Revolution in Asia 50 years ago.


Subject(s)
Crops, Agricultural/genetics , Evolution, Molecular , Genetic Variation , Oryza/classification , Oryza/genetics , Conserved Sequence , Domestication , Genetic Speciation , Genome, Plant , Phylogeny
14.
BMC Evol Biol ; 7: 152, 2007 Aug 29.
Article in English | MEDLINE | ID: mdl-17727727

ABSTRACT

BACKGROUND: The genus Oryza is composed of 10 distinct genome types, 6 diploid and 4 polyploid, and includes the world's most important food crop - rice (Oryza sativa [AA]). Genome size variation in Oryza is more than 3-fold and ranges from 357 Mbp in Oryza glaberrima [AA] to 1283 Mbp in the polyploid Oryza ridleyi [HHJJ]. Because repetitive elements are known to play a significant role in genome size variation, we constructed random sheared small insert genomic libraries from 12 representative Oryza species and conducted a comprehensive study of the repetitive element composition, distribution and phylogeny in this genus. Particular attention was paid to the role played by the most important classes of transposable elements (Long Terminal Repeat Retrotransposons, Long Interspersed Nuclear Elements, helitrons, DNA transposable elements) in shaping these genomes and in contributing to genome size variation. RESULTS: We identified the elements primarily responsible for the most striking genome size variation in Oryza. We demonstrated how Long Terminal Repeat retrotransposons belonging to the same families have proliferated to very different extents in various species. We also showed that the pool of Long Terminal Repeat Retrotransposons is substantially conserved and ubiquitous throughout Oryza, and so its origin is ancient and its existence predates the speciation events that originated the genus. Finally, we described the peculiar behavior of repeats in the species Oryza coarctata [HHKK], whose placement in the Oryza genus is controversial. CONCLUSION: Long Terminal Repeat retrotransposons are the major component of the Oryza genomes analyzed and, along with polyploidization, are the most important contributors to the genome size variation across the Oryza genus. Two families of Ty3-gypsy elements (RIRE2 and Atlantys) account for a significant portion of the genome size variations present in the Oryza genus.


Subject(s)
DNA Transposable Elements , Genetic Variation , Genome, Plant , Oryza/genetics , Evolution, Molecular , Phylogeny , Terminal Repeat Sequences
15.
Nat Ecol Evol ; 1(10): 1585, 2017 10.
Article in English | MEDLINE | ID: mdl-29185503

ABSTRACT

In Fig. 5 of the version of this Article originally published, the final number on the x axes of each panel was incorrectly written as 1.5; it should have read 7.5. This has now been corrected in all versions of the Article.

16.
Nat Ecol Evol ; 1(5): 119, 2017 Apr 03.
Article in English | MEDLINE | ID: mdl-28812690

ABSTRACT

Fixed chromosomal inversions can reduce gene flow and promote speciation in two ways: by suppressing recombination and by carrying locally favoured alleles at multiple loci. However, it is unknown whether favoured mutations slowly accumulate on older inversions or if young inversions spread because they capture pre-existing adaptive quantitative trait loci (QTLs). By genetic mapping, chromosome painting and genome sequencing, we have identified a major inversion controlling ecologically important traits in Boechera stricta. The inversion arose since the last glaciation and subsequently reached local high frequency in a hybrid speciation zone. Furthermore, the inversion shows signs of positive directional selection. To test whether the inversion could have captured existing, linked QTLs, we crossed standard, collinear haplotypes from the hybrid zone and found multiple linked phenology QTLs within the inversion region. These findings provide the first direct evidence that linked, locally adapted QTLs may be captured by young inversions during incipient speciation.

17.
Genome Biol ; 17(1): 92, 2016 05 06.
Article in English | MEDLINE | ID: mdl-27154274

ABSTRACT

BACKGROUND: Mutator-like transposable elements, a class of DNA transposons, exist pervasively in both prokaryotic and eukaryotic genomes, with more than 10,000 copies identified in the rice genome. These elements can capture ectopic genomic sequences that lead to the formation of new gene structures. Here, based on whole-genome comparative analyses, we comprehensively investigated processes and mechanisms of the evolution of putative genes derived from Mutator-like transposable elements in ten Oryza species and the outgroup Leersia perrieri, bridging ~20 million years of evolutionary history. RESULTS: Our analysis identified thousands of putative genes in each of the Oryza species, a large proportion of which have evidence of expression and contain chimeric structures. Consistent with previous reports, we observe that the putative Mutator-like transposable element-derived genes are generally GC-rich and mainly derive from GC-rich parental sequences. Furthermore, we determine that Mutator-like transposable elements capture parental sequences preferentially from genomic regions with low methylation levels and high recombination rates. We explicitly show that methylation levels in the internal and terminal inverted repeat regions of these elements, which might be directed by the 24-nucleotide small RNA-mediated pathway, are different and change dynamically over evolutionary time. Lastly, we demonstrate that putative genes derived from Mutator-like transposable elements tend to be expressed in mature pollen, which has undergone de-methylation programming, thereby providing a permissive expression environment for newly formed/transposable element-derived genes. CONCLUSIONS: Our results suggest that DNA methylation may be a primary mechanism to facilitate the origination, survival, and regulation of genes derived from Mutator-like transposable elements, thus contributing to the evolution of gene innovation and novelty in plant genomes.


Subject(s)
DNA Methylation/genetics , DNA Transposable Elements/genetics , Evolution, Molecular , Oryza/genetics , Gene Expression Regulation, Plant , Genome, Plant , Genomics , RNA, Small Untranslated/genetics
18.
Sci Data ; 3: 160076, 2016 Sep 13.
Article in English | MEDLINE | ID: mdl-27622467

ABSTRACT

Over the past 30 years, we have performed many fundamental studies on two Oryza sativa subsp. indica varieties, Zhenshan 97 (ZS97) and Minghui 63 (MH63). To improve the resolution of many of these investigations, we generated two reference-quality genome assemblies using the most advanced sequencing technologies. Using PacBio SMRT technology, we produced over 108 (ZS97) and 174 (MH63) Gb of raw sequence data from 166 (ZS97) and 209 (MH63) pools of BAC clones, and generated ~97 (ZS97) and ~74 (MH63) Gb of paired-end whole-genome shotgun (WGS) sequence data with Illumina sequencing technology. With these data, we successfully assembled two platinum standard reference genomes that have been publicly released. Here we provide the full sets of raw data used to generate these two reference genome assemblies. These data sets can be used to test new programs for better genome assembly and annotation, aid in the discovery of new insights into genome structure, function, and evolution, and help to provide essential support to biological research in general.


Subject(s)
Genome , Oryza/genetics
19.
BMC Res Notes ; 5: 185, 2012 Apr 23.
Article in English | MEDLINE | ID: mdl-22524198

ABSTRACT

BACKGROUND: Sugarcane breeding has significantly progressed in the last 30 years, but achieving additional yield gains has been difficult because of the constraints imposed by the complex ploidy of this crop. Sugarcane cultivars are interspecific hybrids between Saccharum officinarum and Saccharum spontaneum. S. officinarum is an octoploid with 2n = 80 chromosomes while S. spontaneum has 2n = 40 to 128 chromosomes and ploidy varying from 5 to 16. The hybrid genome is composed of 70-80% S. officinarum and 5-20% S. spontaneum chromosomes and a small proportion of recombinants. Sequencing the genome of this complex crop may help identify useful genes, either per se or through comparative genomics using closely related grasses. The construction and sequencing of a bacterial artificial chromosome (BAC) library of an elite commercial variety of sugarcane could help assemble the sugarcane genome. RESULTS: A BAC library designated SS_SBa was constructed with DNA isolated from the commercial sugarcane variety SP80-3280. The library contains 36,864 clones with an average insert size of 125 Kb, 88% of which have inserts larger than 90 Kb. Based on the estimated genome size of 760-930 Mb, the library provides 5-6 times coverage of the monoploid sugarcane genome. Bidirectional BAC end sequences (BESs) from a random sample of 192 BAC clones sampled genes and repetitive elements of the sugarcane genome. Forty-five per cent of the total BES nucleotides represent repetitive elements, 83% of which belong to LTR retrotransposons. Alignment of BESs corresponding to 42 BACs to the genome sequence of the 10 sorghum chromosomes revealed regions of microsynteny, with expansions and contractions of sorghum genome regions relative to the sugarcane BAC clones. In general, the sampled sorghum genome regions presented an average 29% expansion in relation to the sugarcane syntenic BACs. CONCLUSION: The SS_SBa BAC library represents a new resource for sugarcane genome sequencing. An analysis of insert size, genome coverage and orthologous alignment with the sorghum genome revealed that the library presents whole genome coverage. The comparison of syntenic regions of the sorghum genome to 42 SS_SBa BES pairs revealed that the sorghum genome is expanded in relation to the sugarcane genome.
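
The 5-6 times coverage figure quoted in this abstract can be checked from the clone count, the average insert size, and the estimated genome size range. The sketch below is only an arithmetic illustration; the function and constant names are hypothetical and do not come from the paper, and it assumes coverage is simply total cloned DNA divided by genome size.

```python
# Figures quoted in the abstract above; names are illustrative only.
NUM_CLONES = 36_864            # clones in the SS_SBa library
MEAN_INSERT_KB = 125           # average insert size in kb
GENOME_SIZE_MB = (760, 930)    # estimated monoploid genome size range in Mb


def library_coverage(num_clones, mean_insert_kb, genome_size_mb):
    """Fold coverage = total cloned DNA (Mb) / genome size (Mb)."""
    total_insert_mb = num_clones * mean_insert_kb / 1000  # kb -> Mb
    return total_insert_mb / genome_size_mb


coverage_range = [
    round(library_coverage(NUM_CLONES, MEAN_INSERT_KB, size), 2)
    for size in GENOME_SIZE_MB
]
print(coverage_range)  # coverage at 760 Mb, then at 930 Mb
```

Under these assumptions the library holds about 4,608 Mb of cloned DNA, which gives roughly 6.1-fold coverage for a 760 Mb genome and roughly 5.0-fold for a 930 Mb genome, consistent with the stated 5-6 times.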


Subject(s)
Chromosomes, Artificial, Bacterial/genetics , Gene Library , Genome, Plant/genetics , Saccharum/genetics , Sorghum/genetics , Synteny/genetics , Chromosomes, Plant/genetics , Mutagenesis, Insertional/genetics , Oryza/genetics , Repetitive Sequences, Nucleic Acid/genetics , Sequence Analysis, DNA , Zea mays/genetics
20.
Plant Cell ; 20(12): 3191-209, 2008 Dec.
Article in English | MEDLINE | ID: mdl-19098269

ABSTRACT

Oryza (23 species; 10 genome types) contains the world's most important food crop - rice. Although the rice genome serves as an essential tool for biological research, little is known about the evolution of the other Oryza genome types. They contain a historical record of genomic changes that led to diversification of this genus around the world as well as an untapped reservoir of agriculturally important traits. To investigate the evolution of the collective Oryza genome, we sequenced and compared nine orthologous genomic regions encompassing the Adh1-Adh2 genes (from six diploid genome types) with the rice reference sequence. Our analysis revealed the architectural complexities and dynamic evolution of this region that have occurred over the past approximately 15 million years. Of the 46 intact genes and four pseudogenes in the japonica genome, 38 (76%) fell into eight multigene families. Analysis of the evolutionary history of each family revealed independent and lineage-specific gain and loss of gene family members as frequent causes of synteny disruption. Transposable elements were shown to mediate massive replacement of intergenic space (>95%), gene disruption, and gene/gene fragment movement. Three cases of long-range structural variation (inversions/deletions) spanning several hundred kilobases were identified that contributed significantly to genome diversification.


Subject(s)
Evolution, Molecular , Genome, Plant/genetics , Genomics/methods , Oryza/genetics , Molecular Sequence Data , Oryza/classification , Phylogeny , Plant Proteins/genetics , Plant Proteins/physiology