Results 1 - 20 of 200
1.
Cell ; 184(5): 1156-1170.e14, 2021 03 04.
Article in English | MEDLINE | ID: mdl-33539781

ABSTRACT

Cultivated rice varieties are all diploid, and polyploidization of rice has long been desired because of its advantages in genome buffering, vigorousness, and environmental robustness. However, a workable route remains elusive. Here, we describe a practical strategy, namely de novo domestication of wild allotetraploid rice. By screening allotetraploid wild rice inventory, we identified one genotype of Oryza alta (CCDD), polyploid rice 1 (PPR1), and established two important resources for its de novo domestication: (1) an efficient tissue culture, transformation, and genome editing system and (2) a high-quality genome assembly discriminated into two subgenomes of 12 chromosomes apiece. With these resources, we show that six agronomically important traits could be rapidly improved by editing O. alta homologs of the genes controlling these traits in diploid rice. Our results demonstrate the possibility that de novo domesticated allotetraploid rice can be developed into a new staple cereal to strengthen world food security.


Subject(s)
Crops, Agricultural/genetics , Domestication , Oryza/genetics , CRISPR-Cas Systems , Food Security , Gene Editing , Genetic Variation , Genome, Plant , Oryza/classification , Polyploidy
2.
BMC Biol ; 22(1): 13, 2024 Jan 25.
Article in English | MEDLINE | ID: mdl-38273258

ABSTRACT

BACKGROUND: Single-nucleotide polymorphisms (SNPs) are the most widely used form of molecular genetic variation studies. As reference genomes and resequencing data sets expand exponentially, tools must be in place to call SNPs at a similar pace. The genome analysis toolkit (GATK) is one of the most widely used SNP calling software tools publicly available, but unfortunately, high-performance computing versions of this tool have yet to become widely available and affordable. RESULTS: Here we report an open-source high-performance computing genome variant calling workflow (HPC-GVCW) for GATK that can run on multiple computing platforms from supercomputers to desktop machines. We benchmarked HPC-GVCW on multiple crop species for performance and accuracy with comparable results with previously published reports (using GATK alone). Finally, we used HPC-GVCW in production mode to call SNPs on a "subpopulation aware" 16-genome rice reference panel with ~3000 resequenced rice accessions. The entire process took ~16 weeks and resulted in the identification of an average of 27.3 M SNPs/genome and the discovery of ~2.3 million novel SNPs that were not present in the flagship reference genome for rice (i.e., IRGSP RefSeq). CONCLUSIONS: This study developed an open-source pipeline (HPC-GVCW) to run GATK on HPC platforms, which significantly improved the speed at which SNPs can be called. The workflow is widely applicable as demonstrated successfully for four major crop species with genomes ranging in size from 400 Mb to 2.4 Gb. Using HPC-GVCW in production mode to call SNPs on a 25 multi-crop-reference genome data set produced over 1.1 billion SNPs that were publicly released for functional and breeding studies. For rice, many novel SNPs were identified and were found to reside within genes and open chromatin regions that are predicted to have functional consequences. Combined, our results demonstrate the usefulness of combining a high-performance SNP calling architecture solution with a subpopulation-aware reference genome panel for rapid SNP discovery and public deployment.


Subject(s)
Genome, Plant , Polymorphism, Single Nucleotide , Workflow , Plant Breeding , Software , High-Throughput Nucleotide Sequencing/methods
3.
Plant Physiol ; 193(4): 2381-2397, 2023 Nov 22.
Article in English | MEDLINE | ID: mdl-37665979

ABSTRACT

Developing drought-resistant rice (Oryza sativa, L.) is essential for improving field productivity, especially in rain-fed areas affected by climate change. Wild relatives of rice are potential sources for drought-resistant traits. Therefore, we compared root growth and drought response among 22 wild Oryza species, from which Oryza glumaepatula was selected as a promising source for further exploration. A geographically diverse panel of 69 O. glumaepatula accessions was then screened for drought stress-related traits, and 6 of these accessions showed lower shoot dry weight (SDW) reduction, greater percentage of deep roots, and lower stomatal density (STO) under drought than the drought tolerant O. sativa variety, Sahbhagi dhan. Based on whole-genome resequencing of all 69 O. glumaepatula accessions and variant calling to a high-quality O. glumaepatula reference genome, we detected multiple genomic loci colocating for SDW, root dry weight at 30 to 45 cm depth, and STO in consecutive drought trials. Geo-referencing indicated that the potential drought donors originated in flood-prone locations, corroborating previous hypotheses about the coexistence of flood and drought tolerance within individual Oryza genomes. These findings present potential donor accessions, traits, and genomic loci from an AA genome wild relative of rice that, together with the recently developed reference genome, may be useful for further introgression of drought tolerance into the O. sativa backgrounds.


Subject(s)
Oryza , Oryza/genetics , Drought Resistance , Phenotype , Genome, Plant/genetics , Droughts
4.
Nat Rev Genet ; 19(8): 505-517, 2018 08.
Article in English | MEDLINE | ID: mdl-29872215

ABSTRACT

Rice is a staple crop for half the world's population, which is expected to grow by 3 billion over the next 30 years. It is also a key model for studying the genomics of agroecosystems. This dual role places rice at the centre of an enormous challenge facing agriculture: how to leverage genomics to produce enough food to feed an expanding global population. Scientists worldwide are investigating the genetic variation among domesticated rice species and their wild relatives with the aim of identifying loci that can be exploited to breed a new generation of sustainable crops known as Green Super Rice.


Subject(s)
Crops, Agricultural/genetics , Genome, Plant , Oryza/genetics , Plants, Genetically Modified/genetics , Crops, Agricultural/growth & development , Genetic Variation , Oryza/growth & development , Plants, Genetically Modified/growth & development
5.
J Exp Bot ; 74(5): 1331-1342, 2023 03 13.
Article in English | MEDLINE | ID: mdl-36527431

ABSTRACT

The wild relatives of rice hold unexplored genetic diversity that can be employed to feed an estimated population of 10 billion by 2050. The Oryza Map Alignment Project (OMAP) initiated in 2003 has provided comprehensive genomic resources for comparative, evolutionary, and functional characterization of the wild relatives of rice, facilitating the cloning of >600 rice genes, including those for grain width (GW5) and submergence tolerance (SUB1A). Following in the footsteps of the original project, the goal of 'IOMAP: the Americas' is to investigate the present and historic genetic diversity of wild Oryza species endemic to the Americas through the sequencing of herbaria and in situ specimens. The generation of a large diversity panel describing past and current genetic status and potential erosion of genetic variation in the populations will provide useful knowledge for the conservation of the biodiversity in these species. The wild relatives of rice in the Americas present a wide range of resistance traits useful for crop improvement and neodomestication approaches. In the race against time for a sustainable food future, the neodomestication of the first cereal species recently accomplished in O. alta opens the door to the potential neodomestication of the other wild Oryza species in the Americas.


Subject(s)
Oryza , Oryza/genetics , Phenotype , Genomics , Edible Grain/genetics
6.
Genome Res ; 29(5): 870-880, 2019 05.
Article in English | MEDLINE | ID: mdl-30992303

ABSTRACT

Investigation of large structural variants (SVs) is a challenging yet important task in understanding trait differences in highly repetitive genomes. Combining different bioinformatic approaches for SV detection, we analyzed whole-genome sequencing data from 3000 rice genomes and identified 63 million individual SV calls that grouped into 1.5 million allelic variants. We found enrichment of long SVs in promoters and an excess of shorter variants in 5' UTRs. Across the rice genomes, we identified regions of high SV frequency enriched in stress response genes. We demonstrated how SVs may help in finding causative variants in genome-wide association analysis. These new insights into rice genome biology are valuable for understanding the effects SVs have on gene function, with the prospect of identifying novel agronomically important alleles that can be utilized to improve cultivated rice.


Subject(s)
Genetic Variation , Genome, Plant , Genomic Structural Variation , Genomics/methods , Oryza/genetics , Alleles , Chromosome Mapping , DNA Transposable Elements , Genome-Wide Association Study/methods , Phenotype , Sequence Analysis, DNA/methods , Stress, Physiological/genetics
7.
Int J Mol Sci ; 23(13), 2022 Jul 01.
Article in English | MEDLINE | ID: mdl-35806374

ABSTRACT

Alternative splicing (AS) is a ubiquitous phenomenon among eukaryotic intron-containing genes, which greatly contributes to transcriptome and proteome diversity. Here we performed the isoform sequencing (Iso-Seq) of soybean underground tissues inoculated and uninoculated with Rhizobium and obtained 200,681 full-length transcripts covering 26,183 gene loci. It was found that 80.78% of the multi-exon loci produced more than one splicing variant. Comprehensive analysis of these identified 7874 differentially splicing events with highly diverse splicing patterns during nodule development, especially in defense and transport-related processes. We further profiled genes with differential isoform usage and revealed that 2008 multi-isoform loci underwent stage-specific or simultaneous major isoform switches after Rhizobium inoculation, indicating that AS is a vital way to regulate nodule development. Moreover, we took the lead in identifying 1563 high-confidence long non-coding RNAs (lncRNAs) in soybean, and 157 of them are differentially expressed during nodule development. Therefore, our study uncovers the landscape of AS during the soybean-Rhizobium interaction and provides systematic transcriptomic data for future study of multiple novel directions in soybean.


Subject(s)
Alternative Splicing , RNA, Long Noncoding , Gene Expression Profiling , Protein Isoforms/genetics , RNA, Long Noncoding/genetics , RNA, Long Noncoding/metabolism , Glycine max/genetics , Glycine max/metabolism , Transcriptome
8.
Mol Biol Evol ; 37(5): 1362-1375, 2020 05 01.
Article in English | MEDLINE | ID: mdl-31960929

ABSTRACT

The factors that drive the rapid changes in abundance of tandem arrays of highly repetitive sequences, known as satellite DNA, are not well understood. Drosophila virilis has one of the highest relative amounts of simple satellites of any organism that has been studied, with an estimated >40% of its genome composed of a few related 7-bp satellites. Here, we use D. virilis as a model to understand technical biases affecting satellite sequencing and the evolutionary processes that drive satellite composition. By analyzing sequencing data from Illumina, PacBio, and Nanopore platforms, we identify platform-specific biases and suggest best practices for accurate characterization of satellites by sequencing. We use comparative genomics and cytogenetics to demonstrate that the highly abundant AAACTAC satellite family arose from a related satellite in the branch leading to the virilis phylad 4.5-11 Ma before exploding in abundance in some species of the clade. The most abundant satellite is conserved in sequence and location in the pericentromeric region but has diverged widely in abundance among species, whereas the satellites nearest the centromere are rapidly turning over in sequence composition. By analyzing multiple strains of D. virilis, we saw that the abundances of two centromere-proximal satellites are anticorrelated along a geographical gradient, which we suggest could be caused by ongoing conflicts at the centromere. In conclusion, we illuminate several key attributes of satellite evolutionary dynamics that we hypothesize to be driven by processes including selection, meiotic drive, and constraints on satellite sequence and abundance.


Subject(s)
Drosophila/genetics , Evolution, Molecular , Genome, Insect , Microsatellite Repeats , Animals , In Situ Hybridization, Fluorescence , Sequence Analysis, DNA
9.
Plant Biotechnol J ; 19(3): 602-614, 2021 03.
Article in English | MEDLINE | ID: mdl-33073461

ABSTRACT

Brassica juncea (AABB), commonly referred to as mustard, is a natural allopolyploid of two diploid species: B. rapa (AA) and B. nigra (BB). We report a highly contiguous genome assembly of an oleiferous type of B. juncea variety Varuna, an archetypical Indian gene pool line of mustard, with ~100× PacBio single-molecule real-time (SMRT) long reads providing contigs with an N50 value of >5 Mb. Contigs were corrected for the misassemblies and scaffolded with BioNano optical mapping. We also assembled a draft genome of B. nigra (BB) variety Sangam using Illumina short-read sequencing and Oxford Nanopore long reads and used it to validate the assembly of the B genome of B. juncea. Two different linkage maps of B. juncea, containing a large number of genotyping-by-sequencing markers, were developed and used to anchor scaffolds/contigs to the 18 linkage groups of the species. The resulting chromosome-scale assembly of B. juncea Varuna is a significant improvement over the previous draft assembly of B. juncea Tumida, a vegetable type of mustard. The assembled genome was characterized for transposons, centromeric repeats, gene content and gene block associations. In comparison to the A genome, the B genome contains a significantly higher content of LTR/Gypsy retrotransposons, distinct centromeric repeats and a large number of B. nigra specific gene clusters that break the gene collinearity between the A and the B genomes. The B. juncea Varuna assembly will be of major value to the breeding work on oleiferous types of mustard that are grown extensively in south Asia and elsewhere.


Subject(s)
Genome, Plant , Mustard Plant , Asia , Chromosome Mapping , Chromosomes , Genome, Plant/genetics , Mustard Plant/genetics , Plant Breeding
10.
Plant Biotechnol J ; 19(9): 1725-1742, 2021 09.
Article in English | MEDLINE | ID: mdl-33768699

ABSTRACT

Safflower (Carthamus tinctorius L.), a member of the Asteraceae, is a popular crop due to its high linoleic acid (LA) and flavonoid (such as hydroxysafflor yellow A) contents. Here, we report the first high-quality genome assembly (contig N50 of 21.23 Mb) for the 12 pseudochromosomes of safflower using single-molecule real-time sequencing, Hi-C mapping technologies and a genetic linkage map. Phylogenomic analysis showed that safflower diverged from artichoke (Cynara cardunculus) and sunflower (Helianthus annuus) approximately 30.7 and 60.5 million years ago, respectively. Comparative genomic analyses revealed that uniquely expanded gene families in safflower were enriched for those predicted to be involved in lipid metabolism and transport and abscisic acid signalling. Notably, the fatty acid desaturase 2 (FAD2) and chalcone synthase (CHS) families, which function in the LA and flavonoid biosynthesis pathways, respectively, were expanded via tandem duplications in safflower. CarFAD2-12 was specifically expressed in seeds and was vital for high-LA content in seeds, while tandemly duplicated CarFAD2 genes were up-regulated in ovaries compared to CarFAD2-12, which indicates regulatory divergence of FAD2 in seeds and ovaries. CarCHS1, CarCHS4 and tandem-duplicated CarCHS5~CarCHS6, which were up-regulated compared to other CarCHS members at early stages, contribute to the accumulation of major flavonoids in flowers. In addition, our data reveal multiple alternative splicing events in gene families related to fatty acid and flavonoid biosynthesis. Together, these results provide a high-quality reference genome and evolutionary insights into the molecular basis of fatty acid and flavonoid biosynthesis in safflower.


Subject(s)
Carthamus tinctorius , Carthamus tinctorius/genetics , Chromosomes , Flavonoids , Linoleic Acid , Seeds/genetics
11.
Brief Bioinform ; 20(2): 565-571, 2019 03 25.
Article in English | MEDLINE | ID: mdl-29659709

ABSTRACT

Improving productivity of the staple crops wheat and rice is essential to feed the growing global population, particularly in the context of a changing climate. However, current rates of yield gain are insufficient to support the predicted population growth. New approaches are required to accelerate the breeding process, and many of these are driven by the application of large-scale crop data. To leverage the substantial volumes and types of data that can be applied for precision breeding, the wheat and rice research communities are working towards the development of integrated systems to access and standardize the dispersed, heterogeneous available data. Here, we outline the initiatives of the International Wheat Information System (WheatIS) and the International Rice Informatics Consortium (IRIC) to establish Web-based single-access systems and data mining tools to make the available resources more accessible, drive discovery and accelerate the production of new crop varieties. We discuss the progress of WheatIS and IRIC towards unifying specialized wheat and rice databases and building custom software platforms to manage and interrogate these data. Single-access crop information systems will strengthen scientific collaboration, optimize the use of public research funds and help achieve the required yield gains in the two most important global food crops.


Subject(s)
Crops, Agricultural/growth & development , Information Systems , Oryza/growth & development , Triticum/growth & development
12.
New Phytol ; 229(5): 2970-2983, 2021 03.
Article in English | MEDLINE | ID: mdl-33111313

ABSTRACT

In grasses, two types of phased, small interfering RNAs (phasiRNAs) are expressed largely in young, developing anthers. They are 21 or 24 nucleotides (nt) in length and are triggered by miR2118 or miR2275, respectively. However, most of their functions and activities are not fully understood. We performed comparative genomic analysis of their source loci (PHAS) in five Oryza genomes and combined this with analysis of high-throughput sRNA and degradome datasets. In total, we identified 8216 21-PHAS and 626 24-PHAS loci. Local tandem and segmental duplications mainly contributed to the expansion and supercluster distribution of the 21-PHAS loci. Despite their relatively conserved genomic positions, PHAS sequences diverged rapidly, except for the miR2118/2275 target sites, which were under strong selection for conservation. We found that 21-nt phasiRNAs with a 5'-terminal uridine (U) demonstrated cis-cleavage at PHAS precursors, and these cis-acting sites were also variable among close species. miR2118 could trigger phasiRNA production from its own antisense transcript and the derived phasiRNAs might reversibly regulate miR2118 precursors. We hypothesised that successful initiation of phasiRNA biogenesis is conservatively maintained, while phasiRNA products diverged quickly and are not individually conserved. In particular, phasiRNA production is under the control of multiple reciprocal regulation mechanisms.


Subject(s)
MicroRNAs , Oryza , Gene Expression Regulation, Plant , MicroRNAs/genetics , Oryza/genetics , Poaceae/genetics , RNA, Plant/genetics , RNA, Small Interfering/genetics
13.
Plant Cell ; 30(8): 1729-1744, 2018 08.
Article in English | MEDLINE | ID: mdl-29967288

ABSTRACT

Centromeres are dynamic chromosomal regions, and the genetic and epigenetic environment of the centromere is often regarded as oppressive to protein-coding genes. Here, we used comparative genomic and phylogenomic approaches to study the evolution of centromeres and centromere-linked genes in the genus Oryza. We report a 12.4-Mb high-quality BAC-based pericentromeric assembly for Oryza brachyantha, which diverged from cultivated rice (Oryza sativa) ∼15 million years ago. The synteny analyses reveal seven medium (>50 kb) pericentric inversions in O. sativa and 10 in O. brachyantha. Of these inversions, three resulted in centromere movement (Chr1, Chr7, and Chr9). Additionally, we identified a potential centromere-repositioning event, in which the ancestral centromere on chromosome 12 in O. brachyantha jumped ∼400 kb away, possibly mediated by a duplicated transposition event (>28 kb). More strikingly, we observed an excess of syntenic gene loss at and near the centromeric regions (P < 2.2 × 10^-16). Most (33/47) of the missing genes moved to other genomic regions; therefore such excess could be explained by the selective loss of the copy in or near centromeric regions after gene duplication. The pattern of gene loss immediately adjacent to centromeric regions suggests centromere chromatin dynamics (e.g., spreading or microrepositioning) may drive such gene loss.


Subject(s)
Centromere/genetics , Oryza/genetics , Chromatin/genetics , Chromosomes, Plant/genetics , Gene Duplication/genetics , Genome, Plant/genetics
14.
Proc Natl Acad Sci U S A ; 114(45): 12003-12008, 2017 11 07.
Article in English | MEDLINE | ID: mdl-29078296

ABSTRACT

Few clades of plants have proven as difficult to classify as cacti. One explanation may be an unusually high level of convergent and parallel evolution (homoplasy). To evaluate support for this phylogenetic hypothesis at the molecular level, we sequenced the genomes of four cacti in the especially problematic tribe Pachycereeae, which contains most of the large columnar cacti of Mexico and adjacent areas, including the iconic saguaro cactus (Carnegiea gigantea) of the Sonoran Desert. We assembled a high-coverage draft genome for saguaro and lower coverage genomes for three other genera of tribe Pachycereeae (Pachycereus, Lophocereus, and Stenocereus) and a more distant outgroup cactus, Pereskia. We used these to construct 4,436 orthologous gene alignments. Species tree inference consistently returned the same phylogeny, but gene tree discordance was high: 37% of gene trees having at least 90% bootstrap support conflicted with the species tree. Evidently, discordance is a product of long generation times and moderately large effective population sizes, leading to extensive incomplete lineage sorting (ILS). In the best supported gene trees, 58% of apparent homoplasy at amino sites in the species tree is due to gene tree-species tree discordance rather than parallel substitutions in the gene trees themselves, a phenomenon termed "hemiplasy." The high rate of genomic hemiplasy may contribute to apparent parallelisms in phenotypic traits, which could confound understanding of species relationships and character evolution in cacti.


Subject(s)
Cactaceae/genetics , Genome, Plant/genetics , Base Sequence , Evolution, Molecular , Genomics/methods , Mexico , Models, Genetic , North America , Phylogeny
15.
BMC Genomics ; 20(1): 905, 2019 Nov 27.
Article in English | MEDLINE | ID: mdl-31775618

ABSTRACT

BACKGROUND: The availability of thousands of complete rice genome sequences from diverse varieties and accessions has laid the foundation for in-depth exploration of the rice genome. One drawback to these collections is that most of these rice varieties have long life cycles, and/or low transformation efficiencies, which limits their usefulness as model organisms for functional genomics studies. In contrast, the rice variety Kitaake has a rapid life cycle (9 weeks seed to seed) and is easy to transform and propagate. For these reasons, Kitaake has emerged as a model for studies of diverse monocotyledonous species. RESULTS: Here, we report the de novo genome sequencing and analysis of Oryza sativa ssp. japonica variety KitaakeX, a Kitaake plant carrying the rice XA21 immune receptor. Our KitaakeX sequence assembly contains 377.6 Mb, consisting of 33 scaffolds (476 contigs) with a contig N50 of 1.4 Mb. Complementing the assembly are detailed gene annotations of 35,594 protein coding genes. We identified 331,335 genomic variations between KitaakeX and Nipponbare (ssp. japonica), and 2,785,991 variations between KitaakeX and Zhenshan97 (ssp. indica). We also compared Kitaake resequencing reads to the KitaakeX assembly and identified 219 small variations. The high-quality genome of the model rice plant KitaakeX will accelerate rice functional genomics. CONCLUSIONS: The high quality, de novo assembly of the KitaakeX genome will serve as a useful reference genome for rice and will accelerate functional genomics studies of rice and other species.


Subject(s)
Genome, Plant , Genomics , Oryza/genetics , Whole Genome Sequencing , Computational Biology/methods , Genetic Variation , Genomics/methods , Molecular Sequence Annotation , Oryza/classification , Phenotype
16.
Nature ; 500(7462): 335-9, 2013 Aug 15.
Article in English | MEDLINE | ID: mdl-23883927

ABSTRACT

Oil palm is the most productive oil-bearing crop. Although it is planted on only 5% of the total world vegetable oil acreage, palm oil accounts for 33% of vegetable oil and 45% of edible oil worldwide, but increased cultivation competes with dwindling rainforest reserves. We report the 1.8-gigabase (Gb) genome sequence of the African oil palm Elaeis guineensis, the predominant source of worldwide oil production. A total of 1.535 Gb of assembled sequence and transcriptome data from 30 tissue types were used to predict at least 34,802 genes, including oil biosynthesis genes and homologues of WRINKLED1 (WRI1), and other transcriptional regulators, which are highly expressed in the kernel. We also report the draft sequence of the South American oil palm Elaeis oleifera, which has the same number of chromosomes (2n = 32) and produces fertile interspecific hybrids with E. guineensis but seems to have diverged in the New World. Segmental duplications of chromosome arms define the palaeotetraploid origin of palm trees. The oil palm sequence enables the discovery of genes for important traits as well as somaclonal epigenetic alterations that restrict the use of clones in commercial plantings, and should therefore help to achieve sustainability for biofuels and edible oils, reducing the rainforest footprint of this tropical plantation crop.


Subject(s)
Arecaceae/classification , Arecaceae/genetics , Genome, Plant/genetics , Phylogeny , Carbohydrate Metabolism/genetics , Chromosomes, Plant/genetics , Lipid Metabolism/genetics , Models, Genetic , Molecular Sequence Data
17.
Nucleic Acids Res ; 45(D1): D1075-D1081, 2017 01 04.
Article in English | MEDLINE | ID: mdl-27899667

ABSTRACT

We describe updates to the Rice SNP-Seek Database since its first release. We ran a new SNP-calling pipeline followed by filtering that resulted in complete, base, filtered and core SNP datasets. Besides the Nipponbare reference genome, the pipeline was run on genome assemblies of IR 64, 93-11, DJ 123 and Kasalath. New genotype query and display features are added for reference assemblies, SNP datasets and indels. JBrowse now displays BAM, VCF and other annotation tracks, the additional genome assemblies and an embedded VISTA genome comparison viewer. Middleware is redesigned for improved performance by using a hybrid of HDF5 and RDMS for genotype storage. Query modules for genotypes, varieties and genes are improved to handle various constraints. An integrated list manager allows the user to pass query parameters for further analysis. The SNP Annotator adds traits, ontology terms, effects and interactions to markers in a list. Web-service calls were implemented to access most data. These features enable seamless querying of SNP-Seek across various biological entities, a step toward semi-automated gene-trait association discovery. URL: http://snp-seek.irri.org.


Subject(s)
Databases, Nucleic Acid , Genome, Plant , INDEL Mutation , Oryza/genetics , Polymorphism, Single Nucleotide , Search Engine , Software , Alleles , Computational Biology/methods , Gene Frequency , Genetic Loci , Genomics/methods , Genotype , User-Computer Interface , Web Browser
18.
Proc Natl Acad Sci U S A ; 113(35): E5163-71, 2016 08 30.
Article in English | MEDLINE | ID: mdl-27535938

ABSTRACT

Asian cultivated rice consists of two subspecies: Oryza sativa subsp. indica and O. sativa subsp. japonica. Despite the fact that indica rice accounts for over 70% of total rice production worldwide and is genetically much more diverse, a high-quality reference genome for indica rice has yet to be published. We conducted map-based sequencing of two indica rice lines, Zhenshan 97 (ZS97) and Minghui 63 (MH63), which represent the two major varietal groups of the indica subspecies and are the parents of an elite Chinese hybrid. The genome sequences were assembled into 237 (ZS97) and 181 (MH63) contigs, with an accuracy >99.99%, and covered 90.6% and 93.2% of their estimated genome sizes. Comparative analyses of these two indica genomes uncovered surprising structural differences, especially with respect to inversions, translocations, presence/absence variations, and segmental duplications. Approximately 42% of nontransposable element related genes were identical between the two genomes. Transcriptome analysis of three tissues showed that 1,059-2,217 more genes were expressed in the hybrid than in the parents and that the expressed genes in the hybrid were much more diverse due to their divergence between the parental genomes. The public availability of two high-quality reference genomes for the indica subspecies of rice will have large-ranging implications for plant biology and crop genetic improvement.


Subject(s)
Chromosomes, Plant/genetics , Genetic Variation , Genome, Plant/genetics , Oryza/genetics , Chromosome Mapping/methods , Gene Expression Profiling , Gene Expression Regulation, Plant , Genes, Plant/genetics , INDEL Mutation , Oryza/classification , Polymorphism, Single Nucleotide , Species Specificity
19.
Mol Biol Evol ; 34(4): 969-979, 2017 04 01.
Article in English | MEDLINE | ID: mdl-28087768

ABSTRACT

The origin of domesticated Asian rice (Oryza sativa) has been a contentious topic, with conflicting evidence for either single or multiple domestication of this key crop species. We examined the evolutionary history of domesticated rice by analyzing de novo assembled genomes from domesticated rice and its wild progenitors. Our results indicate multiple origins, where each domesticated rice subpopulation (japonica, indica, and aus) arose separately from progenitor O. rufipogon and/or O. nivara. Coalescence-based modeling of demographic parameters estimates that the first domesticated rice population to split off from O. rufipogon was O. sativa ssp. japonica, occurring at ∼13.1-24.1 ka, which is an order of magnitude older than the earliest archeological date of domestication. This date is consistent, however, with the expansion of O. rufipogon populations after the Last Glacial Maximum ∼18 ka and archeological evidence for early wild rice management in China. We also show that there is significant gene flow from japonica to both indica (∼17%) and aus (∼15%), which led to the transfer of domestication alleles from early-domesticated japonica to proto-indica and proto-aus populations. Our results provide support for a model in which different rice subspecies had separate origins, but that de novo domestication occurred only once, in O. sativa ssp. japonica, and introgressive hybridization from early japonica to proto-indica and proto-aus led to domesticated indica and aus rice.


Subject(s)
Adaptation, Biological/genetics , Gene Flow/genetics , Oryza/genetics , Alleles , Biological Evolution , Crops, Agricultural/genetics , Domestication , Evolution, Molecular , Genes, Plant/genetics , Genetic Speciation , Genetic Variation/genetics , Oryza/metabolism , Phylogeny , Sequence Alignment/methods , Sequence Analysis, DNA/methods
20.
Plant Biotechnol J ; 16(11): 1904-1917, 2018 11.
Article in English | MEDLINE | ID: mdl-29604169

ABSTRACT

Panax ginseng C. A. Meyer, reputed as the king of medicinal herbs, has slow growth, long generation time, low seed production and complicated genome structure that hamper its study. Here, we unveil the genomic architecture of tetraploid P. ginseng by de novo genome assembly, representing 2.98 Gbp with 59 352 annotated genes. Resequencing data indicated that diploid Panax species diverged in association with global warming in Southern Asia, and two North American species evolved via two intercontinental migrations. Two whole genome duplications (WGD) occurred in the family Araliaceae (including Panax) after divergence with the Apiaceae, the more recent one contributing to the ability of P. ginseng to overwinter, enabling it to spread broadly through the Northern Hemisphere. Functional and evolutionary analyses suggest that production of pharmacologically important dammarane-type ginsenosides originated in Panax and are produced largely in shoot tissues and transported to roots; that newly evolved P. ginseng fatty acid desaturases increase freezing tolerance; and that unprecedented retention of chlorophyll a/b binding protein genes enables efficient photosynthesis under low light. A genome-scale metabolic network provides a holistic view of Panax ginsenoside biosynthesis. This study provides valuable resources for improving medicinal values of ginseng either through genomics-assisted breeding or metabolic engineering.


Subject(s)
Genome, Plant/genetics , Panax/genetics , Adaptation, Biological/genetics , Biological Evolution , Diploidy , Genes, Chloroplast/genetics , Genes, Plant/genetics , Ginsenosides/biosynthesis , Panax/metabolism , Tetraploidy