Results 1 - 20 of 43
1.
Nature ; 588(7837): 277-283, 2020 12.
Article in English | MEDLINE | ID: mdl-33239791

ABSTRACT

Advances in genomics have expedited the improvement of several agriculturally important crops but similar efforts in wheat (Triticum spp.) have been more challenging. This is largely owing to the size and complexity of the wheat genome [1], and the lack of genome-assembly data for multiple wheat lines [2,3]. Here we generated ten chromosome pseudomolecule and five scaffold assemblies of hexaploid wheat to explore the genomic diversity among wheat lines from global breeding programs. Comparative analysis revealed extensive structural rearrangements, introgressions from wild relatives and differences in gene content resulting from complex breeding histories aimed at improving adaptation to diverse environments, grain yield and quality, and resistance to stresses [4,5]. We provide examples outlining the utility of these genomes, including a detailed multi-genome-derived nucleotide-binding leucine-rich repeat protein repertoire involved in disease resistance and the characterization of Sm1 [6], a gene associated with insect resistance. These genome assemblies will provide a basis for functional gene discovery and breeding to deliver the next generation of modern wheat cultivars.


Subject(s)
Genetic Variation, Plant Genome/genetics, Genomics, Internationality, Plant Breeding/methods, Triticum/genetics, Acclimatization/genetics, Animals, Centromere/genetics, Centromere/metabolism, Chromosome Mapping, Molecular Cloning, DNA Copy Number Variations/genetics, DNA Transposable Elements/genetics, Edible Grain/genetics, Edible Grain/growth & development, Plant Genes/genetics, Genetic Introgression, Haplotypes, Insects/pathogenicity, NLR Proteins/genetics, Plant Diseases/genetics, Plant Proteins/genetics, Single Nucleotide Polymorphism/genetics, Polyploidy, Triticum/classification, Triticum/growth & development
2.
Plant J ; 118(5): 1516-1527, 2024 Jun.
Article in English | MEDLINE | ID: mdl-38412295

ABSTRACT

Bacterial wilt, caused by Xanthomonas translucens pv. graminis (Xtg), is a serious disease of economically important forage grasses, including Italian ryegrass (Lolium multiflorum Lam.). A major QTL for resistance to Xtg was previously identified, but the precise location as well as the genetic factors underlying the resistance are yet to be determined. To this end, we applied a bulked segregant analysis (BSA) approach, using whole-genome deep sequencing of pools of the most resistant and most susceptible individuals of a large (n = 7484) biparental F2 population segregating for resistance to Xtg. Using chromosome-level genome assemblies as references, we were able to define a ~300 kb region highly associated with resistance on pseudo-chromosome 4. Further investigation of this region revealed multiple genes with a known role in disease resistance, including genes encoding for Pik2-like disease resistance proteins, cysteine-rich kinases, and RGA4- and RGA5-like disease resistance proteins. Investigation of allele frequencies in the pools and comparative genome analysis in the grandparents of the F2 population revealed that some of these genes contain variants with allele frequencies that correspond to the expected heterozygosity in the resistant grandparent. This study emphasizes the efficacy of combining BSA studies in very large populations with whole genome deep sequencing and high-quality genome assemblies to pinpoint regions associated with a binary trait of interest and accurately define a small set of candidate genes. Furthermore, markers identified in this region hold significant potential for marker-assisted breeding strategies to breed resistance to Xtg in Italian ryegrass cultivars more efficiently.
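The core of a bulked segregant analysis like the one described above is a per-site comparison of allele frequencies between the two pools, smoothed along the chromosome. The following is a minimal sketch of that idea, not the authors' pipeline: the `PoolSite` record, the function names, and the window parameters are all hypothetical, and real analyses add read-quality filters and significance thresholds.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class PoolSite:
    """Read counts at one variant site (hypothetical record layout)."""
    pos: int        # position on the pseudo-chromosome, in bp
    res_alt: int    # alternate-allele reads in the resistant pool
    res_depth: int  # total reads in the resistant pool
    sus_alt: int    # alternate-allele reads in the susceptible pool
    sus_depth: int  # total reads in the susceptible pool


def delta_allele_freq(site: PoolSite) -> float:
    """Alternate-allele frequency in the resistant pool minus the susceptible pool."""
    return site.res_alt / site.res_depth - site.sus_alt / site.sus_depth


def windowed_delta(sites: List[PoolSite], window: int, step: int) -> List[Tuple[int, float]]:
    """Mean delta allele frequency per window; windows without sites are skipped."""
    # Sites with zero coverage in either pool carry no frequency information.
    usable = [s for s in sites if s.res_depth > 0 and s.sus_depth > 0]
    if not usable:
        return []
    last = max(s.pos for s in usable)
    out: List[Tuple[int, float]] = []
    start = 0
    while start <= last:
        vals = [delta_allele_freq(s) for s in usable if start <= s.pos < start + window]
        if vals:
            out.append((start, sum(vals) / len(vals)))
        start += step
    return out
```

Windows whose mean difference departs strongly from zero are the candidates for a region associated with the trait; the window size trades resolution against noise.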


Subject(s)
Disease Resistance, Lolium, Plant Diseases, Xanthomonas, Lolium/genetics, Lolium/microbiology, Disease Resistance/genetics, Plant Diseases/microbiology, Plant Diseases/genetics, Plant Diseases/immunology, Xanthomonas/physiology, Quantitative Trait Loci/genetics, Plant Genes/genetics, Chromosome Mapping
3.
Mol Biol Evol ; 40(1)2023 01 04.
Article in English | MEDLINE | ID: mdl-36477354

ABSTRACT

Self-incompatibility (SI) is a genetic mechanism of hermaphroditic plants to prevent inbreeding after self-pollination. Allogamous Poaceae species exhibit a unique gametophytic SI system controlled by two multi-allelic and independent loci, S and Z. Despite intense research efforts in the last decades, the genes that determine the initial recognition mechanism are yet to be identified. Here, we report the fine-mapping of the Z-locus in perennial ryegrass (Lolium perenne L.) and provide evidence that the pollen and stigma components are determined by two genes encoding DUF247 domain proteins (ZDUF247-I and ZDUF247-II) and the gene sZ, respectively. The pollen and stigma determinants are located side-by-side and were genetically linked in 10,245 individuals of two independent mapping populations segregating for Z. Moreover, they exhibited high allelic diversity as well as tissue-specific gene expression, matching the expected characteristics of SI determinants known from other systems. Revisiting the S-locus using the latest high-quality whole-genome assemblies revealed a similar gene composition and structure as found for Z, supporting the hypothesis of a duplicated origin of the two-locus SI system of grasses. Ultimately, comparative genomic analyses across a wide range of self-compatible and self-incompatible Poaceae species revealed that the absence of a functional copy of at least one of the six putative SI determinants is accompanied by a self-compatible phenotype. Our study provides new insights into the origin and evolution of the unique gametophytic SI system in one of the largest and economically most important plant families.


Subject(s)
Lolium, Poaceae, Poaceae/genetics, Lolium/genetics, Pollen/genetics, Plants, Genomics
4.
Nature ; 557(7703): 43-49, 2018 05.
Article in English | MEDLINE | ID: mdl-29695866

ABSTRACT

Here we analyse genetic variation, population structure and diversity among 3,010 diverse Asian cultivated rice (Oryza sativa L.) genomes from the 3,000 Rice Genomes Project. Our results are consistent with the five major groups previously recognized, but also suggest several unreported subpopulations that correlate with geographic location. We identified 29 million single nucleotide polymorphisms, 2.4 million small indels and over 90,000 structural variations that contribute to within- and between-population variation. Using pan-genome analyses, we identified more than 10,000 novel full-length protein-coding genes and a high number of presence-absence variations. The complex patterns of introgression observed in domestication genes are consistent with multiple independent rice domestication events. The public availability of data from the 3,000 Rice Genomes Project provides a resource for rice genomics research and breeding.


Subject(s)
Agricultural Crops/classification, Agricultural Crops/genetics, Genetic Variation, Plant Genome/genetics, Oryza/classification, Oryza/genetics, Asia, Molecular Evolution, Plant Genes/genetics, Population Genetics, Genomics, Haplotypes, INDEL Mutation/genetics, Phylogeny, Plant Breeding, Single Nucleotide Polymorphism/genetics
5.
Syst Biol ; 71(5): 1178-1194, 2022 08 10.
Article in English | MEDLINE | ID: mdl-35244183

ABSTRACT

Reconstructing accurate historical relationships within a species poses numerous challenges, not least in many plant groups in which gene flow is high enough to extend well beyond species boundaries. Nonetheless, the extent of tree-like history within a species is an empirical question on which it is now possible to bring large amounts of genome sequence to bear. We assess phylogenetic structure across the geographic range of the saguaro cactus, an emblematic member of Cactaceae, a clade known for extensive hybridization and porous species boundaries. Using 200 Gb of whole genome resequencing data from 20 individuals sampled from 10 localities, we assembled two data sets comprising 150,000 biallelic single nucleotide polymorphisms (SNPs) from protein coding sequences. From these, we inferred within-species trees and evaluated their significance and robustness using five qualitatively different inference methods. Despite the low sequence diversity, large census population sizes, and presence of wide-ranging pollen and seed dispersal agents, phylogenetic trees were well resolved and highly consistent across both data sets and all methods. We inferred that the most likely root, based on marginal likelihood comparisons, is to the east and south of the region of highest genetic diversity, which lies along the coast of the Gulf of California in Sonora, Mexico. Together with striking decreases in marginal likelihood found to the north, this supports hypotheses that saguaro's current range reflects postglacial expansion from the refugia in the south of its range. We conclude with observations about practical and theoretical issues raised by phylogenomic data sets within species, in which SNP-based methods must be used rather than gene tree methods that are widely used when sequence divergence is higher. These include computational scalability, inference of gene flow, and proper assessment of statistical support in the presence of linkage effects. 
[Phylogenomics; phylogeography; rooting; Sonoran Desert.].


Subject(s)
Cactaceae, Cactaceae/genetics, Genetic Hybridization, Phylogeny, Phylogeography, DNA Sequence Analysis
6.
Plant J ; 107(4): 1166-1182, 2021 08.
Article in English | MEDLINE | ID: mdl-34152039

ABSTRACT

Allopolyploidization, entailing the merger of two distinct genomes in a single hybrid organism, is an important process in plant evolution and a valuable tool in breeding programs. Newly established hybrids often experience massive genomic perturbations, including karyotype reshuffling and gene expression modifications. These phenomena may be asymmetric with respect to the two progenitors, with one of the parental genomes being "dominant." Such "genome dominance" can manifest in several ways, including biased homoeolog gene expression and expression level dominance. Here we employed a k-mer-based approach to study gene expression in reciprocal Festuca pratensis Huds. × Lolium multiflorum Lam. allopolyploid grasses. Our study revealed significantly more genes where expression mimicked that of the Lolium parent compared with the Festuca parent. This genome dominance was heritable to successive generations and its direction was only slightly modified by environmental conditions and plant age. Our results suggest that Lolium genome dominance was at least partially caused by its more efficient trans-acting gene expression regulatory factors. Unraveling the mechanisms responsible for propagation of parent-specific traits in hybrid crops contributes to our understanding of allopolyploid genome evolution and opens a way to targeted breeding strategies.


Subject(s)
Festuca/genetics, Plant Gene Expression Regulation, Plant Genome, Lolium/genetics, Polyploidy, Agricultural Crops, Genetic Databases, Festuca/growth & development, Gene Expression Profiling, Lolium/growth & development, Nucleic Acid Regulatory Sequences, RNA Sequence Analysis
7.
Plant J ; 101(3): 529-542, 2020 02.
Article in English | MEDLINE | ID: mdl-31571285

ABSTRACT

A wild grape haplotype (Rpv3-1) confers resistance to Plasmopara viticola. We mapped the causal factor for resistance to an interval containing a TIR-NB-LRR (TNL) gene pair that originated 1.6-2.6 million years ago by a tandem segmental duplication. Transient coexpression of the TNL pair in Vitis vinifera leaves activated pathogen-induced necrosis and reduced sporulation compared with control leaves. Even though transcripts of the TNL pair from the wild haplotype appear to be partially subject to nonsense-mediated mRNA decay, mature mRNA levels in a homozygous resistant genotype were individually higher than the mRNA trace levels observed for the orthologous single-copy TNL in sensitive genotypes. Allelic expression imbalance in a resistant heterozygote confirmed that cis-acting regulatory variation promotes expression in the wild haplotype. The movement of transposable elements had a major impact on the generation of haplotype diversity, altering the DNA context around similar TNL coding sequences and the GC-content in their proximal 5'-intergenic regions. The wild and domesticated haplotypes also diverged in conserved single-copy intergenic DNA, but the highest divergence was observed in intraspecific and not in interspecific comparisons. In this case, introgression breeding did not transgress the genetic boundaries of the domesticated species, because haplotypes present in modern varieties sometimes predate speciation events between wild and cultivated species.


Subject(s)
Gene Duplication, Interspersed Repetitive Sequences/genetics, Oomycetes/physiology, Plant Diseases/immunology, Plant Proteins/metabolism, Vitis/genetics, Alleles, Breeding, Disease Resistance/genetics, Genotype, Haplotypes, Plant Diseases/parasitology, Plant Leaves/genetics, Plant Leaves/immunology, Plant Leaves/parasitology, Plant Proteins/genetics, Vitis/immunology, Vitis/parasitology
8.
Plant Cell Physiol ; 62(1): 8-27, 2021 Mar 25.
Article in English | MEDLINE | ID: mdl-33244607

ABSTRACT

Bread wheat is a major crop that has long been the focus of basic and breeding research. Assembly of its genome has been difficult because of its large size and allohexaploid nature (AABBDD genome). Following the first reported assembly of the genome of the experimental strain Chinese Spring (CS), the 10+ Wheat Genomes Project was launched to produce multiple assemblies of worldwide modern cultivars. The only Asian cultivar in the project is Norin 61, a representative Japanese cultivar adapted to grow across a broad latitudinal range, mostly characterized by a wet climate and a short growing season. Here, we characterize the key aspects of its chromosome-scale genome assembly spanning 15 Gb with a raw scaffold N50 of 22 Mb. Analysis of the repetitive elements identified chromosomal regions unique to Norin 61 that encompass a tandem array of the pathogenesis-related 13 family. We report novel copy-number variations in the B homeolog of the florigen gene FT1/VRN3, pseudogenization of its D homeolog and the association of its A homeologous alleles with the spring/winter growth habit. Furthermore, the Norin 61 genome carries typical East Asian functional variants different from CS, ranging from a single nucleotide to multi-Mb scale. Examples of such variation are the Fhb1 locus, which confers Fusarium head-blight resistance, Ppd-D1a, which confers early flowering, Glu-D1f for Asian noodle quality and Rht-D1b, which introduced semi-dwarfism during the green revolution. The adoption of Norin 61 as a reference assembly for functional and evolutionary studies will enable comprehensive characterization of the underexploited Asian bread wheat diversity.


Subject(s)
Disease Resistance/genetics, Flowers/growth & development, Plant Genes/genetics, Plant Genome/genetics, Triticum/genetics, Chromosome Mapping, Plant Chromosomes/genetics, Cytogenetics, Eastern Asia, Flowers/genetics, Fusarium, Plant Genes/physiology, Genetic Association Studies, Genetic Variation/genetics, Genetic Variation/physiology, Plant Genome/physiology, Genotype, Phylogeny, Sequence Alignment, DNA Sequence Analysis, Triticum/growth & development, Triticum/immunology, Triticum/physiology
9.
Plant Biotechnol J ; 19(3): 602-614, 2021 03.
Article in English | MEDLINE | ID: mdl-33073461

ABSTRACT

Brassica juncea (AABB), commonly referred to as mustard, is a natural allopolyploid of two diploid species, B. rapa (AA) and B. nigra (BB). We report a highly contiguous genome assembly of an oleiferous type of B. juncea variety Varuna, an archetypical Indian gene pool line of mustard, with ~100× PacBio single-molecule real-time (SMRT) long reads providing contigs with an N50 value of >5 Mb. Contigs were corrected for misassemblies and scaffolded with BioNano optical mapping. We also assembled a draft genome of B. nigra (BB) variety Sangam using Illumina short-read sequencing and Oxford Nanopore long reads and used it to validate the assembly of the B genome of B. juncea. Two different linkage maps of B. juncea, containing a large number of genotyping-by-sequencing markers, were developed and used to anchor scaffolds/contigs to the 18 linkage groups of the species. The resulting chromosome-scale assembly of B. juncea Varuna is a significant improvement over the previous draft assembly of B. juncea Tumida, a vegetable type of mustard. The assembled genome was characterized for transposons, centromeric repeats, gene content and gene block associations. In comparison to the A genome, the B genome contains a significantly higher content of LTR/Gypsy retrotransposons, distinct centromeric repeats and a large number of B. nigra specific gene clusters that break the gene collinearity between the A and the B genomes. The B. juncea Varuna assembly will be of major value to the breeding work on oleiferous types of mustard that are grown extensively in south Asia and elsewhere.


Subject(s)
Plant Genome, Mustard Plant, Asia, Chromosome Mapping, Chromosomes, Plant Genome/genetics, Mustard Plant/genetics, Plant Breeding
10.
New Phytol ; 227(3): 914-929, 2020 08.
Article in English | MEDLINE | ID: mdl-31369159

ABSTRACT

The evolution of L-DOPA 4,5-dioxygenase activity, encoded by the gene DODA, was a key step in the origin of betalain biosynthesis in Caryophyllales. We previously proposed that L-DOPA 4,5-dioxygenase activity evolved via a single Caryophyllales-specific neofunctionalisation event within the DODA gene lineage. However, this neofunctionalisation event has not been confirmed and the DODA gene lineage exhibits numerous gene duplication events, whose evolutionary significance is unclear. To address this, we functionally characterised 23 distinct DODA proteins for L-DOPA 4,5-dioxygenase activity, from four betalain-pigmented and five anthocyanin-pigmented species, representing key evolutionary transitions across Caryophyllales. By mapping these functional data to an updated DODA phylogeny, we then explored the evolution of L-DOPA 4,5-dioxygenase activity. We find that low L-DOPA 4,5-dioxygenase activity is distributed across the DODA gene lineage. In this context, repeated gene duplication events within the DODA gene lineage give rise to polyphyletic occurrences of elevated L-DOPA 4,5-dioxygenase activity, accompanied by convergent shifts in key functional residues and distinct genomic patterns of micro-synteny. In the context of an updated organismal phylogeny and newly inferred pigment reconstructions, we argue that repeated convergent acquisition of elevated L-DOPA 4,5-dioxygenase activity is consistent with recurrent specialisation to betalain synthesis in Caryophyllales.


Subject(s)
Caryophyllales, Dioxygenases, Betalains, Dioxygenases/genetics, Levodopa, Phylogeny, Pigmentation
11.
Proc Natl Acad Sci U S A ; 114(45): 12003-12008, 2017 11 07.
Article in English | MEDLINE | ID: mdl-29078296

ABSTRACT

Few clades of plants have proven as difficult to classify as cacti. One explanation may be an unusually high level of convergent and parallel evolution (homoplasy). To evaluate support for this phylogenetic hypothesis at the molecular level, we sequenced the genomes of four cacti in the especially problematic tribe Pachycereeae, which contains most of the large columnar cacti of Mexico and adjacent areas, including the iconic saguaro cactus (Carnegiea gigantea) of the Sonoran Desert. We assembled a high-coverage draft genome for saguaro and lower coverage genomes for three other genera of tribe Pachycereeae (Pachycereus, Lophocereus, and Stenocereus) and a more distant outgroup cactus, Pereskia. We used these to construct 4,436 orthologous gene alignments. Species tree inference consistently returned the same phylogeny, but gene tree discordance was high: 37% of gene trees having at least 90% bootstrap support conflicted with the species tree. Evidently, discordance is a product of long generation times and moderately large effective population sizes, leading to extensive incomplete lineage sorting (ILS). In the best supported gene trees, 58% of apparent homoplasy at amino sites in the species tree is due to gene tree-species tree discordance rather than parallel substitutions in the gene trees themselves, a phenomenon termed "hemiplasy." The high rate of genomic hemiplasy may contribute to apparent parallelisms in phenotypic traits, which could confound understanding of species relationships and character evolution in cacti.


Subject(s)
Cactaceae/genetics, Plant Genome/genetics, Base Sequence, Molecular Evolution, Genomics/methods, Mexico, Genetic Models, North America, Phylogeny
12.
BMC Genomics ; 20(1): 905, 2019 Nov 27.
Article in English | MEDLINE | ID: mdl-31775618

ABSTRACT

BACKGROUND: The availability of thousands of complete rice genome sequences from diverse varieties and accessions has laid the foundation for in-depth exploration of the rice genome. One drawback to these collections is that most of these rice varieties have long life cycles, and/or low transformation efficiencies, which limits their usefulness as model organisms for functional genomics studies. In contrast, the rice variety Kitaake has a rapid life cycle (9 weeks seed to seed) and is easy to transform and propagate. For these reasons, Kitaake has emerged as a model for studies of diverse monocotyledonous species. RESULTS: Here, we report the de novo genome sequencing and analysis of Oryza sativa ssp. japonica variety KitaakeX, a Kitaake plant carrying the rice XA21 immune receptor. Our KitaakeX sequence assembly contains 377.6 Mb, consisting of 33 scaffolds (476 contigs) with a contig N50 of 1.4 Mb. Complementing the assembly are detailed gene annotations of 35,594 protein coding genes. We identified 331,335 genomic variations between KitaakeX and Nipponbare (ssp. japonica), and 2,785,991 variations between KitaakeX and Zhenshan97 (ssp. indica). We also compared Kitaake resequencing reads to the KitaakeX assembly and identified 219 small variations. The high-quality genome of the model rice plant KitaakeX will accelerate rice functional genomics. CONCLUSIONS: The high quality, de novo assembly of the KitaakeX genome will serve as a useful reference genome for rice and will accelerate functional genomics studies of rice and other species.
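Several abstracts in this list report a contig or scaffold N50. As a reminder of what that statistic measures, here is a short sketch of the standard definition: the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the assembly. The function name `n50` and the example lengths are illustrative only and are not taken from any of the assemblies above.

```python
from typing import Iterable


def n50(lengths: Iterable[int]) -> int:
    """Return the N50 of a collection of contig or scaffold lengths."""
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("n50() needs at least one sequence length")
    total = sum(ordered)
    running = 0
    for length in ordered:
        running += length
        # Integer comparison avoids floating-point rounding at the halfway point.
        if running * 2 >= total:
            return length
    raise AssertionError("unreachable: the cumulative sum always reaches the total")
```

For a toy assembly with lengths 10, 8, 6, 4 and 2, the two longest sequences already cover 18 of 30 bases, so the N50 is 8.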


Subject(s)
Plant Genome, Genomics, Oryza/genetics, Whole Genome Sequencing, Computational Biology/methods, Genetic Variation, Genomics/methods, Molecular Sequence Annotation, Oryza/classification, Phenotype
13.
Nucleic Acids Res ; 45(D1): D1075-D1081, 2017 01 04.
Article in English | MEDLINE | ID: mdl-27899667

ABSTRACT

We describe updates to the Rice SNP-Seek Database since its first release. We ran a new SNP-calling pipeline followed by filtering that resulted in complete, base, filtered and core SNP datasets. Besides the Nipponbare reference genome, the pipeline was run on genome assemblies of IR 64, 93-11, DJ 123 and Kasalath. New genotype query and display features are added for reference assemblies, SNP datasets and indels. JBrowse now displays BAM, VCF and other annotation tracks, the additional genome assemblies and an embedded VISTA genome comparison viewer. Middleware is redesigned for improved performance by using a hybrid of HDF5 and RDMS for genotype storage. Query modules for genotypes, varieties and genes are improved to handle various constraints. An integrated list manager allows the user to pass query parameters for further analysis. The SNP Annotator adds traits, ontology terms, effects and interactions to markers in a list. Web-service calls were implemented to access most data. These features enable seamless querying of SNP-Seek across various biological entities, a step toward semi-automated gene-trait association discovery. URL: http://snp-seek.irri.org.


Subject(s)
Nucleic Acid Databases, Plant Genome, INDEL Mutation, Oryza/genetics, Single Nucleotide Polymorphism, Search Engine, Software, Alleles, Computational Biology/methods, Gene Frequency, Genetic Loci, Genomics/methods, Genotype, User-Computer Interface, Web Browser
14.
Proc Natl Acad Sci U S A ; 113(35): E5163-71, 2016 08 30.
Article in English | MEDLINE | ID: mdl-27535938

ABSTRACT

Asian cultivated rice consists of two subspecies: Oryza sativa subsp. indica and O. sativa subsp. japonica. Despite the fact that indica rice accounts for over 70% of total rice production worldwide and is genetically much more diverse, a high-quality reference genome for indica rice has yet to be published. We conducted map-based sequencing of two indica rice lines, Zhenshan 97 (ZS97) and Minghui 63 (MH63), which represent the two major varietal groups of the indica subspecies and are the parents of an elite Chinese hybrid. The genome sequences were assembled into 237 (ZS97) and 181 (MH63) contigs, with an accuracy >99.99%, and covered 90.6% and 93.2% of their estimated genome sizes. Comparative analyses of these two indica genomes uncovered surprising structural differences, especially with respect to inversions, translocations, presence/absence variations, and segmental duplications. Approximately 42% of nontransposable element related genes were identical between the two genomes. Transcriptome analysis of three tissues showed that 1,059-2,217 more genes were expressed in the hybrid than in the parents and that the expressed genes in the hybrid were much more diverse due to their divergence between the parental genomes. The public availability of two high-quality reference genomes for the indica subspecies of rice will have large-ranging implications for plant biology and crop genetic improvement.


Subject(s)
Plant Chromosomes/genetics, Genetic Variation, Plant Genome/genetics, Oryza/genetics, Chromosome Mapping/methods, Gene Expression Profiling, Plant Gene Expression Regulation, Plant Genes/genetics, INDEL Mutation, Oryza/classification, Single Nucleotide Polymorphism, Species Specificity
15.
Plant Biotechnol J ; 15(6): 765-774, 2017 Jun.
Article in English | MEDLINE | ID: mdl-27889940

ABSTRACT

The related A genome species of the Oryza genus are the effective gene pool for rice. Here, we report draft genomes for two Australian wild A genome taxa: O. rufipogon-like population, referred to as Taxon A, and O. meridionalis-like population, referred to as Taxon B. These two taxa were sequenced and assembled by integration of short- and long-read next-generation sequencing (NGS) data to create a genomic platform for a wider rice gene pool. Here, we report that, despite the distinct chloroplast genome, the nuclear genome of the Australian Taxon A has a sequence that is much closer to that of domesticated rice (O. sativa) than to the other Australian wild populations. Analysis of 4643 genes in the A genome clade showed that the Australian annual, O. meridionalis, and related perennial taxa have the most divergent (around 3 million years) genome sequences relative to domesticated rice. A test for admixture showed possible introgression into the Australian Taxon A (diverged around 1.6 million years ago) especially from the wild indica/O. nivara clade in Asia. These results demonstrate that northern Australia may be the centre of diversity of the A genome Oryza and suggest the possibility that this might also be the centre of origin of this group and represent an important resource for rice improvement.


Subject(s)
Plant Genome/genetics, Oryza/genetics, Plant Proteins/genetics, Molecular Evolution, Chloroplast Genome/genetics, High-Throughput Nucleotide Sequencing, Phylogeny, DNA Sequence Analysis
16.
Bioinformatics ; 32(20): 3058-3064, 2016 10 15.
Article in English | MEDLINE | ID: mdl-27318200

ABSTRACT

MOTIVATION: Next generation sequencing technologies have revolutionized our ability to rapidly and affordably generate vast quantities of sequence data. Once generated, raw sequences are assembled into contigs or scaffolds. However, these assemblies are mostly fragmented and inaccurate at the whole genome scale, largely due to the inability to integrate additional informative datasets (e.g. physical, optical and genetic maps). To address this problem, we developed a semi-automated software tool, Genome Puzzle Master (GPM), that enables the integration of additional genomic signposts to edit and build 'new-gen-assemblies' that result in high-quality 'annotation-ready' pseudomolecules. RESULTS: With GPM, loaded datasets can be connected to each other via their logical relationships which accomplishes tasks to 'group,' 'merge,' 'order and orient' sequences in a draft assembly. Manual editing can also be performed with a user-friendly graphical interface. Final pseudomolecules reflect a user's total data package and are available for long-term project management. GPM is a web-based pipeline and an important part of a Laboratory Information Management System (LIMS) which can be easily deployed on local servers for any genome research laboratory. AVAILABILITY AND IMPLEMENTATION: The GPM (with LIMS) package is available at https://github.com/Jianwei-Zhang/LIMS. CONTACTS: jzhang@mail.hzau.edu.cn or rwing@mail.arizona.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Genomics, High-Throughput Nucleotide Sequencing, Software, Genome
17.
BMC Genomics ; 16: 538, 2015 Jul 22.
Article in English | MEDLINE | ID: mdl-26194356

ABSTRACT

BACKGROUND: Comparative evolutionary analysis of whole genomes requires not only accurate annotation of gene space, but also proper annotation of the repetitive fraction which is often the largest component of most if not all genomes larger than 50 kb in size. RESULTS: Here we present the Rice TE database (RiTE-db), a genus-wide collection of transposable elements and repeated sequences across 11 diploid species of the genus Oryza and the closely-related out-group Leersia perrieri. The database consists of more than 170,000 entries divided into three main types: (i) a classified and curated set of publicly-available repeated sequences, (ii) a set of consensus assemblies of highly-repetitive sequences obtained from genome sequencing surveys of 12 species; and (iii) a set of full-length TEs, identified and extracted from 12 whole genome assemblies. CONCLUSIONS: This is the first report of a repeat dataset that spans the majority of repeat variability within an entire genus, and one that includes complete elements as well as unassembled repeats. The database allows sequence browsing, downloading, and similarity searches. Because of the strategy adopted, the RiTE-db opens a new path to unprecedented direct comparative studies that span the entire nuclear repeat content of 15 million years of Oryza diversity.


Subject(s)
Genetic Databases, Molecular Evolution, Plant Genome, Oryza/genetics, DNA Transposable Elements/genetics, Genomics, Software
18.
Am J Bot ; 102(7): 1115-27, 2015 Jul.
Article in English | MEDLINE | ID: mdl-26199368

ABSTRACT

PREMISE OF THE STUDY: Land-plant plastid genomes have only rarely undergone significant changes in gene content and order. Thus, discovery of additional examples adds power to tests for causes of such genome-scale structural changes. METHODS: Using next-generation sequence data, we assembled the plastid genome of saguaro cactus and probed the nuclear genome for transferred plastid genes and functionally related nuclear genes. We combined these results with available data across Cactaceae and seed plants more broadly to infer the history of gene loss and to assess the strength of phylogenetic association between gene loss and loss of the inverted repeat (IR). KEY RESULTS: The saguaro plastid genome is the smallest known for an obligately photosynthetic angiosperm (~113 kb), having lost the IR and plastid ndh genes. This loss supports a statistically strong association across seed plants between the loss of ndh genes and the loss of the IR. Many nonplastid copies of plastid ndh genes were found in the nuclear genome, but none had intact reading frames; nor did three related nuclear-encoded subunits. However, nuclear pgr5, which functions in a partially redundant pathway, was intact. CONCLUSIONS: The existence of an alternative pathway redundant with the function of the plastid NADH dehydrogenase-like (NDH) complex may permit loss of the plastid ndh gene suite in photoautotrophs like saguaro. Loss of these genes may be a recurring mechanism for overall plastid genome size reduction, especially in combination with loss of the IR.


Subject(s)
Cactaceae/genetics , Genome, Plastid/genetics , Inverted Repeat Sequences/genetics , NADH Dehydrogenase/genetics , Plastids/genetics , DNA, Plant/chemistry , DNA, Plant/genetics , Evolution, Molecular , Gene Library , High-Throughput Nucleotide Sequencing , Molecular Sequence Annotation , Phylogeny , Plant Proteins/genetics , Sequence Analysis, DNA
19.
Ecol Evol ; 14(3): e10979, 2024 Mar.
Article in English | MEDLINE | ID: mdl-38476697

ABSTRACT

The assembly of genomes from pooled samples of genetically heterogeneous conspecifics remains challenging. In this study, we show that high-quality genome assemblies can be produced from samples of multiple wild-caught individuals. We sequenced DNA extracted from a pooled sample of conspecific herbivorous insects (Hemiptera: Miridae: Tupiocoris notatus) acquired from a greenhouse infestation in Tucson, Arizona (in the range of 30-100 individuals; 0.5 mL tissue by volume) using PacBio highly accurate long reads (HiFi). The initial assembly contained multiple haplotigs (>85% BUSCOs duplicated), but duplicate contigs could be easily purged to reveal a highly complete assembly (95.6% BUSCO, 4.4% duplicated) that is highly contiguous by short-read assembly standards (N50 = 675 kb; largest contig = 4.3 Mb). We then used our assembly as the basis for a genome-guided differential expression study of host plant-specific transcriptional responses. We found thousands of genes (N = 4982) to be differentially expressed between our new data from individuals feeding on Datura wrightii (Solanaceae) and existing RNA-seq data from Nicotiana attenuata (Solanaceae)-fed individuals. We identified many of these genes as previously documented detoxification genes such as glutathione-S-transferases, cytochrome P450s, and UDP-glucosyltransferases. Together, our results show that long-read sequencing of pooled samples can provide a cost-effective genome assembly option for small insects and can provide insights into the genetic mechanisms underlying interactions between plants and herbivorous pests.
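
For readers unfamiliar with the contiguity statistic quoted in this abstract, the following is a minimal sketch of how N50 is conventionally computed: the length of the contig at which the running total of contig lengths, taken from longest to shortest, first reaches half of the assembly size. The function name `n50` and the toy contig lengths are illustrative assumptions and are not taken from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths.

    N50 is the length L such that contigs of length >= L together
    cover at least half of the total assembly length.
    """
    if not lengths:
        raise ValueError("need at least one contig length")
    ordered = sorted(lengths, reverse=True)  # longest contig first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical toy assembly (lengths in kb), total = 300, half = 150.
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_contigs))  # 80 + 70 = 150 reaches half, so N50 = 70
```

A higher N50 for the same total length means the assembly is made of fewer, longer pieces, which is why it is reported alongside the largest contig.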

20.
GigaByte ; 2024: gigabyte112, 2024.
Article in English | MEDLINE | ID: mdl-38496214

ABSTRACT

This work is an update and extension of the previously published article "Ultralong Oxford Nanopore Reads Enable the Development of a Reference-Grade Perennial Ryegrass Genome Assembly" by Frei et al. The published genome assembly of the doubled haploid perennial ryegrass (Lolium perenne L.) genotype Kyuss (Kyuss v1.0) marked a milestone for forage grass research and breeding. However, order and orientation errors may exist in the pseudo-chromosomes of Kyuss, since barley (Hordeum vulgare L.), which diverged 30 million years ago from perennial ryegrass, was used as the reference to scaffold Kyuss. To correct for structural errors possibly present in the published Kyuss assembly, we de novo assembled the genome again and generated 50-fold coverage high-throughput chromosome conformation capture (Hi-C) data to assist pseudo-chromosome construction. The resulting new chromosome-level assembly Kyuss v2.0 showed improved quality with high contiguity (contig N50 = 120 Mb), high completeness (total BUSCO score = 99%), high base-level accuracy (QV = 50), and correct pseudo-chromosome structure (validated by Hi-C contact map). This new assembly will serve as a better reference genome for Lolium spp. and greatly benefit the forage and turf grass research community.
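
As a reading aid for the base-level accuracy figure above, here is a short sketch that converts between a quality value and a per-base error rate. It assumes QV is the usual Phred-scaled consensus quality, QV = -10 * log10(P_error); the abstract does not state how QV was estimated, and the function names are hypothetical.

```python
import math


def qv_to_error_rate(qv):
    """Convert a Phred-scaled quality value to a per-base error probability."""
    return 10 ** (-qv / 10)


def error_rate_to_qv(p_error):
    """Convert a per-base error probability to a Phred-scaled quality value."""
    if not 0 < p_error <= 1:
        raise ValueError("error probability must be in (0, 1]")
    return -10 * math.log10(p_error)


# Under the Phred assumption, QV = 50 corresponds to roughly
# one expected error per 100,000 bases.
rate = qv_to_error_rate(50)
print(f"error rate: {rate:.1e}, bases per error: {1 / rate:,.0f}")
```

Each 10-point increase in QV corresponds to a tenfold lower error rate under this scaling.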
