1.
Nature ; 588(7837): 277-283, 2020 12.
Article in English | MEDLINE | ID: mdl-33239791

ABSTRACT

Advances in genomics have expedited the improvement of several agriculturally important crops but similar efforts in wheat (Triticum spp.) have been more challenging. This is largely owing to the size and complexity of the wheat genome, and the lack of genome-assembly data for multiple wheat lines. Here we generated ten chromosome pseudomolecule and five scaffold assemblies of hexaploid wheat to explore the genomic diversity among wheat lines from global breeding programs. Comparative analysis revealed extensive structural rearrangements, introgressions from wild relatives and differences in gene content resulting from complex breeding histories aimed at improving adaptation to diverse environments, grain yield and quality, and resistance to stresses. We provide examples outlining the utility of these genomes, including a detailed multi-genome-derived nucleotide-binding leucine-rich repeat protein repertoire involved in disease resistance and the characterization of Sm16, a gene associated with insect resistance. These genome assemblies will provide a basis for functional gene discovery and breeding to deliver the next generation of modern wheat cultivars.


Subjects
Genetic Variation , Plant Genome/genetics , Genomics , Internationality , Plant Breeding/methods , Triticum/genetics , Acclimatization/genetics , Animals , Centromere/genetics , Centromere/metabolism , Chromosome Mapping , Molecular Cloning , DNA Copy Number Variations/genetics , DNA Transposable Elements/genetics , Edible Grain/genetics , Edible Grain/growth & development , Plant Genes/genetics , Genetic Introgression , Haplotypes , Insects/pathogenicity , NLR Proteins/genetics , Plant Diseases/genetics , Plant Proteins/genetics , Single Nucleotide Polymorphism/genetics , Polyploidy , Triticum/classification , Triticum/growth & development
2.
Plant J ; 118(5): 1516-1527, 2024 Jun.
Article in English | MEDLINE | ID: mdl-38412295

ABSTRACT

Bacterial wilt, caused by Xanthomonas translucens pv. graminis (Xtg), is a serious disease of economically important forage grasses, including Italian ryegrass (Lolium multiflorum Lam.). A major QTL for resistance to Xtg was previously identified, but the precise location as well as the genetic factors underlying the resistance are yet to be determined. To this end, we applied a bulked segregant analysis (BSA) approach, using whole-genome deep sequencing of pools of the most resistant and most susceptible individuals of a large (n = 7484) biparental F2 population segregating for resistance to Xtg. Using chromosome-level genome assemblies as references, we were able to define a ~300 kb region highly associated with resistance on pseudo-chromosome 4. Further investigation of this region revealed multiple genes with a known role in disease resistance, including genes encoding for Pik2-like disease resistance proteins, cysteine-rich kinases, and RGA4- and RGA5-like disease resistance proteins. Investigation of allele frequencies in the pools and comparative genome analysis in the grandparents of the F2 population revealed that some of these genes contain variants with allele frequencies that correspond to the expected heterozygosity in the resistant grandparent. This study emphasizes the efficacy of combining BSA studies in very large populations with whole genome deep sequencing and high-quality genome assemblies to pinpoint regions associated with a binary trait of interest and accurately define a small set of candidate genes. Furthermore, markers identified in this region hold significant potential for marker-assisted breeding strategies to breed resistance to Xtg in Italian ryegrass cultivars more efficiently.


Subjects
Disease Resistance , Lolium , Plant Diseases , Xanthomonas , Lolium/genetics , Lolium/microbiology , Disease Resistance/genetics , Plant Diseases/microbiology , Plant Diseases/genetics , Plant Diseases/immunology , Xanthomonas/physiology , Quantitative Trait Loci/genetics , Plant Genes/genetics , Chromosome Mapping
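
The abstract in entry 2 describes bulked segregant analysis (BSA): sequencing a pool of resistant and a pool of susceptible individuals and looking for regions where allele frequencies differ between the pools. The core comparison might be sketched roughly as below. This is a minimal illustration and not the authors' pipeline; the function names, the read counts, and the 0.5 threshold are all hypothetical.

```python
from typing import Dict, List, Tuple

# (alternate-allele reads, total reads) observed at one variant in one pool
Counts = Tuple[int, int]


def allele_frequency(alt_reads: int, total_reads: int) -> float:
    """Fraction of reads in a pool that carry the alternate allele."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    return alt_reads / total_reads


def delta_allele_frequency(resistant: Counts, susceptible: Counts) -> float:
    """Allele frequency in the resistant pool minus that in the susceptible pool."""
    return allele_frequency(*resistant) - allele_frequency(*susceptible)


def candidate_positions(
    pools: Dict[int, Tuple[Counts, Counts]], threshold: float = 0.5
) -> List[int]:
    """Positions whose absolute frequency difference reaches the (assumed) threshold."""
    return [
        position
        for position, (resistant, susceptible) in sorted(pools.items())
        if abs(delta_allele_frequency(resistant, susceptible)) >= threshold
    ]


# Hypothetical read counts: position -> (resistant pool, susceptible pool)
example_pools = {
    1000: ((48, 100), (52, 100)),  # similar frequencies: likely unlinked
    2000: ((90, 100), (10, 100)),  # strongly diverging: candidate region
    3000: ((55, 100), (45, 100)),
}

print(candidate_positions(example_pools))
```

A real analysis would typically smooth these differences over sliding windows and account for sequencing depth; the sketch only shows the per-variant comparison.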
3.
Mol Biol Evol ; 40(1)2023 01 04.
Article in English | MEDLINE | ID: mdl-36477354

ABSTRACT

Self-incompatibility (SI) is a genetic mechanism of hermaphroditic plants to prevent inbreeding after self-pollination. Allogamous Poaceae species exhibit a unique gametophytic SI system controlled by two multi-allelic and independent loci, S and Z. Despite intense research efforts in the last decades, the genes that determine the initial recognition mechanism are yet to be identified. Here, we report the fine-mapping of the Z-locus in perennial ryegrass (Lolium perenne L.) and provide evidence that the pollen and stigma components are determined by two genes encoding DUF247 domain proteins (ZDUF247-I and ZDUF247-II) and the gene sZ, respectively. The pollen and stigma determinants are located side-by-side and were genetically linked in 10,245 individuals of two independent mapping populations segregating for Z. Moreover, they exhibited high allelic diversity as well as tissue-specific gene expression, matching the expected characteristics of SI determinants known from other systems. Revisiting the S-locus using the latest high-quality whole-genome assemblies revealed a similar gene composition and structure as found for Z, supporting the hypothesis of a duplicated origin of the two-locus SI system of grasses. Ultimately, comparative genomic analyses across a wide range of self-compatible and self-incompatible Poaceae species revealed that the absence of a functional copy of at least one of the six putative SI determinants is accompanied by a self-compatible phenotype. Our study provides new insights into the origin and evolution of the unique gametophytic SI system in one of the largest and economically most important plant families.


Subjects
Lolium , Poaceae , Poaceae/genetics , Lolium/genetics , Pollen/genetics , Plants , Genomics
4.
Nature ; 557(7703): 43-49, 2018 05.
Article in English | MEDLINE | ID: mdl-29695866

ABSTRACT

Here we analyse genetic variation, population structure and diversity among 3,010 diverse Asian cultivated rice (Oryza sativa L.) genomes from the 3,000 Rice Genomes Project. Our results are consistent with the five major groups previously recognized, but also suggest several unreported subpopulations that correlate with geographic location. We identified 29 million single nucleotide polymorphisms, 2.4 million small indels and over 90,000 structural variations that contribute to within- and between-population variation. Using pan-genome analyses, we identified more than 10,000 novel full-length protein-coding genes and a high number of presence-absence variations. The complex patterns of introgression observed in domestication genes are consistent with multiple independent rice domestication events. The public availability of data from the 3,000 Rice Genomes Project provides a resource for rice genomics research and breeding.


Subjects
Agricultural Crops/classification , Agricultural Crops/genetics , Genetic Variation , Plant Genome/genetics , Oryza/classification , Oryza/genetics , Asia , Molecular Evolution , Plant Genes/genetics , Population Genetics , Genomics , Haplotypes , INDEL Mutation/genetics , Phylogeny , Plant Breeding , Single Nucleotide Polymorphism/genetics
5.
Syst Biol ; 71(5): 1178-1194, 2022 08 10.
Article in English | MEDLINE | ID: mdl-35244183

ABSTRACT

Reconstructing accurate historical relationships within a species poses numerous challenges, not least in many plant groups in which gene flow is high enough to extend well beyond species boundaries. Nonetheless, the extent of tree-like history within a species is an empirical question on which it is now possible to bring large amounts of genome sequence to bear. We assess phylogenetic structure across the geographic range of the saguaro cactus, an emblematic member of Cactaceae, a clade known for extensive hybridization and porous species boundaries. Using 200 Gb of whole genome resequencing data from 20 individuals sampled from 10 localities, we assembled two data sets comprising 150,000 biallelic single nucleotide polymorphisms (SNPs) from protein coding sequences. From these, we inferred within-species trees and evaluated their significance and robustness using five qualitatively different inference methods. Despite the low sequence diversity, large census population sizes, and presence of wide-ranging pollen and seed dispersal agents, phylogenetic trees were well resolved and highly consistent across both data sets and all methods. We inferred that the most likely root, based on marginal likelihood comparisons, is to the east and south of the region of highest genetic diversity, which lies along the coast of the Gulf of California in Sonora, Mexico. Together with striking decreases in marginal likelihood found to the north, this supports hypotheses that saguaro's current range reflects postglacial expansion from the refugia in the south of its range. We conclude with observations about practical and theoretical issues raised by phylogenomic data sets within species, in which SNP-based methods must be used rather than gene tree methods that are widely used when sequence divergence is higher. These include computational scalability, inference of gene flow, and proper assessment of statistical support in the presence of linkage effects. 
[Phylogenomics; phylogeography; rooting; Sonoran Desert.].


Subjects
Cactaceae , Cactaceae/genetics , Genetic Hybridization , Phylogeny , Phylogeography , DNA Sequence Analysis
6.
Plant J ; 107(4): 1166-1182, 2021 08.
Article in English | MEDLINE | ID: mdl-34152039

ABSTRACT

Allopolyploidization, entailing the merger of two distinct genomes in a single hybrid organism, is an important process in plant evolution and a valuable tool in breeding programs. Newly established hybrids often experience massive genomic perturbations, including karyotype reshuffling and gene expression modifications. These phenomena may be asymmetric with respect to the two progenitors, with one of the parental genomes being "dominant." Such "genome dominance" can manifest in several ways, including biased homoeolog gene expression and expression level dominance. Here we employed a k-mer-based approach to study gene expression in reciprocal Festuca pratensis Huds. × Lolium multiflorum Lam. allopolyploid grasses. Our study revealed significantly more genes where expression mimicked that of the Lolium parent compared with the Festuca parent. This genome dominance was heritable to successive generations and its direction was only slightly modified by environmental conditions and plant age. Our results suggest that Lolium genome dominance was at least partially caused by its more efficient trans-acting gene expression regulatory factors. Unraveling the mechanisms responsible for propagation of parent-specific traits in hybrid crops contributes to our understanding of allopolyploid genome evolution and opens a way to targeted breeding strategies.


Subjects
Festuca/genetics , Plant Gene Expression Regulation , Plant Genome , Lolium/genetics , Polyploidy , Agricultural Crops , Genetic Databases , Festuca/growth & development , Gene Expression Profiling , Lolium/growth & development , Nucleic Acid Regulatory Sequences , RNA Sequence Analysis
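
The abstract in entry 6 mentions a k-mer-based approach without giving details. As a loose illustration of the general idea only (not the method used in that study), parent-specific k-mers can be used to guess which parental genome a sequencing read derives from. All sequences, names, and the value of k below are made up for the example.

```python
from typing import Set, Tuple


def kmers(sequence: str, k: int) -> Set[str]:
    """All distinct substrings of length k in the sequence."""
    return {sequence[i:i + k] for i in range(len(sequence) - k + 1)}


def parent_specific_kmers(parent_a: str, parent_b: str, k: int) -> Tuple[Set[str], Set[str]]:
    """K-mers found in only one of the two parental sequences."""
    a, b = kmers(parent_a, k), kmers(parent_b, k)
    return a - b, b - a


def assign_read(read: str, only_a: Set[str], only_b: Set[str], k: int,
                label_a: str = "A", label_b: str = "B") -> str:
    """Label a read by whichever parent contributes more diagnostic k-mers."""
    read_kmers = kmers(read, k)
    hits_a = len(read_kmers & only_a)
    hits_b = len(read_kmers & only_b)
    if hits_a > hits_b:
        return label_a
    if hits_b > hits_a:
        return label_b
    return "ambiguous"


# Toy parental sequences differing by a single base (hypothetical data)
festuca_seq = "ACGTACGTTA"
lolium_seq = "ACGTACCTTA"
K = 4

only_festuca, only_lolium = parent_specific_kmers(festuca_seq, lolium_seq, K)

for read in ("TACCTT", "ACGTTA", "ACGTAC"):
    print(read, assign_read(read, only_festuca, only_lolium, K, "Festuca", "Lolium"))
```

Reads that contain only shared k-mers stay "ambiguous", which is why approaches of this kind depend on enough sequence divergence between the parents.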
7.
Plant J ; 101(3): 529-542, 2020 02.
Article in English | MEDLINE | ID: mdl-31571285

ABSTRACT

A wild grape haplotype (Rpv3-1) confers resistance to Plasmopara viticola. We mapped the causal factor for resistance to an interval containing a TIR-NB-LRR (TNL) gene pair that originated 1.6-2.6 million years ago by a tandem segmental duplication. Transient coexpression of the TNL pair in Vitis vinifera leaves activated pathogen-induced necrosis and reduced sporulation compared with control leaves. Even though transcripts of the TNL pair from the wild haplotype appear to be partially subject to nonsense-mediated mRNA decay, mature mRNA levels in a homozygous resistant genotype were individually higher than the mRNA trace levels observed for the orthologous single-copy TNL in sensitive genotypes. Allelic expression imbalance in a resistant heterozygote confirmed that cis-acting regulatory variation promotes expression in the wild haplotype. The movement of transposable elements had a major impact on the generation of haplotype diversity, altering the DNA context around similar TNL coding sequences and the GC-content in their proximal 5'-intergenic regions. The wild and domesticated haplotypes also diverged in conserved single-copy intergenic DNA, but the highest divergence was observed in intraspecific and not in interspecific comparisons. In this case, introgression breeding did not transgress the genetic boundaries of the domesticated species, because haplotypes present in modern varieties sometimes predate speciation events between wild and cultivated species.


Subjects
Gene Duplication , Interspersed Repetitive Sequences/genetics , Oomycetes/physiology , Plant Diseases/immunology , Plant Proteins/metabolism , Vitis/genetics , Alleles , Breeding , Disease Resistance/genetics , Genotype , Haplotypes , Plant Diseases/parasitology , Plant Leaves/genetics , Plant Leaves/immunology , Plant Leaves/parasitology , Plant Proteins/genetics , Vitis/immunology , Vitis/parasitology
8.
Plant Cell Physiol ; 62(1): 8-27, 2021 Mar 25.
Article in English | MEDLINE | ID: mdl-33244607

ABSTRACT

Bread wheat is a major crop that has long been the focus of basic and breeding research. Assembly of its genome has been difficult because of its large size and allohexaploid nature (AABBDD genome). Following the first reported assembly of the genome of the experimental strain Chinese Spring (CS), the 10+ Wheat Genomes Project was launched to produce multiple assemblies of worldwide modern cultivars. The only Asian cultivar in the project is Norin 61, a representative Japanese cultivar adapted to grow across a broad latitudinal range, mostly characterized by a wet climate and a short growing season. Here, we characterize the key aspects of its chromosome-scale genome assembly spanning 15 Gb with a raw scaffold N50 of 22 Mb. Analysis of the repetitive elements identified chromosomal regions unique to Norin 61 that encompass a tandem array of the pathogenesis-related 13 family. We report novel copy-number variations in the B homeolog of the florigen gene FT1/VRN3, pseudogenization of its D homeolog and the association of its A homeologous alleles with the spring/winter growth habit. Furthermore, the Norin 61 genome carries typical East Asian functional variants different from CS, ranging from a single nucleotide to multi-Mb scale. Examples of such variation are the Fhb1 locus, which confers Fusarium head-blight resistance, Ppd-D1a, which confers early flowering, Glu-D1f for Asian noodle quality and Rht-D1b, which introduced semi-dwarfism during the green revolution. The adoption of Norin 61 as a reference assembly for functional and evolutionary studies will enable comprehensive characterization of the underexploited Asian bread wheat diversity.


Subjects
Disease Resistance/genetics , Flowers/growth & development , Plant Genes/genetics , Plant Genome/genetics , Triticum/genetics , Chromosome Mapping , Plant Chromosomes/genetics , Cytogenetics , Eastern Asia , Flowers/genetics , Fusarium , Plant Genes/physiology , Genetic Association Studies , Genetic Variation/genetics , Genetic Variation/physiology , Plant Genome/physiology , Genotype , Phylogeny , Sequence Alignment , DNA Sequence Analysis , Triticum/growth & development , Triticum/immunology , Triticum/physiology
9.
Plant Biotechnol J ; 19(3): 602-614, 2021 03.
Article in English | MEDLINE | ID: mdl-33073461

ABSTRACT

Brassica juncea (AABB), commonly referred to as mustard, is a natural allopolyploid of two diploid species, B. rapa (AA) and B. nigra (BB). We report a highly contiguous genome assembly of an oleiferous type of B. juncea variety Varuna, an archetypical Indian gene pool line of mustard, with ~100× PacBio single-molecule real-time (SMRT) long reads providing contigs with an N50 value of >5 Mb. Contigs were corrected for the misassemblies and scaffolded with BioNano optical mapping. We also assembled a draft genome of B. nigra (BB) variety Sangam using Illumina short-read sequencing and Oxford Nanopore long reads and used it to validate the assembly of the B genome of B. juncea. Two different linkage maps of B. juncea, containing a large number of genotyping-by-sequencing markers, were developed and used to anchor scaffolds/contigs to the 18 linkage groups of the species. The resulting chromosome-scale assembly of B. juncea Varuna is a significant improvement over the previous draft assembly of B. juncea Tumida, a vegetable type of mustard. The assembled genome was characterized for transposons, centromeric repeats, gene content and gene block associations. In comparison to the A genome, the B genome contains a significantly higher content of LTR/Gypsy retrotransposons, distinct centromeric repeats and a large number of B. nigra specific gene clusters that break the gene collinearity between the A and the B genomes. The B. juncea Varuna assembly will be of major value to the breeding work on oleiferous types of mustard that are grown extensively in south Asia and elsewhere.


Subjects
Plant Genome , Mustard Plant , Asia , Chromosome Mapping , Chromosomes , Plant Genome/genetics , Mustard Plant/genetics , Plant Breeding
10.
New Phytol ; 227(3): 914-929, 2020 08.
Article in English | MEDLINE | ID: mdl-31369159

ABSTRACT

The evolution of l-DOPA 4,5-dioxygenase activity, encoded by the gene DODA, was a key step in the origin of betalain biosynthesis in Caryophyllales. We previously proposed that l-DOPA 4,5-dioxygenase activity evolved via a single Caryophyllales-specific neofunctionalisation event within the DODA gene lineage. However, this neofunctionalisation event has not been confirmed and the DODA gene lineage exhibits numerous gene duplication events, whose evolutionary significance is unclear. To address this, we functionally characterised 23 distinct DODA proteins for l-DOPA 4,5-dioxygenase activity, from four betalain-pigmented and five anthocyanin-pigmented species, representing key evolutionary transitions across Caryophyllales. By mapping these functional data to an updated DODA phylogeny, we then explored the evolution of l-DOPA 4,5-dioxygenase activity. We find that low l-DOPA 4,5-dioxygenase activity is distributed across the DODA gene lineage. In this context, repeated gene duplication events within the DODA gene lineage give rise to polyphyletic occurrences of elevated l-DOPA 4,5-dioxygenase activity, accompanied by convergent shifts in key functional residues and distinct genomic patterns of micro-synteny. In the context of an updated organismal phylogeny and newly inferred pigment reconstructions, we argue that repeated convergent acquisition of elevated l-DOPA 4,5-dioxygenase activity is consistent with recurrent specialisation to betalain synthesis in Caryophyllales.


Subjects
Caryophyllales , Dioxygenases , Betalains , Dioxygenases/genetics , Levodopa , Phylogeny , Pigmentation
11.
Proc Natl Acad Sci U S A ; 114(45): 12003-12008, 2017 11 07.
Article in English | MEDLINE | ID: mdl-29078296

ABSTRACT

Few clades of plants have proven as difficult to classify as cacti. One explanation may be an unusually high level of convergent and parallel evolution (homoplasy). To evaluate support for this phylogenetic hypothesis at the molecular level, we sequenced the genomes of four cacti in the especially problematic tribe Pachycereeae, which contains most of the large columnar cacti of Mexico and adjacent areas, including the iconic saguaro cactus (Carnegiea gigantea) of the Sonoran Desert. We assembled a high-coverage draft genome for saguaro and lower coverage genomes for three other genera of tribe Pachycereeae (Pachycereus, Lophocereus, and Stenocereus) and a more distant outgroup cactus, Pereskia. We used these to construct 4,436 orthologous gene alignments. Species tree inference consistently returned the same phylogeny, but gene tree discordance was high: 37% of gene trees having at least 90% bootstrap support conflicted with the species tree. Evidently, discordance is a product of long generation times and moderately large effective population sizes, leading to extensive incomplete lineage sorting (ILS). In the best supported gene trees, 58% of apparent homoplasy at amino sites in the species tree is due to gene tree-species tree discordance rather than parallel substitutions in the gene trees themselves, a phenomenon termed "hemiplasy." The high rate of genomic hemiplasy may contribute to apparent parallelisms in phenotypic traits, which could confound understanding of species relationships and character evolution in cacti.


Subjects
Cactaceae/genetics , Plant Genome/genetics , Base Sequence , Molecular Evolution , Genomics/methods , Mexico , Genetic Models , North America , Phylogeny
12.
BMC Genomics ; 20(1): 905, 2019 Nov 27.
Article in English | MEDLINE | ID: mdl-31775618

ABSTRACT

BACKGROUND: The availability of thousands of complete rice genome sequences from diverse varieties and accessions has laid the foundation for in-depth exploration of the rice genome. One drawback to these collections is that most of these rice varieties have long life cycles, and/or low transformation efficiencies, which limits their usefulness as model organisms for functional genomics studies. In contrast, the rice variety Kitaake has a rapid life cycle (9 weeks seed to seed) and is easy to transform and propagate. For these reasons, Kitaake has emerged as a model for studies of diverse monocotyledonous species. RESULTS: Here, we report the de novo genome sequencing and analysis of Oryza sativa ssp. japonica variety KitaakeX, a Kitaake plant carrying the rice XA21 immune receptor. Our KitaakeX sequence assembly contains 377.6 Mb, consisting of 33 scaffolds (476 contigs) with a contig N50 of 1.4 Mb. Complementing the assembly are detailed gene annotations of 35,594 protein coding genes. We identified 331,335 genomic variations between KitaakeX and Nipponbare (ssp. japonica), and 2,785,991 variations between KitaakeX and Zhenshan97 (ssp. indica). We also compared Kitaake resequencing reads to the KitaakeX assembly and identified 219 small variations. The high-quality genome of the model rice plant KitaakeX will accelerate rice functional genomics. CONCLUSIONS: The high quality, de novo assembly of the KitaakeX genome will serve as a useful reference genome for rice and will accelerate functional genomics studies of rice and other species.


Subjects
Plant Genome , Genomics , Oryza/genetics , Whole Genome Sequencing , Computational Biology/methods , Genetic Variation , Genomics/methods , Molecular Sequence Annotation , Oryza/classification , Phenotype
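
Several abstracts in this list, including entry 12, report a contig or scaffold N50. For readers unfamiliar with the statistic, N50 is the length of the sequence at which the cumulative length, counted from the longest sequence downward, first reaches half of the total assembly size. A minimal sketch follows; the contig lengths are invented and unrelated to any assembly listed here.

```python
from typing import Iterable


def n50(lengths: Iterable[int]) -> int:
    """Return the N50 of a collection of contig or scaffold lengths."""
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("at least one length is required")
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # The first sequence that pushes the running sum to half the total is the N50
        if running >= half_total:
            return length
    raise AssertionError("unreachable for non-empty input")


# Hypothetical contig lengths (arbitrary units); total = 290, half = 145
contig_lengths = [80, 70, 50, 40, 30, 20]
print(n50(contig_lengths))  # 80 + 70 = 150 >= 145, so this prints 70
```

Because N50 depends only on the length distribution, a high value indicates contiguity but says nothing about whether the sequences were joined correctly.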
13.
Nucleic Acids Res ; 45(D1): D1075-D1081, 2017 01 04.
Article in English | MEDLINE | ID: mdl-27899667

ABSTRACT

We describe updates to the Rice SNP-Seek Database since its first release. We ran a new SNP-calling pipeline followed by filtering that resulted in complete, base, filtered and core SNP datasets. Besides the Nipponbare reference genome, the pipeline was run on genome assemblies of IR 64, 93-11, DJ 123 and Kasalath. New genotype query and display features are added for reference assemblies, SNP datasets and indels. JBrowse now displays BAM, VCF and other annotation tracks, the additional genome assemblies and an embedded VISTA genome comparison viewer. Middleware is redesigned for improved performance by using a hybrid of HDF5 and RDMS for genotype storage. Query modules for genotypes, varieties and genes are improved to handle various constraints. An integrated list manager allows the user to pass query parameters for further analysis. The SNP Annotator adds traits, ontology terms, effects and interactions to markers in a list. Web-service calls were implemented to access most data. These features enable seamless querying of SNP-Seek across various biological entities, a step toward semi-automated gene-trait association discovery. URL: http://snp-seek.irri.org.


Subjects
Nucleic Acid Databases , Plant Genome , INDEL Mutation , Oryza/genetics , Single Nucleotide Polymorphism , Search Engine , Software , Alleles , Computational Biology/methods , Gene Frequency , Genetic Loci , Genomics/methods , Genotype , User-Computer Interface , Web Browser
14.
Proc Natl Acad Sci U S A ; 113(35): E5163-71, 2016 08 30.
Article in English | MEDLINE | ID: mdl-27535938

ABSTRACT

Asian cultivated rice consists of two subspecies: Oryza sativa subsp. indica and O. sativa subsp. japonica. Despite the fact that indica rice accounts for over 70% of total rice production worldwide and is genetically much more diverse, a high-quality reference genome for indica rice has yet to be published. We conducted map-based sequencing of two indica rice lines, Zhenshan 97 (ZS97) and Minghui 63 (MH63), which represent the two major varietal groups of the indica subspecies and are the parents of an elite Chinese hybrid. The genome sequences were assembled into 237 (ZS97) and 181 (MH63) contigs, with an accuracy >99.99%, and covered 90.6% and 93.2% of their estimated genome sizes. Comparative analyses of these two indica genomes uncovered surprising structural differences, especially with respect to inversions, translocations, presence/absence variations, and segmental duplications. Approximately 42% of nontransposable element related genes were identical between the two genomes. Transcriptome analysis of three tissues showed that 1,059-2,217 more genes were expressed in the hybrid than in the parents and that the expressed genes in the hybrid were much more diverse due to their divergence between the parental genomes. The public availability of two high-quality reference genomes for the indica subspecies of rice will have large-ranging implications for plant biology and crop genetic improvement.


Subjects
Plant Chromosomes/genetics , Genetic Variation , Plant Genome/genetics , Oryza/genetics , Chromosome Mapping/methods , Gene Expression Profiling , Plant Gene Expression Regulation , Plant Genes/genetics , INDEL Mutation , Oryza/classification , Single Nucleotide Polymorphism , Species Specificity
15.
Plant Biotechnol J ; 15(6): 765-774, 2017 Jun.
Article in English | MEDLINE | ID: mdl-27889940

ABSTRACT

The related A genome species of the Oryza genus are the effective gene pool for rice. Here, we report draft genomes for two Australian wild A genome taxa: O. rufipogon-like population, referred to as Taxon A, and O. meridionalis-like population, referred to as Taxon B. These two taxa were sequenced and assembled by integration of short- and long-read next-generation sequencing (NGS) data to create a genomic platform for a wider rice gene pool. Here, we report that, despite the distinct chloroplast genome, the nuclear genome of the Australian Taxon A has a sequence that is much closer to that of domesticated rice (O. sativa) than to the other Australian wild populations. Analysis of 4643 genes in the A genome clade showed that the Australian annual, O. meridionalis, and related perennial taxa have the most divergent (around 3 million years) genome sequences relative to domesticated rice. A test for admixture showed possible introgression into the Australian Taxon A (diverged around 1.6 million years ago) especially from the wild indica/O. nivara clade in Asia. These results demonstrate that northern Australia may be the centre of diversity of the A genome Oryza and suggest the possibility that this might also be the centre of origin of this group and represent an important resource for rice improvement.


Subjects
Plant Genome/genetics , Oryza/genetics , Plant Proteins/genetics , Molecular Evolution , Chloroplast Genome/genetics , High-Throughput Nucleotide Sequencing , Phylogeny , DNA Sequence Analysis
16.
Bioinformatics ; 32(20): 3058-3064, 2016 10 15.
Article in English | MEDLINE | ID: mdl-27318200

ABSTRACT

MOTIVATION: Next generation sequencing technologies have revolutionized our ability to rapidly and affordably generate vast quantities of sequence data. Once generated, raw sequences are assembled into contigs or scaffolds. However, these assemblies are mostly fragmented and inaccurate at the whole genome scale, largely due to the inability to integrate additional informative datasets (e.g. physical, optical and genetic maps). To address this problem, we developed a semi-automated software tool, Genome Puzzle Master (GPM), that enables the integration of additional genomic signposts to edit and build 'new-gen-assemblies' that result in high-quality 'annotation-ready' pseudomolecules. RESULTS: With GPM, loaded datasets can be connected to each other via their logical relationships which accomplishes tasks to 'group,' 'merge,' 'order and orient' sequences in a draft assembly. Manual editing can also be performed with a user-friendly graphical interface. Final pseudomolecules reflect a user's total data package and are available for long-term project management. GPM is a web-based pipeline and an important part of a Laboratory Information Management System (LIMS) which can be easily deployed on local servers for any genome research laboratory. AVAILABILITY AND IMPLEMENTATION: The GPM (with LIMS) package is available at https://github.com/Jianwei-Zhang/LIMS. CONTACTS: jzhang@mail.hzau.edu.cn or rwing@mail.arizona.edu. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subjects
Genomics , High-Throughput Nucleotide Sequencing , Software , Genome
17.
BMC Genomics ; 16: 538, 2015 Jul 22.
Article in English | MEDLINE | ID: mdl-26194356

ABSTRACT

BACKGROUND: Comparative evolutionary analysis of whole genomes requires not only accurate annotation of gene space, but also proper annotation of the repetitive fraction which is often the largest component of most if not all genomes larger than 50 kb in size. RESULTS: Here we present the Rice TE database (RiTE-db), a genus-wide collection of transposable elements and repeated sequences across 11 diploid species of the genus Oryza and the closely-related out-group Leersia perrieri. The database consists of more than 170,000 entries divided into three main types: (i) a classified and curated set of publicly-available repeated sequences, (ii) a set of consensus assemblies of highly-repetitive sequences obtained from genome sequencing surveys of 12 species; and (iii) a set of full-length TEs, identified and extracted from 12 whole genome assemblies. CONCLUSIONS: This is the first report of a repeat dataset that spans the majority of repeat variability within an entire genus, and one that includes complete elements as well as unassembled repeats. The database allows sequence browsing, downloading, and similarity searches. Because of the strategy adopted, the RiTE-db opens a new path to unprecedented direct comparative studies that span the entire nuclear repeat content of 15 million years of Oryza diversity.


Subjects
Genetic Databases , Molecular Evolution , Plant Genome , Oryza/genetics , DNA Transposable Elements/genetics , Genomics , Software
18.
Am J Bot ; 102(7): 1115-27, 2015 Jul.
Article in English | MEDLINE | ID: mdl-26199368

ABSTRACT

UNLABELLED: • PREMISE OF THE STUDY: Land-plant plastid genomes have only rarely undergone significant changes in gene content and order. Thus, discovery of additional examples adds power to tests for causes of such genome-scale structural changes.• METHODS: Using next-generation sequence data, we assembled the plastid genome of saguaro cactus and probed the nuclear genome for transferred plastid genes and functionally related nuclear genes. We combined these results with available data across Cactaceae and seed plants more broadly to infer the history of gene loss and to assess the strength of phylogenetic association between gene loss and loss of the inverted repeat (IR).• KEY RESULTS: The saguaro plastid genome is the smallest known for an obligately photosynthetic angiosperm (∼113 kb), having lost the IR and plastid ndh genes. This loss supports a statistically strong association across seed plants between the loss of ndh genes and the loss of the IR. Many nonplastid copies of plastid ndh genes were found in the nuclear genome, but none had intact reading frames; nor did three related nuclear-encoded subunits. However, nuclear pgr5, which functions in a partially redundant pathway, was intact.• CONCLUSIONS: The existence of an alternative pathway redundant with the function of the plastid NADH dehydrogenase-like complex (NDH) complex may permit loss of the plastid ndh gene suite in photoautotrophs like saguaro. Loss of these genes may be a recurring mechanism for overall plastid genome size reduction, especially in combination with loss of the IR.


Subjects
Cactaceae/genetics , Genome, Plastid/genetics , Inverted Repeat Sequences/genetics , NADH Dehydrogenase/genetics , Plastids/genetics , DNA, Plant/chemistry , DNA, Plant/genetics , Evolution, Molecular , Gene Library , High-Throughput Nucleotide Sequencing , Molecular Sequence Annotation , Phylogeny , Plant Proteins/genetics , Sequence Analysis, DNA
19.
GigaByte ; 2024: gigabyte112, 2024.
Article in English | MEDLINE | ID: mdl-38496214

ABSTRACT

This work is an update and extension of the previously published article "Ultralong Oxford Nanopore Reads Enable the Development of a Reference-Grade Perennial Ryegrass Genome Assembly" by Frei et al. The published genome assembly of the doubled haploid perennial ryegrass (Lolium perenne L.) genotype Kyuss (Kyuss v1.0) marked a milestone for forage grass research and breeding. However, order and orientation errors may exist in the pseudo-chromosomes of Kyuss, since barley (Hordeum vulgare L.), which diverged 30 million years ago from perennial ryegrass, was used as the reference to scaffold Kyuss. To correct for structural errors possibly present in the published Kyuss assembly, we de novo assembled the genome again and generated 50-fold coverage high-throughput chromosome conformation capture (Hi-C) data to assist pseudo-chromosome construction. The resulting new chromosome-level assembly Kyuss v2.0 showed improved quality with high contiguity (contig N50 = 120 Mb), high completeness (total BUSCO score = 99%), high base-level accuracy (QV = 50), and correct pseudo-chromosome structure (validated by Hi-C contact map). This new assembly will serve as a better reference genome for Lolium spp. and greatly benefit the forage and turf grass research community.

20.
Ecol Evol ; 14(3): e10979, 2024 Mar.
Article in English | MEDLINE | ID: mdl-38476697

ABSTRACT

The assembly of genomes from pooled samples of genetically heterogeneous conspecifics remains challenging. In this study, we show that high-quality genome assemblies can be produced from samples of multiple wild-caught individuals. We sequenced DNA extracted from a pooled sample of conspecific herbivorous insects (Hemiptera: Miridae: Tupiocoris notatus) acquired from a greenhouse infestation in Tucson, Arizona (in the range of 30-100 individuals; 0.5 mL tissue by volume) using PacBio highly accurate long reads (HiFi). The initial assembly contained multiple haplotigs (>85% BUSCOs duplicated), but duplicate contigs could be easily purged to reveal a highly complete assembly (95.6% BUSCO, 4.4% duplicated) that is highly contiguous by short-read assembly standards (N50 = 675 kb; largest contig = 4.3 Mb). We then used our assembly as the basis for a genome-guided differential expression study of host plant-specific transcriptional responses. We found thousands of genes (N = 4982) to be differentially expressed between our new data from individuals feeding on Datura wrightii (Solanaceae) and existing RNA-seq data from Nicotiana attenuata (Solanaceae)-fed individuals. We identified many of these genes as previously documented detoxification genes such as glutathione-S-transferases, cytochrome P450s, and UDP-glucosyltransferases. Together, our results show that long-read sequencing of pooled samples can provide a cost-effective genome assembly option for small insects and can provide insights into the genetic mechanisms underlying interactions between plants and herbivorous pests.
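
Note on the contiguity metric: the N50 value quoted in the abstract above is conventionally defined as the length of the shortest contig in the smallest set of longest contigs that together cover at least half of the total assembly length. The following is a minimal sketch of that standard definition, not the authors' pipeline; the function name `n50` and the toy contig lengths are illustrative assumptions and do not come from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths (standard definition)."""
    if not lengths:
        raise ValueError("need at least one contig length")
    total = sum(lengths)
    running = 0
    # Walk contigs from longest to shortest and stop once the running sum
    # covers at least half of the total assembly length.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Toy lengths (hypothetical, arbitrary units), not data from the study.
toy_contigs = [100, 80, 60, 40, 20]
print(n50(toy_contigs))  # prints 80: 100 + 80 = 180 covers half of 300
```

Because N50 depends only on the length distribution, it says nothing about correctness, which may be why the abstracts in this list report it alongside BUSCO completeness and duplication rates.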
