Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 62
Filtrar
Mais filtros











Base de dados
Intervalo de ano de publicação
1.
Mol Biol Evol ; 41(9)2024 Sep 04.
Artigo em Inglês | MEDLINE | ID: mdl-39302634

RESUMO

During the meiosis of many eukaryote species, crossovers tend to occur within narrow regions called recombination hotspots. In plants, it is generally thought that gene regulatory sequences, especially promoters and 5' to 3' untranslated regions, are enriched in hotspots, but this has been characterized in a handful of species only. We also lack a clear description of fine-scale variation in recombination rates within genic regions and little is known about hotspot position and intensity in plants. To address this question, we constructed fine-scale recombination maps from genetic polymorphism data and inferred recombination hotspots in 11 plant species. We detected gradients of recombination in genic regions in most species, yet gradients varied in intensity and shape depending on specific hotspot locations and gene structure. To further characterize recombination gradients, we decomposed them according to gene structure by rank and number of exons. We generalized the previously observed pattern that recombination hotspots are organized around the boundaries of coding sequences, especially 5' promoters. However, our results also provided new insight into the relative importance of the 3' end of genes in some species and the possible location of hotspots away from genic regions in some species. Variation among species seemed driven more by hotspot location among and within genes than by differences in size or intensity among species. Our results shed light on the variation in recombination rates at a very fine scale, revealing the diversity and complexity of genic recombination gradients emerging from the interaction between hotspot location and gene structure.


Assuntos
Genoma de Planta , Recombinação Genética , Plantas/genética , Regiões Promotoras Genéticas , Polimorfismo Genético , Meiose/genética
2.
G3 (Bethesda) ; 2024 Sep 17.
Artigo em Inglês | MEDLINE | ID: mdl-39288023

RESUMO

Genome sequencing for agriculturally important Rosaceous crops has made rapid progress both in completeness and annotation quality. Whole genome sequence and annotation gives breeders, researchers, and growers information about cultivar specific traits such as fruit quality and disease resistance, and informs strategies to enhance postharvest storage. Here we present a haplotype-phased, chromosomal level genome of Malus domestica, 'WA 38', a new apple cultivar released to market in 2017 as Cosmic Crisp®. Using both short and long read sequencing data with a k-mer based approach, chromosomes originating from each parent were assembled and segregated. This is the first pome fruit genome fully phased into parental haplotypes in which chromosomes from each parent are identified and separated into their unique, respective haplomes. The two haplome assemblies, 'Honeycrisp' originated HapA and 'Enterprise' originated HapB, are about 650 Megabases each, and both have a BUSCO score of 98.7% complete. A total of 53,028 and 54,235 genes were annotated from HapA and HapB, respectively. Additionally, we provide genome-scale comparisons to 'Gala', 'Honeycrisp', and other relevant cultivars highlighting major differences in genome structure and gene family circumscription. This assembly and annotation was done in collaboration with the American Campus Tree Genomes project that includes 'WA 38' (Washington State University), 'd'Anjou' pear (Auburn University), and many more. To ensure transparency, reproducibility, and applicability for any genome project, our genome assembly and annotation workflow is recorded in detail and shared under a public GitLab repository. All software is containerized, offering a simple implementation of the workflow.

3.
Proc Natl Acad Sci U S A ; 121(40): e2402781121, 2024 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-39312655

RESUMO

While considerable knowledge exists about the enzymes pivotal for C4 photosynthesis, much less is known about the cis-regulation important for specifying their expression in distinct cell types. Here, we use single-cell-indexed ATAC-seq to identify cell-type-specific accessible chromatin regions (ACRs) associated with C4 enzymes for five different grass species. This study spans four C4 species, covering three distinct photosynthetic subtypes: Zea mays and Sorghum bicolor (NADP-dependent malic enzyme), Panicum miliaceum (NAD-dependent malic enzyme), Urochloa fusca (phosphoenolpyruvate carboxykinase), along with the C3 outgroup Oryza sativa. We studied the cis-regulatory landscape of enzymes essential across all C4 species and those unique to C4 subtypes, measuring cell-type-specific biases for C4 enzymes using chromatin accessibility data. Integrating these data with phylogenetics revealed diverse co-option of gene family members between species, showcasing the various paths of C4 evolution. Besides promoter proximal ACRs, we found that, on average, C4 genes have two to three distal cell-type-specific ACRs, highlighting the complexity and divergent nature of C4 evolution. Examining the evolutionary history of these cell-type-specific ACRs revealed a spectrum of conserved and novel ACRs, even among closely related species, indicating ongoing evolution of cis-regulation at these C4 loci. This study illuminates the dynamic and complex nature of cis-regulatory elements evolution in C4 photosynthesis, particularly highlighting the intricate cis-regulatory evolution of key loci. Our findings offer a valuable resource for future investigations, potentially aiding in the optimization of C3 crop performance under changing climatic conditions.


Assuntos
Regulação da Expressão Gênica de Plantas , Fotossíntese , Poaceae , Fotossíntese/genética , Poaceae/genética , Poaceae/metabolismo , Análise de Célula Única/métodos , Cromatina/metabolismo , Cromatina/genética , Oryza/genética , Oryza/metabolismo , Filogenia , Zea mays/genética , Zea mays/metabolismo , Malato Desidrogenase/metabolismo , Malato Desidrogenase/genética , Proteínas de Plantas/genética , Proteínas de Plantas/metabolismo , Sorghum/genética , Sorghum/metabolismo
4.
Front Plant Sci ; 15: 1426035, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38899156

RESUMO

[This corrects the article DOI: 10.3389/fpls.2024.1328966.].

5.
Sci China Life Sci ; 67(8): 1579-1590, 2024 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-38676814

RESUMO

Plant genomics and crop breeding are at the intersection of biotechnology and information technology. Driven by a combination of high-throughput sequencing, molecular biology and data science, great advances have been made in omics technologies at every step along the central dogma, especially in genome assembling, genome annotation, epigenomic profiling, and transcriptome profiling. These advances further revolutionized three directions of development. One is genetic dissection of complex traits in crops, along with genomic prediction and selection. The second is comparative genomics and evolution, which open up new opportunities to depict the evolutionary constraints of biological sequences for deleterious variant discovery. The third direction is the development of deep learning approaches for the rational design of biological sequences, especially proteins, for synthetic biology. All three directions of development serve as the foundation for a new era of crop breeding where agronomic traits are enhanced by genome design.


Assuntos
Genômica , Plantas , Software , Plantas/genética , Cruzamento , Genoma de Planta , Mapeamento Cromossômico , Proteínas de Plantas/química , Proteínas de Plantas/genética , Ciência de Dados
6.
aBIOTECH ; 5(1): 94-106, 2024 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-38576435

RESUMO

Genomic data serve as an invaluable resource for unraveling the intricacies of the higher plant systems, including the constituent elements within and among species. Through various efforts in genomic data archiving, integrative analysis and value-added curation, the National Genomics Data Center (NGDC), which is a part of the China National Center for Bioinformation (CNCB), has successfully established and currently maintains a vast amount of database resources. This dedicated initiative of the NGDC facilitates a data-rich ecosystem that greatly strengthens and supports genomic research efforts. Here, we present a comprehensive overview of central repositories dedicated to archiving, presenting, and sharing plant omics data, introduce knowledgebases focused on variants or gene-based functional insights, highlight species-specific multiple omics database resources, and briefly review the online application tools. We intend that this review can be used as a guide map for plant researchers wishing to select effective data resources from the NGDC for their specific areas of study. Supplementary Information: The online version contains supplementary material available at 10.1007/s42994-023-00134-4.

7.
Plants (Basel) ; 13(5)2024 Mar 04.
Artigo em Inglês | MEDLINE | ID: mdl-38475564

RESUMO

This comprehensive article critically analyzes the advanced biotechnological strategies to mitigate plant drought stress. It encompasses an in-depth exploration of the latest developments in plant genomics, proteomics, and metabolomics, shedding light on the complex molecular mechanisms that plants employ to combat drought stress. The study also emphasizes the significant advancements in genetic engineering techniques, particularly CRISPR-Cas9 genome editing, which have revolutionized the creation of drought-resistant crop varieties. Furthermore, the article explores microbial biotechnology's pivotal role, such as plant growth-promoting rhizobacteria (PGPR) and mycorrhizae, in enhancing plant resilience against drought conditions. The integration of these cutting-edge biotechnological interventions with traditional breeding methods is presented as a holistic approach for fortifying crops against drought stress. This integration addresses immediate agricultural needs and contributes significantly to sustainable agriculture, ensuring food security in the face of escalating climate change challenges.

8.
Front Plant Sci ; 15: 1352040, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38469329

RESUMO

Abiotic stresses are major constraints in crop production, and are accountable for more than half of the total crop loss. Plants overcome these environmental stresses using coordinated activities of transcription factors and phytohormones. Pearl millet an important C4 cereal plant having high nutritional value and climate resilient features is grown in marginal lands of Africa and South-East Asia including India. Among several transcription factors, the basic leucine zipper (bZIP) is an important TF family associated with diverse biological functions in plants. In this study, we have identified 98 bZIP family members (PgbZIP) in pearl millet. Phylogenetic analysis divided these PgbZIP genes into twelve groups (A-I, S, U and X). Motif analysis has shown that all the PgbZIP proteins possess conserved bZIP domains and the exon-intron organization revealed conserved structural features among the identified genes. Cis-element analysis, RNA-seq data analysis, and real-time expression analysis of PgbZIP genes suggested the potential role of selected PgbZIP genes in growth/development and abiotic stress responses in pearl millet. Expression profiling of selected PgbZIPs under various phytohormones (ABA, SA and MeJA) treatment showed differential expression patterns of PgbZIP genes. Further, PgbZIP9, a homolog of AtABI5 was found to localize in the nucleus and modulate gene expression in pearl millet under stresses. Our present findings provide a better understanding of bZIP genes in pearl millet and lay a good foundation for the further functional characterization of multi-stress tolerant PgbZIP genes, which could become efficient tools for crop improvement.

9.
Front Plant Sci ; 15: 1328966, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38550287

RESUMO

Extensive research has focused on exploring the range of genome sizes in eukaryotes, with a particular emphasis on land plants, where significant variability has been observed. Accurate estimation of genome size is essential for various research purposes, but existing sequence-based methods have limitations, particularly for low-coverage datasets. In this study, we introduce LocoGSE, a novel genome size estimator designed specifically for low-coverage datasets generated by genome skimming approaches. LocoGSE relies on mapping the reads on single copy consensus proteins without the need for a reference genome assembly. We calibrated LocoGSE using 430 low-coverage Angiosperm genome skimming datasets and compared its performance against other estimators. Our results demonstrate that LocoGSE accurately predicts monoploid genome size even at very low depth of coverage (<1X) and on highly heterozygous samples. Additionally, LocoGSE provides stable estimates across individuals with varying ploidy levels. LocoGSE fills a gap in sequence-based plant genome size estimation by offering a user-friendly and reliable tool that does not rely on high coverage or reference assemblies. We anticipate that LocoGSE will facilitate plant genome size analysis and contribute to evolutionary and ecological studies in the field. Furthermore, at the cost of an initial calibration, LocoGSE can be used in other lineages.

10.
Genetics ; 227(1)2024 05 07.
Artigo em Inglês | MEDLINE | ID: mdl-38457127

RESUMO

Since 1999, The Arabidopsis Information Resource (www.arabidopsis.org) has been curating data about the Arabidopsis thaliana genome. Its primary focus is integrating experimental gene function information from the peer-reviewed literature and codifying it as controlled vocabulary annotations. Our goal is to produce a "gold standard" functional annotation set that reflects the current state of knowledge about the Arabidopsis genome. At the same time, the resource serves as a nexus for community-based collaborations aimed at improving data quality, access, and reuse. For the past decade, our work has been made possible by subscriptions from our global user base. This update covers our ongoing biocuration work, some of our modernization efforts that contribute to the first major infrastructure overhaul since 2011, the introduction of JBrowse2, and the resource's role in community activities such as organizing the structural reannotation of the genome. For gene function assessment, we used gene ontology annotations as a metric to evaluate: (1) what is currently known about Arabidopsis gene function and (2) the set of "unknown" genes. Currently, 74% of the proteome has been annotated to at least one gene ontology term. Of those loci, half have experimental support for at least one of the following aspects: molecular function, biological process, or cellular component. Our work sheds light on the genes for which we have not yet identified any published experimental data and have no functional annotation. Drawing attention to these unknown genes highlights knowledge gaps and potential sources of novel discoveries.


Assuntos
Arabidopsis , Bases de Dados Genéticas , Anotação de Sequência Molecular , Arabidopsis/genética , Genoma de Planta , Ontologia Genética , Proteínas de Arabidopsis/genética , Proteínas de Arabidopsis/metabolismo
13.
Methods Mol Biol ; 2703: 3-22, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37646933

RESUMO

The FAIR data principle as a commitment to support long-term research data management is widely accepted in the scientific community. However, although many established infrastructures provide comprehensive and long-term stable services and platforms, a large quantity of research data is still hidden. Currently, high-throughput plant genomics and phenomics technologies are producing research data in abundance, the storage of which is not covered by established core databases. This concerns the data volume, for example, time series of images or high-resolution hyperspectral data; the quality of data formatting and annotation, e.g., with regard to structure and annotation specifications of core databases; uncovered data domains; or organizational constraints prohibiting primary data storage outside institutional boundaries. To share these potentially dark data in a FAIR way and master these challenges the ELIXIR Germany/de.NBI service Plant Genomic and Phenomics Research Data Repository (PGP) implements an on-premise approach, which allows research data to be kept in place and wrapped in FAIR-aware software infrastructure. In this chapter, the e!DAL infrastructure software and the PGP repository are presented as best practice on how to easily setup FAIR-compliant and intuitive research data services.


Assuntos
Genômica , Fenômica , Gerenciamento de Dados , Bases de Dados Factuais , Alemanha
14.
Phytopathology ; 113(9): 1661-1676, 2023 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-37486077

RESUMO

Plant viruses infect a wide range of commercially important crop plants and cause significant crop production losses worldwide. Numerous alterations in plant physiology related to the reprogramming of gene expression may result from viral infections. Although conventional integrated pest management-based strategies have been effective in reducing the impact of several viral diseases, continued emergence of new viruses and strains, expanding host ranges, and emergence of resistance-breaking strains necessitate a sustained effort toward the development and application of new approaches for virus management that would complement existing tactics. RNA interference-based techniques, and more recently, clustered regularly interspaced short palindromic repeats (CRISPR)-based genome editing technologies have paved the way for precise targeting of viral transcripts and manipulation of viral genomes and host factors. In-depth knowledge of the molecular mechanisms underlying the development of disease would further expand the applicability of these recent methods. Advances in next-generation/high-throughput sequencing have made possible more intensive studies into host-virus interactions. Utilizing the omics data and its application has the potential to expedite fast-tracking traditional plant breeding methods, as well as applying modern molecular tools for trait enhancement, including virus resistance. Here, we summarize the recent developments in the CRISPR/Cas system, transcriptomics, endogenous RNA interference, and exogenous application of dsRNA in virus disease management.


Assuntos
Vírus de Plantas , Viroses , Sistemas CRISPR-Cas , Interferência de RNA , Multiômica , Doenças das Plantas , Melhoramento Vegetal , Plantas/genética , Vírus de Plantas/genética , Viroses/genética , Gerenciamento Clínico , Genoma de Planta
16.
BMC Plant Biol ; 23(1): 59, 2023 Jan 28.
Artigo em Inglês | MEDLINE | ID: mdl-36707785

RESUMO

BACKGROUND: Massive parallel sequencing technologies have enabled the elucidation of plant phylogenetic relationships from chloroplast genomes at a high pace. These include members of the family Rhamnaceae. The current Rhamnaceae phylogenetic tree is from 13 out of 24 Rhamnaceae chloroplast genomes, and only one chloroplast genome of the genus Ventilago is available. Hence, the phylogenetic relationships in Rhamnaceae remain incomplete, and more representative species are needed. RESULTS: The complete chloroplast genome of Ventilago harmandiana Pierre was outlined using a hybrid assembly of long- and short-read technologies. The accuracy and validity of the final genome were confirmed with PCR amplifications and investigation of coverage depth. Sanger sequencing was used to correct for differences in lengths and nucleotide bases between inverted repeats because of the homopolymers. The phylogenetic trees reconstructed using prevalent methods for phylogenetic inference were topologically similar. The clustering based on codon usage was congruent with the molecular phylogenetic tree. The groups of genera in each tribe were in accordance with tribal classification based on molecular markers. We resolved the phylogenetic relationships among six Hovenia species, three Rhamnus species, and two Ventilago species. Our reconstructed tree provides the most complete and reliable low-level taxonomy to date for the family Rhamnaceae. Similar to other higher plants, the RNA editing mostly resulted in converting serine to leucine. Besides, most genes were subjected to purifying selection. Annotation anomalies, including indel calling errors, unaligned open reading frames of the same gene, inconsistent prediction of intergenic regions, and misannotated genes, were identified in the published chloroplast genomes used in this study. These could be a result of the usual imperfections in computational tools, and/or existing errors in reference genomes. Importantly, these are points of concern with regards to utilizing published chloroplast genomes for comparative genomic analysis. CONCLUSIONS: In summary, we successfully demonstrated the use of comprehensive genomic data, including DNA and amino acid sequences, to build a reliable and high-resolution phylogenetic tree for the family Rhamnaceae. Additionally, our study indicates that the revision of genome annotation before comparative genomic analyses is necessary to prevent the propagation of errors and complications in downstream analysis and interpretation.


Assuntos
Genoma de Cloroplastos , Rhamnaceae , Genoma de Cloroplastos/genética , Rhamnaceae/genética , Filogenia , Genômica/métodos , Cloroplastos/genética
17.
Genomics ; 115(2): 110568, 2023 03.
Artigo em Inglês | MEDLINE | ID: mdl-36702293

RESUMO

It has recently been shown that structural variants (SV) can have a higher impact on gene expression variation compared to single nucleotide variants (SNV) in different plant species. Additionally, SV were associated with phenotypic variation in several crops. However, compared to the established SV detection based on short-read sequencing, less approaches were described for linked-read based SV calling. We therefore evaluated the performance of six linked-read SV callers compared to an established short-read SV caller based on simulated linked-reads in tetraploid potato. The objectives of our study were to i) compare the performance of SV callers based on linked-read sequencing to short-read sequencing, ii) examine the influence of SV type, SV length, haplotype incidence (HI), as well as sequencing coverage on the SV calling performance in the tetraploid potato genome, and iii) evaluate the accuracy of detecting insertions by linked-read compared to short-read sequencing. We observed high break point resolutions (BPR) detecting short SV and slightly lower BPR for large SV. Our observations highlighted the importance of short-read signals provided by Manta and LinkedSV to detect short SV. Manta and NAIBR performed well for detecting larger deletions, inversions, and duplications. Detected large SV were weakly influenced by the HI. Furthermore, we illustrated that large insertions can be assembled by Novel-X. Our results suggest the usage of the short-read and linked-read SV callers Manta, NAIBR, LinkedSV, and Novel-X based on at least 90x linked-read sequencing coverage to ensure the detection of a broad range of SV in the tetraploid potato genome.


Assuntos
Solanum tuberosum , Análise de Sequência de DNA/métodos , Solanum tuberosum/genética , Benchmarking , Tetraploidia , Genoma , Sequenciamento de Nucleotídeos em Larga Escala
18.
Front Plant Sci ; 13: 891155, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35874023

RESUMO

Bacteria communities associated with plants have been given increasing consideration because they are arguably beneficial to their host plants. To understand the ecological and evolutionary impact of these mutualistic associations, it is important to explore the vast unknown territory of bacterial genomic diversity and their functional contributions associated with the major branches of the tree-of-life. Arguably, this aim can be achieved by profiling bacterial communities by applying high throughput sequencing approaches, besides establishing model plant organisms to test key predictions. This study utilized the Illumina Miseq reads of bacterial 16S rRNA sequences to determine the bacterial diversity associated with the endosphere of the leaves of the highly specialized rock spleenwort Asplenium delavayi (Aspleniaceae). By documenting the bacterial communities associated with ferns collected in natural occurrence and cultivation, this study discovered the most species-rich bacterial communities associated with terrestrial ferns reported until now. Despite the substantial variations of species diversity and composition among accessions, a set of 28 bacterial OTUs was found to be shared among all accessions. Functional analyses recovered evidence to support the predictions that changes in bacterial community compositions correspond to functional differentiation. Given the ease of cultivating this species, Asplenium delavayi is introduced here as a model organism to explore the ecological and evolutionary benefits created by mutualistic associations between bacteria and ferns.

19.
Methods Mol Biol ; 2443: 197-209, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35037207

RESUMO

SciApps is an open-source, web-based platform for processing, storing, visualizing, and distributing genomic data and analysis results. Built upon the Tapis (formerly Agave) platform, SciApps brings users TB-scale of data storage via CyVerse Data Store and over one million CPUs via the Extreme Science and Engineering Discovery Environment (XSEDE) resources at Texas Advanced Computing Center (TACC). SciApps provides users ways to chain individual jobs into automated and reproducible workflows in a distributed cloud and provides a management system for data, associated metadata, individual analysis jobs, and multi-step workflows. This chapter provides examples of how to (1) submitting, managing, constructing workflows, (2) using public workflows for Bulked Segregant Analysis (BSA), (3) constructing a Data Analysis Center (DAC), and Data Coordination Center (DCC) for the plant ENCODE project.


Assuntos
Genômica , Software , Biologia Computacional , Genoma de Planta , Genômica/métodos , Armazenamento e Recuperação da Informação , Fluxo de Trabalho
20.
Methods Mol Biol ; 2443: 245-257, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35037210

RESUMO

Optical mapping plays an important role in plant genomics, particularly in plant genome assembly and large-scale structural variation detection. While DNA sequencing provides base-by-base nucleotide information, optical mapping shows the physical locations of selected enzyme restriction sites in a genome. The long single-molecule maps produced by optical mapping make it a useful auxiliary technique to DNA sequencing, which generally cannot span large and complex genomic regions. Although optical mapping, therefore, offers unique advantages to researchers, there are few dedicated tools to assist in optical mapping analyses. In this chapter, we present runBNG2, a successor of runBNG to help optical-mapping data analysis for diverse datasets.


Assuntos
Genoma de Planta , Genômica , Genômica/métodos , Plantas/genética , Mapeamento por Restrição , Análise de Sequência de DNA
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA