Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 33
Filter
1.
Nucleic Acids Res ; 52(D1): D1538-D1547, 2024 Jan 05.
Article in English | MEDLINE | ID: mdl-37986220

ABSTRACT

Plant Reactome (https://plantreactome.gramene.org) is a freely accessible, comprehensive plant pathway knowledgebase. It provides curated reference pathways from rice (Oryza sativa) and gene-orthology-based pathway projections to 129 additional species, spanning single-cell photoautotrophs, non-vascular plants, and higher plants, thus encompassing a wide-ranging taxonomic diversity. Currently, Plant Reactome houses a collection of 339 reference pathways, covering metabolic and transport pathways, hormone signaling, genetic regulations of developmental processes, and intricate transcriptional networks that orchestrate a plant's response to abiotic and biotic stimuli. Beyond being a mere repository, Plant Reactome serves as a dynamic data discovery platform. Users can analyze and visualize omics data, such as gene expression, gene-gene interaction, proteome, and metabolome data, all within the rich context of plant pathways. Plant Reactome is dedicated to fostering data interoperability, upholding global data standards, and embracing the tenets of the Findable, Accessible, Interoperable and Re-usable (FAIR) data policy.


Subject(s)
Knowledge Bases , Metabolic Networks and Pathways , Multiomics , Plants , Metabolic Networks and Pathways/genetics , Plants/genetics , Plants/metabolism , Signal Transduction/genetics , Internet , Databases, Protein
2.
Nucleic Acids Res ; 50(D1): D996-D1003, 2022 01 07.
Article in English | MEDLINE | ID: mdl-34791415

ABSTRACT

Ensembl Genomes (https://www.ensemblgenomes.org) provides access to non-vertebrate genomes and analysis complementing vertebrate resources developed by the Ensembl project (https://www.ensembl.org). The two resources collectively present genome annotation through a consistent set of interfaces spanning the tree of life presenting genome sequence, annotation, variation, transcriptomic data and comparative analysis. Here, we present our largest increase in plant, metazoan and fungal genomes since the project's inception creating one of the world's most comprehensive genomic resources and describe our efforts to reduce genome redundancy in our Bacteria portal. We detail our new efforts in gene annotation, our emerging support for pangenome analysis, our efforts to accelerate data dissemination through the Ensembl Rapid Release resource and our new AlphaFold visualization. Finally, we present details of our future plans including updates on our integration with Ensembl, and how we plan to improve our support for the microbial research community. Software and data are made available without restriction via our website, online tools platform and programmatic interfaces (available under an Apache 2.0 license). Data updates are synchronised with Ensembl's release cycle.


Subject(s)
Databases, Genetic , Genomics , Internet , Software , Animals , Computational Biology , Genome, Bacterial/genetics , Genome, Fungal/genetics , Genome, Plant/genetics , Plants/classification , Plants/genetics , Vertebrates/classification , Vertebrates/genetics
3.
Nucleic Acids Res ; 49(D1): D1452-D1463, 2021 01 08.
Article in English | MEDLINE | ID: mdl-33170273

ABSTRACT

Gramene (http://www.gramene.org), a knowledgebase founded on comparative functional analyses of genomic and pathway data for model plants and major crops, supports agricultural researchers worldwide. The resource is committed to open access and reproducible science based on the FAIR data principles. Since the last NAR update, we made nine releases; doubled the genome portal's content; expanded curated genes, pathways and expression sets; and implemented the Domain Informational Vocabulary Extraction (DIVE) algorithm for extracting gene function information from publications. The current release, #63 (October 2020), hosts 93 reference genomes-over 3.9 million genes in 122 947 families with orthologous and paralogous classifications. Plant Reactome portrays pathway networks using a combination of manual biocuration in rice (320 reference pathways) and orthology-based projections to 106 species. The Reactome platform facilitates comparison between reference and projected pathways, gene expression analyses and overlays of gene-gene interactions. Gramene integrates ontology-based protein structure-function annotation; information on genetic, epigenetic, expression, and phenotypic diversity; and gene functional annotations extracted from plant-focused journals using DIVE. We train plant researchers in biocuration of genes and pathways; host curated maize gene structures as tracks in the maize genome browser; and integrate curated rice genes and pathways in the Plant Reactome.


Subject(s)
Databases, Genetic , Gene Expression Regulation, Plant , Genome, Plant , Genomics/methods , Plant Proteins/genetics , Plants/genetics , Crops, Agricultural , DNA Transposable Elements , Gene Duplication , Gene Ontology , Gene Regulatory Networks , Internet , Knowledge Bases , Metabolic Networks and Pathways , Molecular Sequence Annotation , Oryza/genetics , Oryza/metabolism , Plant Proteins/metabolism , Plants/classification , Plants/metabolism , Polyploidy , Protein Interaction Mapping , Software , Zea mays/genetics , Zea mays/metabolism
4.
Nucleic Acids Res ; 48(D1): D1093-D1103, 2020 01 08.
Article in English | MEDLINE | ID: mdl-31680153

ABSTRACT

Plant Reactome (https://plantreactome.gramene.org) is an open-source, comparative plant pathway knowledgebase of the Gramene project. It uses Oryza sativa (rice) as a reference species for manual curation of pathways and extends pathway knowledge to another 82 plant species via gene-orthology projection using the Reactome data model and framework. It currently hosts 298 reference pathways, including metabolic and transport pathways, transcriptional networks, hormone signaling pathways, and plant developmental processes. In addition to browsing plant pathways, users can upload and analyze their omics data, such as the gene-expression data, and overlay curated or experimental gene-gene interaction data to extend pathway knowledge. The curation team actively engages researchers and students on gene and pathway curation by offering workshops and online tutorials. The Plant Reactome supports, implements and collaborates with the wider community to make data and tools related to genes, genomes, and pathways Findable, Accessible, Interoperable and Re-usable (FAIR).


Subject(s)
Computational Biology/methods , Databases, Genetic , Genomics , Metabolomics , Plants/genetics , Plants/metabolism , Proteomics , Gene Regulatory Networks , Genomics/methods , Humans , Metabolic Networks and Pathways , Metabolomics/methods , Proteomics/methods , Signal Transduction , Web Browser
5.
Nucleic Acids Res ; 48(D1): D689-D695, 2020 01 08.
Article in English | MEDLINE | ID: mdl-31598706

ABSTRACT

Ensembl Genomes (http://www.ensemblgenomes.org) is an integrating resource for genome-scale data from non-vertebrate species, complementing the resources for vertebrate genomics developed in the context of the Ensembl project (http://www.ensembl.org). Together, the two resources provide a consistent set of interfaces to genomic data across the tree of life, including reference genome sequence, gene models, transcriptional data, genetic variation and comparative analysis. Data may be accessed via our website, online tools platform and programmatic interfaces, with updates made four times per year (in synchrony with Ensembl). Here, we provide an overview of Ensembl Genomes, with a focus on recent developments. These include the continued growth, more robust and reproducible sets of orthologues and paralogues, and enriched views of gene expression and gene function in plants. Finally, we report on our continued deeper integration with the Ensembl project, which forms a key part of our future strategy for dealing with the increasing quantity of available genome-scale data across the tree of life.


Subject(s)
Computational Biology/methods , Databases, Genetic , Genetic Variation , Genome, Bacterial , Genome, Fungal , Genome, Plant , Algorithms , Animals , Caenorhabditis elegans/genetics , Genomics , Internet , Molecular Sequence Annotation , Phenotype , Plants/genetics , Reference Values , Software , User-Computer Interface
6.
Nature ; 510(7505): 356-62, 2014 Jun 19.
Article in English | MEDLINE | ID: mdl-24919147

ABSTRACT

Eucalypts are the world's most widely planted hardwood trees. Their outstanding diversity, adaptability and growth have made them a global renewable resource of fibre and energy. We sequenced and assembled >94% of the 640-megabase genome of Eucalyptus grandis. Of 36,376 predicted protein-coding genes, 34% occur in tandem duplications, the largest proportion thus far in plant genomes. Eucalyptus also shows the highest diversity of genes for specialized metabolites such as terpenes that act as chemical defence and provide unique pharmaceutical oils. Genome sequencing of the E. grandis sister species E. globulus and a set of inbred E. grandis tree genomes reveals dynamic genome evolution and hotspots of inbreeding depression. The E. grandis genome is the first reference for the eudicot order Myrtales and is placed here sister to the eurosids. This resource expands our understanding of the unique biology of large woody perennials and provides a powerful tool to accelerate comparative biology, breeding and biotechnology.


Subject(s)
Eucalyptus/genetics , Genome, Plant , Eucalyptus/classification , Evolution, Molecular , Genetic Variation , Inbreeding , Phylogeny
7.
Nucleic Acids Res ; 46(D1): D1181-D1189, 2018 01 04.
Article in English | MEDLINE | ID: mdl-29165610

ABSTRACT

Gramene (http://www.gramene.org) is a knowledgebase for comparative functional analysis in major crops and model plant species. The current release, #54, includes over 1.7 million genes from 44 reference genomes, most of which were organized into 62,367 gene families through orthologous and paralogous gene classification, whole-genome alignments, and synteny. Additional gene annotations include ontology-based protein structure and function; genetic, epigenetic, and phenotypic diversity; and pathway associations. Gramene's Plant Reactome provides a knowledgebase of cellular-level plant pathway networks. Specifically, it uses curated rice reference pathways to derive pathway projections for an additional 66 species based on gene orthology, and facilitates display of gene expression, gene-gene interactions, and user-defined omics data in the context of these pathways. As a community portal, Gramene integrates best-of-class software and infrastructure components including the Ensembl genome browser, Reactome pathway browser, and Expression Atlas widgets, and undergoes periodic data and software upgrades. Via powerful, intuitive search interfaces, users can easily query across various portals and interactively analyze search results by clicking on diverse features such as genomic context, highly augmented gene trees, gene expression anatomograms, associated pathways, and external informatics resources. All data in Gramene are accessible through both visual and programmatic interfaces.


Subject(s)
Databases, Genetic , Gene Expression Regulation, Plant , Genomics/methods , Knowledge Bases , Plants/genetics , Epigenesis, Genetic , Gene Ontology , Genetic Research , Genetic Variation , Genome, Plant , Metabolic Networks and Pathways/genetics , Molecular Sequence Annotation , Plants/metabolism , Software , User-Computer Interface
8.
Nucleic Acids Res ; 45(D1): D1029-D1039, 2017 01 04.
Article in English | MEDLINE | ID: mdl-27799469

ABSTRACT

Plant Reactome (http://plantreactome.gramene.org/) is a free, open-source, curated plant pathway database portal, provided as part of the Gramene project. The database provides intuitive bioinformatics tools for the visualization, analysis and interpretation of pathway knowledge to support genome annotation, genome analysis, modeling, systems biology, basic research and education. Plant Reactome employs the structural framework of a plant cell to show metabolic, transport, genetic, developmental and signaling pathways. We manually curate molecular details of pathways in these domains for reference species Oryza sativa (rice) supported by published literature and annotation of well-characterized genes. Two hundred twenty-two rice pathways, 1025 reactions associated with 1173 proteins, 907 small molecules and 256 literature references have been curated to date. These reference annotations were used to project pathways for 62 model, crop and evolutionarily significant plant species based on gene homology. Database users can search and browse various components of the database, visualize curated baseline expression of pathway-associated genes provided by the Expression Atlas and upload and analyze their Omics datasets. The database also offers data access via Application Programming Interfaces (APIs) and in various standardized pathway formats, such as SBML and BioPAX.


Subject(s)
Computational Biology/methods , Databases, Genetic , Plants/genetics , Plants/metabolism , Search Engine , Genomics/methods , Metabolic Networks and Pathways , Signal Transduction , Systems Biology/methods , User-Computer Interface , Web Browser
9.
Nucleic Acids Res ; 44(D1): D1133-40, 2016 Jan 04.
Article in English | MEDLINE | ID: mdl-26553803

ABSTRACT

Gramene (http://www.gramene.org) is an online resource for comparative functional genomics in crops and model plant species. Its two main frameworks are genomes (collaboration with Ensembl Plants) and pathways (The Plant Reactome and archival BioCyc databases). Since our last NAR update, the database website adopted a new Drupal management platform. The genomes section features 39 fully assembled reference genomes that are integrated using ontology-based annotation and comparative analyses, and accessed through both visual and programmatic interfaces. Additional community data, such as genetic variation, expression and methylation, are also mapped for a subset of genomes. The Plant Reactome pathway portal (http://plantreactome.gramene.org) provides a reference resource for analyzing plant metabolic and regulatory pathways. In addition to ∼ 200 curated rice reference pathways, the portal hosts gene homology-based pathway projections for 33 plant species. Both the genome and pathway browsers interface with the EMBL-EBI's Expression Atlas to enable the projection of baseline and differential expression data from curated expression studies in plants. Gramene's archive website (http://archive.gramene.org) continues to provide previously reported resources on comparative maps, markers and QTL. To further aid our users, we have also introduced a live monthly educational webinar series and a Gramene YouTube channel carrying video tutorials.


Subject(s)
Databases, Genetic , Genome, Plant , Plants/metabolism , Gene Expression , Genetic Variation , Genomics , Internet , Metabolic Networks and Pathways , Molecular Sequence Annotation , Plants/genetics
10.
Nucleic Acids Res ; 42(Database issue): D1193-9, 2014 Jan.
Article in English | MEDLINE | ID: mdl-24217918

ABSTRACT

Gramene (http://www.gramene.org) is a curated online resource for comparative functional genomics in crops and model plant species, currently hosting 27 fully and 10 partially sequenced reference genomes in its build number 38. Its strength derives from the application of a phylogenetic framework for genome comparison and the use of ontologies to integrate structural and functional annotation data. Whole-genome alignments complemented by phylogenetic gene family trees help infer syntenic and orthologous relationships. Genetic variation data, sequences and genome mappings available for 10 species, including Arabidopsis, rice and maize, help infer putative variant effects on genes and transcripts. The pathways section also hosts 10 species-specific metabolic pathways databases developed in-house or by our collaborators using Pathway Tools software, which facilitates searches for pathway, reaction and metabolite annotations, and allows analyses of user-defined expression datasets. Recently, we released a Plant Reactome portal featuring 133 curated rice pathways. This portal will be expanded for Arabidopsis, maize and other plant species. We continue to provide genetic and QTL maps and marker datasets developed by crop researchers. The project provides a unique community platform to support scientific research in plant genomics including studies in evolution, genetics, plant breeding, molecular biology, biochemistry and systems biology.


Subject(s)
Databases, Genetic , Genome, Plant , Genomics , Crops, Agricultural/genetics , Genetic Variation , Internet , Metabolic Networks and Pathways/genetics , Molecular Sequence Annotation , Plants/genetics , Plants/metabolism
11.
New Phytol ; 206(4): 1406-22, 2015 Jun.
Article in English | MEDLINE | ID: mdl-25353719

ABSTRACT

As a step toward functional annotation of genes required for floral initiation and development within the Eucalyptus genome, we used short read sequencing to analyze transcriptomes of floral buds from early and late developmental stages, and compared these with transcriptomes of diverse vegetative tissues, including leaves, roots, and stems. A subset of 4807 genes (13% of protein-coding genes) were differentially expressed between floral buds of either stage and vegetative tissues. A similar proportion of genes were differentially expressed among all tissues. A total of 479 genes were differentially expressed between early and late stages of floral development. Gene function enrichment identified 158 gene ontology classes that were overrepresented in floral tissues, including 'pollen development' and 'aromatic compound biosynthetic process'. At least 40 floral-dominant genes lacked functional annotations and thus may be novel floral transcripts. We analyzed several genes and gene families in depth, including 49 putative biomarkers of floral development, the MADS-box transcription factors, 'S-domain'-receptor-like kinases, and selected gene family members with phosphatidylethanolamine-binding protein domains. Expanded MADS-box gene subfamilies in Eucalyptus grandis included SUPPRESSOR OF OVEREXPRESSION OF CO 1 (SOC1), SEPALLATA (SEP) and SHORT VEGETATIVE PHASE (SVP) Arabidopsis thaliana homologs. These data provide a rich resource for functional and evolutionary analysis of genes controlling eucalypt floral development, and new tools for breeding and biotechnology.


Subject(s)
Eucalyptus/genetics , Flowers/genetics , Gene Expression Regulation, Plant , Transcriptome/genetics , Biomarkers/metabolism , Gene Expression Profiling , Gene Ontology , Genes, Plant , Phylogeny , Plant Proteins/genetics , Plant Proteins/metabolism , Sequence Analysis, RNA
12.
Biomolecules ; 13(9)2023 09 17.
Article in English | MEDLINE | ID: mdl-37759803

ABSTRACT

The availability of multiple sequenced genomes from a single species made it possible to explore intra- and inter-specific genomic comparisons at higher resolution and build clade-specific pan-genomes of several crops. The pan-genomes of crops constructed from various cultivars, accessions, landraces, and wild ancestral species represent a compendium of genes and structural variations and allow researchers to search for the novel genes and alleles that were inadvertently lost in domesticated crops during the historical process of crop domestication or in the process of extensive plant breeding. Fortunately, many valuable genes and alleles associated with desirable traits like disease resistance, abiotic stress tolerance, plant architecture, and nutrition qualities exist in landraces, ancestral species, and crop wild relatives. The novel genes from the wild ancestors and landraces can be introduced back to high-yielding varieties of modern crops by implementing classical plant breeding, genomic selection, and transgenic/gene editing approaches. Thus, pan-genomic represents a great leap in plant research and offers new avenues for targeted breeding to mitigate the impact of global climate change. Here, we summarize the tools used for pan-genome assembly and annotations, web-portals hosting plant pan-genomes, etc. Furthermore, we highlight a few discoveries made in crops using the pan-genomic approach and future potential of this emerging field of study.


Subject(s)
Genome, Plant , Plant Breeding , Genomics , Gene Editing , Domestication , Crops, Agricultural/genetics
13.
Plants (Basel) ; 12(11)2023 May 29.
Article in English | MEDLINE | ID: mdl-37299125

ABSTRACT

Modeling biological processes and genetic-regulatory networks using in silico approaches provides a valuable framework for understanding how genes and associated allelic and genotypic differences result in specific traits. Submergence tolerance is a significant agronomic trait in rice; however, the gene-gene interactions linked with this polygenic trait remain largely unknown. In this study, we constructed a network of 57 transcription factors involved in seed germination and coleoptile elongation under submergence. The gene-gene interactions were based on the co-expression profiles of genes and the presence of transcription factor binding sites in the promoter region of target genes. We also incorporated published experimental evidence, wherever available, to support gene-gene, gene-protein, and protein-protein interactions. The co-expression data were obtained by re-analyzing publicly available transcriptome data from rice. Notably, this network includes OSH1, OSH15, OSH71, Sub1B, ERFs, WRKYs, NACs, ZFP36, TCPs, etc., which play key regulatory roles in seed germination, coleoptile elongation and submergence response, and mediate gravitropic signaling by regulating OsLAZY1 and/or IL2. The network of transcription factors was manually biocurated and submitted to the Plant Reactome Knowledgebase to make it publicly accessible. We expect this work will facilitate the re-analysis/re-use of OMICs data and aid genomics research to accelerate crop improvement.

14.
Front Plant Sci ; 14: 1272966, 2023.
Article in English | MEDLINE | ID: mdl-38162307

ABSTRACT

Chia (Salvia hispanica L.) is one of the most popular nutrition-rich foods and pseudocereal crops of the family Lamiaceae. Chia seeds are a rich source of proteins, polyunsaturated fatty acids (PUFAs), dietary fibers, and antioxidants. In this study, we present the assembly of the chia reference genome, which spans 303.6 Mb and encodes 48,090 annotated protein-coding genes. Our analysis revealed that ~42% of the chia genome harbors repetitive content, and identified ~3 million single nucleotide polymorphisms (SNPs) and 15,380 simple sequence repeat (SSR) marker sites. By investigating the chia transcriptome, we discovered that ~44% of the genes undergo alternative splicing with a higher frequency of intron retention events. Additionally, we identified chia genes associated with important nutrient content and quality traits, such as the biosynthesis of PUFAs and seed mucilage fiber (dietary fiber) polysaccharides. Notably, this is the first report of in-silico annotation of a plant genome for protein-derived small bioactive peptides (biopeptides) associated with improving human health. To facilitate further research and translational applications of this valuable orphan crop, we have developed the Salvia genomics database (SalviaGDB), accessible at https://salviagdb.org.

15.
Database (Oxford) ; 20232023 Dec 11.
Article in English | MEDLINE | ID: mdl-38079567

ABSTRACT

Large-scale genotype and phenotype data have been increasingly generated to identify genetic markers, understand gene function and evolution and facilitate genomic selection. These datasets hold immense value for both current and future studies, as they are vital for crop breeding, yield improvement and overall agricultural sustainability. However, integrating these datasets from heterogeneous sources presents significant challenges and hinders their effective utilization. We established the Genotype-Phenotype Working Group in November 2021 as a part of the AgBioData Consortium (https://www.agbiodata.org) to review current data types and resources that support archiving, analysis and visualization of genotype and phenotype data to understand the needs and challenges of the plant genomic research community. For 2021-22, we identified different types of datasets and examined metadata annotations related to experimental design/methods/sample collection, etc. Furthermore, we thoroughly reviewed publicly funded repositories for raw and processed data as well as secondary databases and knowledgebases that enable the integration of heterogeneous data in the context of the genome browser, pathway networks and tissue-specific gene expression. Based on our survey, we recommend a need for (i) additional infrastructural support for archiving many new data types, (ii) development of community standards for data annotation and formatting, (iii) resources for biocuration and (iv) analysis and visualization tools to connect genotype data with phenotype data to enhance knowledge synthesis and to foster translational research. Although this paper only covers the data and resources relevant to the plant research community, we expect that similar issues and needs are shared by researchers working on animals. Database URL: https://www.agbiodata.org.


Subject(s)
Big Data , Databases, Genetic , Genotype , Phenotype , Plant Breeding
16.
Methods Mol Biol ; 2443: 511-525, 2022.
Article in English | MEDLINE | ID: mdl-35037224

ABSTRACT

Plant Reactome (https://plantreactome.gramene.org) and PubChem ( https://pubchem.ncbi.nlm.nih.gov ) are two reference data portals and resources for curated plant pathways, small molecules, metabolites, gene products, and macromolecular interactions. Plant Reactome knowledgebase, a conceptual plant pathway network, is built by biocuration and integrating (bio)chemical entities, gene products, and macromolecular interactions. It provides manually curated pathways for the reference species Oryza sativa (rice) and gene orthology-based projections that extend pathway knowledge to 106 plant species. Currently, it hosts 320 reference pathways for plant metabolism, hormone signaling, transport, genetic regulation, plant organ development and differentiation, and biotic and abiotic stress responses. In addition to the pathway browsing and search functions, the Plant Reactome provides the analysis tools for pathway comparison between reference and projected species, pathway enrichment in gene expression data, and overlay of gene-gene interaction data on pathways. PubChem, a popular reference database of (bio)chemical entities, provides information on small molecules and other types of chemical entities, such as siRNAs, miRNAs, lipids, carbohydrates, and chemically modified nucleotides. The data in PubChem is collected from hundreds of data sources, including Plant Reactome. This chapter provides a brief overview of the Plant Reactome and the PubChem knowledgebases, their association to other public resources providing accessory information, and how users can readily access the contents.


Subject(s)
Knowledge Bases , Metabolic Networks and Pathways , Databases, Factual , Plants/genetics , Plants/metabolism , Proteins/metabolism
17.
PeerJ ; 9: e11052, 2021.
Article in English | MEDLINE | ID: mdl-33777532

ABSTRACT

The S-domain subfamily of receptor-like kinases (SDRLKs) in plants is poorly characterized. Most members of this subfamily are currently assigned gene function based on the S-locus Receptor Kinase from Brassica that acts as the female determinant of self-incompatibility (SI). However, Brassica like SI mechanisms does not exist in most plants. Thus, automated Gene Ontology (GO) pipelines are not sufficient for functional annotation of SDRLK subfamily members and lead to erroneous association with the GO biological process of SI. Here, we show that manual bio-curation can help to correct and improve the gene annotations and association with relevant biological processes. Using publicly available genomic and transcriptome datasets, we conducted a detailed analysis of the expansion of the rice (Oryza sativa) SDRLK subfamily, the structure of individual genes and proteins, and their expression.The 144-member SDRLK family in rice consists of 82 receptor-like kinases (RLKs) (67 full-length, 15 truncated),12 receptor-like proteins, 14 SD kinases, 26 kinase-like and 10 GnK2 domain-containing kinases and RLKs. Except for nine genes, all other SDRLK family members are transcribed in rice, but they vary in their tissue-specific and stress-response expression profiles. Furthermore, 98 genes show differential expression under biotic stress and 98 genes show differential expression under abiotic stress conditions, but share 81 genes in common.Our analysis led to the identification of candidate genes likely to play important roles in plant development, pathogen resistance, and abiotic stress tolerance. We propose a nomenclature for 144 SDRLK gene family members based on gene/protein conserved structural features, gene expression profiles, and literature review. Our biocuration approach, rooted in the principles of findability, accessibility, interoperability and reusability, sets forth an example of how manual annotation of large-gene families can fill in the knowledge gap that exists due to the implementation of automated GO projections, thereby helping to improve the quality and contents of public databases.

18.
J Plant Physiol ; 266: 153531, 2021 Nov.
Article in English | MEDLINE | ID: mdl-34601337

ABSTRACT

Lectins are ubiquitous proteins that reversibly bind to specific carbohydrates and, thus, serve as readers of the sugar code. In photosynthetic organisms, lectin family proteins play important roles in capturing and releasing photosynthates via an endogenous lectin cycle. Often, lectin proteins consist of one or more lectin domains in combination with other types of domains. This structural diversity of lectins is the basis for their current classification, which is consistent with their diverse functions in cell signaling associated with growth and development, as well as in the plant's response to biotic, symbiotic, and abiotic stimuli. Furthermore, the lectin family shows evolutionary expansion that has distinct clade-specific signatures. Although the function(s) of many plant lectin family genes are unknown, studies in the model plant Arabidopsis thaliana have provided insights into their diverse roles. Here, we have used a biocuration approach rooted in the critical review of scientific literature and information available in the public genomic databases to summarize the expression, localization, and known functions of lectins in Arabidopsis. A better understanding of the structure and function of lectins is expected to aid in improving agricultural productivity through the manipulation of candidate genes for breeding climate-resilient crops, or by regulating metabolic pathways by applications of plant growth regulators.


Subject(s)
Arabidopsis , Carbohydrates , Plant Lectins , Arabidopsis/genetics , Crops, Agricultural , Plant Breeding
19.
Front Plant Sci ; 12: 667678, 2021.
Article in English | MEDLINE | ID: mdl-34354718

ABSTRACT

Chia (Salvia hispanica L.), now a popular superfood and a pseudocereal, is one of the richest sources of dietary nutrients such as protein, fiber, and polyunsaturated fatty acids (PUFAs). At present, the genomic and genetic information available in the public domain for this crop are scanty, which hinders an understanding of its growth and development and genetic improvement. We report an RNA-sequencing (RNA-Seq)-based comprehensive transcriptome atlas of Chia sampled from 13 tissue types covering vegetative and reproductive growth stages. We used ~355 million high-quality reads of total ~394 million raw reads from transcriptome sequencing to generate de novo reference transcriptome assembly and the tissue-specific transcript assemblies. After the quality assessment of the merged assemblies and implementing redundancy reduction methods, 82,663 reference transcripts were identified. About 65,587 of 82,663 transcripts were translated into 99,307 peptides, and we were successful in assigning InterPro annotations to 45,209 peptides and gene ontology (GO) terms to 32,638 peptides. The assembled transcriptome is estimated to have the complete sequence information for ~86% of the genes found in the Chia genome. Furthermore, the analysis of 53,200 differentially expressed transcripts (DETs) revealed their distinct expression patterns in Chia's vegetative and reproductive tissues; tissue-specific networks and developmental stage-specific networks of transcription factors (TFs); and the regulation of the expression of enzyme-coding genes associated with important metabolic pathways. In addition, we identified 2,411 simple sequence repeats (SSRs) as potential genetic markers from the transcripts. Overall, this study provides a comprehensive transcriptome atlas, and SSRs, contributing to building essential genomic resources to support basic research, genome annotation, functional genomics, and molecular breeding of Chia.

20.
Database (Oxford) ; 20192019 01 01.
Article in English | MEDLINE | ID: mdl-30649295

ABSTRACT

Biocuration plays a crucial role in building databases and complex systems-level platforms required for processing, annotating and analyzing 'Big Data' in biology. However, biocuration efforts cannot keep pace with a dramatic increase in the production of omics data; this presents one of the bottlenecks in genomics. In two pathway curation jamborees, Plant Reactome curators tested strategies for introducing researchers to pathway curation tools, harnessing biologists' expertise in curating plant pathways and developing a network of community biocurators. We summarize the strategy, workflow and outcomes of these exercises, and discuss the role of community biocuration in advancing databases and genomic resources.


Subject(s)
Data Curation/methods , Databases, Genetic , Gene Regulatory Networks/genetics , Genomics/methods , Big Data , Data Mining , Genes, Plant/genetics , Workflow
SELECTION OF CITATIONS
SEARCH DETAIL