Your browser doesn't support javascript.
loading
: 20 | 50 | 100
1 - 20 de 23
1.
Genetics ; 220(4)2022 04 04.
Article En | MEDLINE | ID: mdl-35134929

WormBase (www.wormbase.org) is the central repository for the genetics and genomics of the nematode Caenorhabditis elegans. We provide the research community with data and tools to facilitate the use of C. elegans and related nematodes as model organisms for studying human health, development, and many aspects of fundamental biology. Throughout our 22-year history, we have continued to evolve to reflect progress and innovation in the science and technologies involved in the study of C. elegans. We strive to incorporate new data types and richer data sets, and to provide integrated displays and services that avail the knowledge generated by the published nematode genetics literature. Here, we provide a broad overview of the current state of WormBase in terms of data type, curation workflows, analysis, and tools, including exciting new advances for analysis of single-cell data, text mining and visualization, and the new community collaboration forum. Concurrently, we continue the integration and harmonization of infrastructure, processes, and tools with the Alliance of Genome Resources, of which WormBase is a founding member.


Caenorhabditis , Nematoda , Animals , Caenorhabditis/genetics , Caenorhabditis elegans/genetics , Databases, Genetic , Genome , Genomics , Humans , Nematoda/genetics
2.
Nucleic Acids Res ; 48(D1): D762-D767, 2020 01 08.
Article En | MEDLINE | ID: mdl-31642470

WormBase (https://wormbase.org/) is a mature Model Organism Information Resource supporting researchers using the nematode Caenorhabditis elegans as a model system for studies across a broad range of basic biological processes. Toward this mission, WormBase efforts are arranged in three primary facets: curation, user interface and architecture. In this update, we describe progress in each of these three areas. In particular, we discuss the status of literature curation and recently added data, detail new features of the web interface and options for users wishing to conduct data mining workflows, and discuss our efforts to build a robust and scalable architecture by leveraging commercial cloud offerings. We conclude with a description of WormBase's role as a founding member of the nascent Alliance of Genome Resources.


Caenorhabditis elegans/genetics , Databases, Genetic , Genes, Helminth , Animals , Data Mining , Genomics , Internet , User-Computer Interface
3.
Database (Oxford) ; 20182018 01 01.
Article En | MEDLINE | ID: mdl-29688367

Large volumes of data generated by research laboratories coupled with the required effort and cost of curation present a significant barrier to inclusion of these data in authoritative community databases. Further, many publicly funded experimental observations remain invisible to curation simply because they are never published: results often do not fit within the scope of a standard publication; trainee-generated data are forgotten when the experimenter (e.g. student, post-doc) leaves the lab; results are omitted from science narratives due to publication bias where certain results are considered irrelevant for the publication. While authors are in the best position to curate their own data, they face a steep learning curve to ensure that appropriate referential tags, metadata, and ontologies are applied correctly to their observations, a task sometimes considered beyond the scope of their research and other numerous responsibilities. Getting researchers to adopt a new system of data reporting and curation requires a fundamental change in behavior among all members of the research community. To solve these challenges, we have created a novel scholarly communication platform that captures data from researchers and directly delivers them to information resources via Micropublication. This platform incentivizes authors to publish their unpublished observations along with associated metadata by providing a deliberately fast and lightweight but still peer-reviewed process that results in a citable publication. Our long-term goal is to develop a data ecosystem that improves reproducibility and accountability of publicly funded research and in turn accelerates both basic and translational discovery. Database URL: www.micropublication.org.


Biomedical Research , Data Curation/methods , Databases, Factual , Humans , Periodicals as Topic
4.
Nucleic Acids Res ; 46(D1): D869-D874, 2018 01 04.
Article En | MEDLINE | ID: mdl-29069413

WormBase (http://www.wormbase.org) is an important knowledge resource for biomedical researchers worldwide. To accommodate the ever increasing amount and complexity of research data, WormBase continues to advance its practices on data acquisition, curation and retrieval to most effectively deliver comprehensive knowledge about Caenorhabditis elegans, and genomic information about other nematodes and parasitic flatworms. Recent notable enhancements include user-directed submission of data, such as micropublication; genomic data curation and presentation, including additional genomes and JBrowse, respectively; new query tools, such as SimpleMine, Gene Enrichment Analysis; new data displays, such as the Person Lineage browser and the Summary of Ontology-based Annotations. Anticipating more rapid data growth ahead, WormBase continues the process of migrating to a cutting-edge database technology to achieve better stability, scalability, reproducibility and a faster response time. To better serve the broader research community, WormBase, with five other Model Organism Databases and The Gene Ontology project, have begun to collaborate formally as the Alliance of Genome Resources.


Databases, Genetic , Genome , Nematoda/genetics , Animals , Caenorhabditis/genetics , Caenorhabditis elegans/genetics , Data Curation , Data Mining , Datasets as Topic , Disease Models, Animal , Forecasting , Gene Ontology , Humans , Information Storage and Retrieval , Platyhelminths/genetics , Publishing , RNA Interference , Sequence Alignment , User-Computer Interface , Web Browser
5.
Nucleic Acids Res ; 44(D1): D774-80, 2016 Jan 04.
Article En | MEDLINE | ID: mdl-26578572

WormBase (www.wormbase.org) is a central repository for research data on the biology, genetics and genomics of Caenorhabditis elegans and other nematodes. The project has evolved from its original remit to collect and integrate all data for a single species, and now extends to numerous nematodes, ranging from evolutionary comparators of C. elegans to parasitic species that threaten plant, animal and human health. Research activity using C. elegans as a model system is as vibrant as ever, and we have created new tools for community curation in response to the ever-increasing volume and complexity of data. To better allow users to navigate their way through these data, we have made a number of improvements to our main website, including new tools for browsing genomic features and ontology annotations. Finally, we have developed a new portal for parasitic worm genomes. WormBase ParaSite (parasite.wormbase.org) contains all publicly available nematode and platyhelminth annotated genome sequences, and is designed specifically to support helminth genomic research.


Caenorhabditis elegans/genetics , Databases, Genetic , Genome, Helminth , Genomics , Nematoda/genetics , Animals , Genes, Helminth , Molecular Sequence Annotation , Platyhelminths/genetics , Software
6.
Nucleic Acids Res ; 42(Database issue): D789-93, 2014 Jan.
Article En | MEDLINE | ID: mdl-24194605

WormBase (http://www.wormbase.org/) is a highly curated resource dedicated to supporting research using the model organism Caenorhabditis elegans. With an electronic history predating the World Wide Web, WormBase contains information ranging from the sequence and phenotype of individual alleles to genome-wide studies generated using next-generation sequencing technologies. In recent years, we have expanded the contents to include data on additional nematodes of agricultural and medical significance, bringing the knowledge of C. elegans to bear on these systems and providing support for underserved research communities. Manual curation of the primary literature remains a central focus of the WormBase project, providing users with reliable, up-to-date and highly cross-linked information. In this update, we describe efforts to organize the original atomized and highly contextualized curated data into integrated syntheses of discrete biological topics. Next, we discuss our experiences coping with the vast increase in available genome sequences made possible through next-generation sequencing platforms. Finally, we describe some of the features and tools of the new WormBase Web site that help users better find and explore data of interest.


Caenorhabditis elegans/genetics , Databases, Genetic , Genome, Helminth , Animals , Internet , Molecular Sequence Annotation , Nematoda/genetics
7.
Nucleic Acids Res ; 40(Database issue): D735-41, 2012 Jan.
Article En | MEDLINE | ID: mdl-22067452

Since its release in 2000, WormBase (http://www.wormbase.org) has grown from a small resource focusing on a single species and serving a dedicated research community, to one now spanning 15 species essential to the broader biomedical and agricultural research fields. To enhance the rate of curation, we have automated the identification of key data in the scientific literature and use similar methodology for data extraction. To ease access to the data, we are collaborating with journals to link entities in research publications to their report pages at WormBase. To facilitate discovery, we have added new views of the data, integrated large-scale datasets and expanded descriptions of models for human disease. Finally, we have introduced a dramatic overhaul of the WormBase website for public beta testing. Designed to balance complexity and usability, the new site is species-agnostic, highly customizable, and interactive. Casual users and developers alike will be able to leverage the public RESTful application programming interface (API) to generate custom data mining solutions and extensions to the site. We report on the growth of our database and on our work in keeping pace with the growing demand for data, efforts to anticipate the requirements of users and new collaborations with the larger science community.


Caenorhabditis elegans/genetics , Databases, Genetic , Genome, Helminth , Nematoda/genetics , Animals , Caenorhabditis/genetics , Caenorhabditis elegans/anatomy & histology , Computer Graphics , Gene Expression Profiling , Genomics , Internet , Molecular Sequence Annotation , Phenotype
8.
Nature ; 479(7374): 529-33, 2011 Oct 26.
Article En | MEDLINE | ID: mdl-22031327

Parasitic diseases have a devastating, long-term impact on human health, welfare and food production worldwide. More than two billion people are infected with geohelminths, including the roundworms Ascaris (common roundworm), Necator and Ancylostoma (hookworms), and Trichuris (whipworm), mainly in developing or impoverished nations of Asia, Africa and Latin America. In humans, the diseases caused by these parasites result in about 135,000 deaths annually, with a global burden comparable with that of malaria or tuberculosis in disability-adjusted life years. Ascaris alone infects around 1.2 billion people and, in children, causes nutritional deficiency, impaired physical and cognitive development and, in severe cases, death. Ascaris also causes major production losses in pigs owing to reduced growth, failure to thrive and mortality. The Ascaris-swine model makes it possible to study the parasite, its relationship with the host, and ascariasis at the molecular level. To enable such molecular studies, we report the 273 megabase draft genome of Ascaris suum and compare it with other nematode genomes. This genome has low repeat content (4.4%) and encodes about 18,500 protein-coding genes. Notably, the A. suum secretome (about 750 molecules) is rich in peptidases linked to the penetration and degradation of host tissues, and an assemblage of molecules likely to modulate or evade host immune responses. This genome provides a comprehensive resource to the scientific community and underpins the development of new and urgently needed interventions (drugs, vaccines and diagnostic tests) against ascariasis and other nematodiases.


Ascaris suum/genetics , Genome, Helminth/genetics , Animals , Antinematodal Agents , Ascariasis/drug therapy , Ascariasis/parasitology , Ascaris suum/drug effects , Drug Design , Genes, Helminth/genetics , Genomics , Molecular Sequence Annotation , Molecular Targeted Therapy
9.
Nat Methods ; 7(6): 451-3, 2010 Jun.
Article En | MEDLINE | ID: mdl-20418868

We developed a method, MosDEL, to generate targeted knockouts of genes in Caenorhabditis elegans by injection. We generated a double-strand break by mobilizing a Mos1 transposon adjacent to the region to be deleted; the double-stranded break is repaired using injected DNA as a template. Repair can delete up to 25 kb of DNA and simultaneously insert a positive selection marker.


Caenorhabditis elegans/genetics , DNA Transposable Elements/genetics , Gene Deletion , Animals , Comparative Genomic Hybridization , DNA Repair , DNA-Binding Proteins/physiology , Transposases/physiology
10.
Nucleic Acids Res ; 38(Database issue): D463-7, 2010 Jan.
Article En | MEDLINE | ID: mdl-19910365

WormBase (http://www.wormbase.org) is a central data repository for nematode biology. Initially created as a service to the Caenorhabditis elegans research field, WormBase has evolved into a powerful research tool in its own right. In the past 2 years, we expanded WormBase to include the complete genomic sequence, gene predictions and orthology assignments from a range of related nematodes. This comparative data enrich the C. elegans data with improved gene predictions and a better understanding of gene function. In turn, they bring the wealth of experimental knowledge of C. elegans to other systems of medical and agricultural importance. Here, we describe new species and data types now available at WormBase. In addition, we detail enhancements to our curatorial pipeline and website infrastructure to accommodate new genomes and an extensive user base.


Caenorhabditis elegans/genetics , Caenorhabditis/genetics , Computational Biology/methods , Databases, Genetic , Databases, Nucleic Acid , Alleles , Animals , Computational Biology/trends , Databases, Protein , Information Storage and Retrieval/methods , Internet , Phenotype , Protein Structure, Tertiary , Software , Transcription Factors
11.
BMC Bioinformatics ; 9: 549, 2008 Dec 19.
Article En | MEDLINE | ID: mdl-19099578

BACKGROUND: While the C. elegans genome is extensively annotated, relatively little information is available for other Caenorhabditis species. The nematode genome annotation assessment project (nGASP) was launched to objectively assess the accuracy of protein-coding gene prediction software in C. elegans, and to apply this knowledge to the annotation of the genomes of four additional Caenorhabditis species and other nematodes. Seventeen groups worldwide participated in nGASP, and submitted 47 prediction sets across 10 Mb of the C. elegans genome. Predictions were compared to reference gene sets consisting of confirmed or manually curated gene models from WormBase. RESULTS: The most accurate gene-finders were 'combiner' algorithms, which made use of transcript- and protein-alignments and multi-genome alignments, as well as gene predictions from other gene-finders. Gene-finders that used alignments of ESTs, mRNAs and proteins came in second. There was a tie for third place between gene-finders that used multi-genome alignments and ab initio gene-finders. The median gene level sensitivity of combiners was 78% and their specificity was 42%, which is nearly the same accuracy reported for combiners in the human genome. C. elegans genes with exons of unusual hexamer content, as well as those with unusually many exons, short exons, long introns, a weak translation start signal, weak splice sites, or poorly conserved orthologs posed the greatest difficulty for gene-finders. CONCLUSION: This experiment establishes a baseline of gene prediction accuracy in Caenorhabditis genomes, and has guided the choice of gene-finders for the annotation of newly sequenced genomes of Caenorhabditis and other nematode species. We have created new gene sets for C. briggsae, C. remanei, C. brenneri, C. japonica, and Brugia malayi using some of the best-performing gene-finders.


Caenorhabditis/genetics , Genome, Helminth , Animals , Caenorhabditis elegans/classification , Caenorhabditis elegans/genetics , Computational Biology/methods , DNA/genetics , Databases, Genetic , Genes, Helminth , Genomics
12.
Nucleic Acids Res ; 36(Database issue): D612-7, 2008 Jan.
Article En | MEDLINE | ID: mdl-17991679

WormBase (www.wormbase.org) is the major publicly available database of information about Caenorhabditis elegans, an important system for basic biological and biomedical research. Derived from the initial ACeDB database of C. elegans genetic and sequence information, WormBase now includes the genomic, anatomical and functional information about C. elegans, other Caenorhabditis species and other nematodes. As such, it is a crucial resource not only for C. elegans biologists but the larger biomedical and bioinformatics communities. Coverage of core areas of C. elegans biology will allow the biomedical community to make full use of the results of intensive molecular genetic analysis and functional genomic studies of this organism. Improved search and display tools, wider cross-species comparisons and extended ontologies are some of the features that will help scientists extend their research and take advantage of other nematode species genome sequences.


Caenorhabditis elegans/genetics , Databases, Genetic , Genome, Helminth , Animals , Caenorhabditis elegans/metabolism , Chromosome Mapping , Gene Expression , Gene Regulatory Networks , Genes, Helminth , Genomics , Internet , Mass Spectrometry , Peptides/chemistry , Phenotype , User-Computer Interface
13.
Nucleic Acids Res ; 35(Database issue): D472-5, 2007 Jan.
Article En | MEDLINE | ID: mdl-17099225

WormBook (www.wormbook.org) is an open-access, online collection of original, peer-reviewed chapters on the biology of Caenorhabditis elegans and related nematodes. Since WormBook was launched in June 2005 with 12 chapters, it has grown to over 100 chapters, covering nearly every aspect of C.elegans research, from Cell Biology and Neurobiology to Evolution and Ecology. WormBook also serves as the text companion to WormBase, the C.elegans model organism database. Objects such as genes, proteins and cells are linked to the relevant pages in WormBase, providing easily accessible background information. Additionally, WormBook chapters contain links to other relevant topics in WormBook, and the in-text citations are linked to their abstracts in PubMed and full-text references, if available. Since WormBook is online, its chapters are able to contain movies and complex images that would not be possible in a print version. WormBook is designed to keep up with the rapid pace of discovery in the field of C.elegans research and continues to grow. WormBook represents a generic publishing infrastructure that is easily adaptable to other research communities to facilitate the dissemination of knowledge in the field.


Caenorhabditis elegans/physiology , Databases, Factual , Animals , Caenorhabditis elegans/genetics , Internet , Software , User-Computer Interface
14.
Nucleic Acids Res ; 35(Database issue): D506-10, 2007 Jan.
Article En | MEDLINE | ID: mdl-17099234

WormBase (http://wormbase.org), a model organism database for Caenorhabditis elegans and other related nematodes, continues to evolve and expand. Over the past year WormBase has added new data on C.elegans, including data on classical genetics, cell biology and functional genomics; expanded the annotation of closely related nematodes with a new genome browser for Caenorhabditis remanei; and deployed new hardware for stronger performance. Several existing datasets including phenotype descriptions and RNAi experiments have seen a large increase in new content. New datasets such as the C.remanei draft assembly and annotations, the Vancouver Fosmid library and TEC-RED 5' end sites are now available as well. Access to and searching WormBase has become more dependable and flexible via multiple mirror sites and indexing through Google.


Caenorhabditis elegans/genetics , Caenorhabditis/genetics , Databases, Genetic , Animals , Genes, Helminth , Genome, Helminth , Genomics , Internet , Oligonucleotide Array Sequence Analysis , Phenotype , RNA Interference , User-Computer Interface
15.
Methods Mol Biol ; 351: 13-29, 2006.
Article En | MEDLINE | ID: mdl-16988423

The genome of the nematode Caenorhabditis elegans was the first animal genome sequenced. Subsequent sequencing of the Caenorhabditis briggsae genome enabled a comparison of the genomes of two nematode species. In this chapter, we describe the methods that we used to compare the C. elegans genome to that of C. briggsae. We discuss how these methods could be developed to compare the C. elegans and C. briggsae genomes to those of Caenorhabditis remanei, C. n. sp. represented by strains PB2801 and CB5161, among others (1), and Caenorhabditis japonica, which are currently being sequenced.


Caenorhabditis elegans Proteins/genetics , Caenorhabditis elegans/genetics , Evolution, Molecular , Genes, Helminth/genetics , Genomics , Animals , Genomics/methods , Species Specificity
16.
Methods Mol Biol ; 351: 31-50, 2006.
Article En | MEDLINE | ID: mdl-16988424

WormBase is a comprehensive repository for information on Caenorhabditis elegans and related nematodes. Although the primary web-based interface of WormBase (http:// www.wormbase.org/) is familiar to most C. elegans researchers, WormBase also offers powerful data-mining features for addressing questions of comparative genomics, genome structure, and evolution. In this chapter, we focus on data mining at WormBase through the use of flexible web interfaces, custom queries, and scripts. The intended audience includes users wishing to query the database beyond the confines of the web interface or fetch data en masse. No knowledge of programming is necessary or assumed, although users with intermediate skills in the Perl scripting language will be able to utilize additional data-mining approaches.


Caenorhabditis elegans/genetics , Databases, Genetic , Genes, Helminth/genetics , Genomics , Internet , User-Computer Interface , Animals , Genomics/methods , Programming Languages
17.
Nucleic Acids Res ; 34(Database issue): D475-8, 2006 Jan 01.
Article En | MEDLINE | ID: mdl-16381915

WormBase (http://wormbase.org), the public database for genomics and biology of Caenorhabditis elegans, has been restructured for stronger performance and expanded for richer biological content. Performance was improved by accelerating the loading of central data pages such as the omnibus Gene page, by rationalizing internal data structures and software for greater portability, and by making the Genome Browser highly customizable in how it views and exports genomic subsequences. Arbitrarily complex, user-specified queries are now possible through Textpresso (for all available literature) and through WormMart (for most genomic data). Biological content was enriched by reconciling all available cDNA and expressed sequence tag data with gene predictions, clarifying single nucleotide polymorphism and RNAi sites, and summarizing known functions for most genes studied in this organism.


Caenorhabditis elegans Proteins/chemistry , Caenorhabditis elegans Proteins/genetics , Caenorhabditis elegans/genetics , Databases, Genetic , Software , Animals , Caenorhabditis elegans/physiology , DNA, Complementary/chemistry , Expressed Sequence Tags/chemistry , Genome, Helminth , Genomics , Internet , Polymorphism, Single Nucleotide , RNA Interference , User-Computer Interface
18.
Nucleic Acids Res ; 33(Database issue): D383-9, 2005 Jan 01.
Article En | MEDLINE | ID: mdl-15608221

WormBase (http://www.wormbase.org), the model organism database for information about Caenorhabditis elegans and related nematodes, continues to expand in breadth and depth. Over the past year, WormBase has added multiple large-scale datasets including SAGE, interactome, 3D protein structure datasets and NCBI KOGs. To accommodate this growth, the International WormBase Consortium has improved the user interface by adding new features to aid in navigation, visualization of large-scale datasets, advanced searching and data mining. Internally, we have restructured the database models to rationalize the representation of genes and to prepare the system to accept the genome sequences of three additional Caenorhabditis species over the coming year.


Caenorhabditis elegans Proteins/chemistry , Caenorhabditis elegans Proteins/genetics , Caenorhabditis elegans/genetics , Caenorhabditis/genetics , Databases, Genetic , Genomics , Animals , Caenorhabditis/metabolism , Caenorhabditis elegans/metabolism , Caenorhabditis elegans Proteins/metabolism , Databases, Genetic/trends , Gene Expression Profiling , Protein Conformation , Software , Systems Integration , Two-Hybrid System Techniques , User-Computer Interface
19.
Genome Res ; 14(10B): 2155-61, 2004 Oct.
Article En | MEDLINE | ID: mdl-15489338

The ORFeome project has validated and corrected a large number of predicted gene models in the nematode C. elegans, and has provided an enormous resource for proteome-scale studies. To make the resource useful to the research and teaching community, it needs to be integrated with other large-scale data sets, including the C. elegans genome, cell lineage, neurological wiring diagram, transcriptome, and gene expression map. This integration is also critical because the ORFeome data sets, like other 'omics' data sets, have significant false-positive and false-negative rates, and comparison to related data is necessary to make confidence judgments in any given data point. WormBase, the central data repository for information about C. elegans and related nematodes, provides such a platform for integration. In this report, we will describe how C. elegans ORFeome data are deposited in the database, how they are used to correct gene models, how they are integrated and displayed in the context of other data sets at the WormBase Web site, and how WormBase establishes connection with the reagent-based resources at the ORFeome project Web site.


Caenorhabditis elegans Proteins/genetics , Caenorhabditis elegans/genetics , Databases, Genetic , Genes, Helminth/physiology , Genomics , Open Reading Frames/genetics , Animals , Base Sequence , Caenorhabditis elegans Proteins/metabolism , Computational Biology , Information Storage and Retrieval , Molecular Sequence Data , Proteome , Proteomics
20.
Nucleic Acids Res ; 32(Database issue): D411-7, 2004 Jan 01.
Article En | MEDLINE | ID: mdl-14681445

WormBase (http://www.wormbase.org/) is the central data repository for information about Caenorhabditis elegans and related nematodes. As a model organism database, WormBase extends beyond the genomic sequence, integrating experimental results with extensively annotated views of the genome. The WormBase Consortium continues to expand the biological scope and utility of WormBase with the inclusion of large-scale genomic analyses, through active data and literature curation, through new analysis and visualization tools, and through refinement of the user interface. Over the past year, the nearly complete genomic sequence and comparative analyses of the closely related species Caenorhabditis briggsae have been integrated into WormBase, including gene predictions, ortholog assignments and a new synteny viewer to display the relationships between the two species. Extensive site-wide refinement of the user interface now provides quick access to the most frequently accessed resources and a consistent browsing experience across the site. Unified single-page views now provide complete summaries of commonly accessed entries like genes. These advances continue to increase the utility of WormBase for C.elegans researchers, as well as for those researchers exploring problems in functional and comparative genomics in the context of a powerful genetic system.


Caenorhabditis elegans/genetics , Caenorhabditis/genetics , Databases, Genetic , Genomics , Animals , Computational Biology , Information Storage and Retrieval , Internet , User-Computer Interface
...