Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 5 de 5
Filter
Add more filters










Database
Language
Publication year range
1.
Genetics ; 227(1)2024 May 07.
Article in English | MEDLINE | ID: mdl-38573366

ABSTRACT

WormBase has been the major repository and knowledgebase of information about the genome and genetics of Caenorhabditis elegans and other nematodes of experimental interest for over 2 decades. We have 3 goals: to keep current with the fast-paced C. elegans research, to provide better integration with other resources, and to be sustainable. Here, we discuss the current state of WormBase as well as progress and plans for moving core WormBase infrastructure to the Alliance of Genome Resources (the Alliance). As an Alliance member, WormBase will continue to interact with the C. elegans community, develop new features as needed, and curate key information from the literature and large-scale projects.


Subject(s)
Caenorhabditis elegans , Caenorhabditis elegans/genetics , Animals , Databases, Genetic , Genome, Helminth , Genomics/methods
2.
MicroPubl Biol ; 20222022.
Article in English | MEDLINE | ID: mdl-35663412

ABSTRACT

Biological databases collect and standardize data through biocuration. Even though major model organism databases have adopted some automation of curation methods, a large portion of biocuration is still performed manually. To speed up the extraction of the genomic positions of variants, we have developed a hybrid approach that combines regular expressions, Named Entity Recognition based on BERT (Bidirectional Encoder Representations from Transformers) and bag-of-words to extract variant genomic locations from C. elegans papers for WormBase. Our model has a precision of 82.59% for the gene-mutation matches tested on extracted text from 100 papers, and even recovers some data not discovered during manual curation. Code at: https://github.com/WormBase/genomic-info-from-papers.

3.
Genetics ; 220(4)2022 04 04.
Article in English | MEDLINE | ID: mdl-35134929

ABSTRACT

WormBase (www.wormbase.org) is the central repository for the genetics and genomics of the nematode Caenorhabditis elegans. We provide the research community with data and tools to facilitate the use of C. elegans and related nematodes as model organisms for studying human health, development, and many aspects of fundamental biology. Throughout our 22-year history, we have continued to evolve to reflect progress and innovation in the science and technologies involved in the study of C. elegans. We strive to incorporate new data types and richer data sets, and to provide integrated displays and services that avail the knowledge generated by the published nematode genetics literature. Here, we provide a broad overview of the current state of WormBase in terms of data type, curation workflows, analysis, and tools, including exciting new advances for analysis of single-cell data, text mining and visualization, and the new community collaboration forum. Concurrently, we continue the integration and harmonization of infrastructure, processes, and tools with the Alliance of Genome Resources, of which WormBase is a founding member.


Subject(s)
Caenorhabditis , Nematoda , Animals , Caenorhabditis/genetics , Caenorhabditis elegans/genetics , Databases, Genetic , Genome , Genomics , Humans , Nematoda/genetics
4.
Nucleic Acids Res ; 50(D1): D996-D1003, 2022 01 07.
Article in English | MEDLINE | ID: mdl-34791415

ABSTRACT

Ensembl Genomes (https://www.ensemblgenomes.org) provides access to non-vertebrate genomes and analysis complementing vertebrate resources developed by the Ensembl project (https://www.ensembl.org). The two resources collectively present genome annotation through a consistent set of interfaces spanning the tree of life presenting genome sequence, annotation, variation, transcriptomic data and comparative analysis. Here, we present our largest increase in plant, metazoan and fungal genomes since the project's inception creating one of the world's most comprehensive genomic resources and describe our efforts to reduce genome redundancy in our Bacteria portal. We detail our new efforts in gene annotation, our emerging support for pangenome analysis, our efforts to accelerate data dissemination through the Ensembl Rapid Release resource and our new AlphaFold visualization. Finally, we present details of our future plans including updates on our integration with Ensembl, and how we plan to improve our support for the microbial research community. Software and data are made available without restriction via our website, online tools platform and programmatic interfaces (available under an Apache 2.0 license). Data updates are synchronised with Ensembl's release cycle.


Subject(s)
Databases, Genetic , Genomics , Internet , Software , Animals , Computational Biology , Genome, Bacterial/genetics , Genome, Fungal/genetics , Genome, Plant/genetics , Plants/classification , Plants/genetics , Vertebrates/classification , Vertebrates/genetics
5.
MicroPubl Biol ; 20212021.
Article in English | MEDLINE | ID: mdl-34746681

ABSTRACT

C. elegans was the first animal to have its genome completely sequenced. In the decades since, the genome continues to be actively curated, annotated, and improved. Here we report the discovery of a new gene in a region of the genome that is currently not associated with any annotated gene or feature. We present RNA-seq and RT-PCR evidence that this gene is expressed at detectable levels, and that it is alternatively spliced. The new gene (Y97E10C.2) shares an operon with two upstream genes. We provide RNA-seq and RT-PCR evidence for a missing exon in the upstream gene T05B11.7, as well as an alternatively-spliced exon.

SELECTION OF CITATIONS
SEARCH DETAIL
...