Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 36
Filtrar
1.
Nucleic Acids Res ; 52(6): 2848-2864, 2024 Apr 12.
Artículo en Inglés | MEDLINE | ID: mdl-38416577

RESUMEN

During their maturation, ribosomal RNAs (rRNAs) are decorated by hundreds of chemical modifications that participate in proper folding of rRNA secondary structures and therefore in ribosomal function. Along with pseudouridine, methylation of the 2'-hydroxyl ribose moiety (Nm) is the most abundant modification of rRNAs. The majority of Nm modifications in eukaryotes are placed by Fibrillarin, a conserved methyltransferase belonging to a ribonucleoprotein complex guided by C/D box small nucleolar RNAs (C/D box snoRNAs). These modifications impact interactions between rRNAs, tRNAs and mRNAs, and some are known to fine tune translation rates and efficiency. In this study, we built the first comprehensive map of Nm sites in Drosophila melanogaster rRNAs using two complementary approaches (RiboMethSeq and Nanopore direct RNA sequencing) and identified their corresponding C/D box snoRNAs by whole-transcriptome sequencing. We de novo identified 61 Nm sites, from which 55 are supported by both sequencing methods, we validated the expression of 106 C/D box snoRNAs and we predicted new or alternative rRNA Nm targets for 31 of them. Comparison of methylation level upon different stresses show only slight but specific variations, indicating that this modification is relatively stable in D. melanogaster. This study paves the way to investigate the impact of snoRNA-mediated 2'-O-methylation on translation and proteostasis in a whole organism.


Asunto(s)
Drosophila melanogaster , ARN Nucleolar Pequeño , Animales , ARN Nucleolar Pequeño/metabolismo , Drosophila melanogaster/genética , Drosophila melanogaster/metabolismo , Ribosomas/genética , Ribosomas/metabolismo , Secuencia de Bases , ARN Ribosómico/metabolismo , Metilación
2.
Nucleic Acids Res ; 49(D1): D899-D907, 2021 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-33219682

RESUMEN

FlyBase (flybase.org) is an essential online database for researchers using Drosophila melanogaster as a model organism, facilitating access to a diverse array of information that includes genetic, molecular, genomic and reagent resources. Here, we describe the introduction of several new features at FlyBase, including Pathway Reports, paralog information, disease models based on orthology, customizable tables within reports and overview displays ('ribbons') of expression and disease data. We also describe a variety of recent important updates, including incorporation of a developmental proteome, upgrades to the GAL4 search tab, additional Experimental Tool Reports, migration to JBrowse for genome browsing and improvements to batch queries/downloads and the Fast-Track Your Paper tool.


Asunto(s)
Biología Computacional/métodos , Bases de Datos Genéticas , Drosophila melanogaster/genética , Genoma de los Insectos/genética , Genómica/métodos , Animales , Genes de Insecto/genética , Bases del Conocimiento , Anotación de Secuencia Molecular/métodos , Motor de Búsqueda/métodos , Navegador Web
3.
Nucleic Acids Res ; 47(D1): D759-D765, 2019 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-30364959

RESUMEN

FlyBase (flybase.org) is a knowledge base that supports the community of researchers that use the fruit fly, Drosophila melanogaster, as a model organism. The FlyBase team curates and organizes a diverse array of genetic, molecular, genomic, and developmental information about Drosophila. At the beginning of 2018, 'FlyBase 2.0' was released with a significantly improved user interface and new tools. Among these important changes are a new organization of search results into interactive lists or tables (hitlists), enhanced reference lists, and new protein domain graphics. An important new data class called 'experimental tools' consolidates information on useful fly strains and other resources related to a specific gene, which significantly enhances the ability of the Drosophila researcher to design and carry out experiments. With the release of FlyBase 2.0, there has also been a restructuring of backend architecture and a continued development of application programming interfaces (APIs) for programmatic access to FlyBase data. In this review, we describe these major new features and functionalities of the FlyBase 2.0 site and how they support the use of Drosophila as a model organism for biological discovery and translational research.


Asunto(s)
Bases de Datos Genéticas , Drosophila melanogaster/genética , Genoma de los Insectos/genética , Genómica , Animales , Dominios Proteicos/genética , Programas Informáticos
4.
Nucleic Acids Res ; 45(D1): D663-D671, 2017 01 04.
Artículo en Inglés | MEDLINE | ID: mdl-27799470

RESUMEN

Since 1992, FlyBase (flybase.org) has been an essential online resource for the Drosophila research community. Concentrating on the most extensively studied species, Drosophila melanogaster, FlyBase includes information on genes (molecular and genetic), transgenic constructs, phenotypes, genetic and physical interactions, and reagents such as stocks and cDNAs. Access to data is provided through a number of tools, reports, and bulk-data downloads. Looking to the future, FlyBase is expanding its focus to serve a broader scientific community. In this update, we describe new features, datasets, reagent collections, and data presentations that address this goal, including enhanced orthology data, Human Disease Model Reports, protein domain search and visualization, concise gene summaries, a portal for external resources, video tutorials and the FlyBase Community Advisory Group.


Asunto(s)
Biología Computacional/métodos , Bases de Datos Genéticas , Drosophila/genética , Genómica/métodos , Animales , Modelos Animales de Enfermedad , Estudios de Asociación Genética , Humanos , Navegador Web
5.
Nucleic Acids Res ; 44(D1): D786-92, 2016 Jan 04.
Artículo en Inglés | MEDLINE | ID: mdl-26467478

RESUMEN

Many publications describe sets of genes or gene products that share a common biology. For example, genome-wide studies and phylogenetic analyses identify genes related in sequence; high-throughput genetic and molecular screens reveal functionally related gene products; and advanced proteomic methods can determine the subunit composition of multi-protein complexes. It is useful for such gene collections to be presented as discrete lists within the appropriate Model Organism Database (MOD) so that researchers can readily access these data alongside other relevant information. To this end, FlyBase (flybase.org), the MOD for Drosophila melanogaster, has established a 'Gene Group' resource: high-quality sets of genes derived from the published literature and organized into individual report pages. To facilitate further analyses, Gene Group Reports also include convenient download and analysis options, together with links to equivalent gene groups at other databases. This new resource will enable researchers with diverse backgrounds and interests to easily view and analyse acknowledged D. melanogaster gene sets and compare them with those of other species.


Asunto(s)
Bases de Datos Genéticas , Drosophila melanogaster/genética , Genes de Insecto , Animales , Proteínas de Drosophila/genética
6.
Nucleic Acids Res ; 41(Database issue): D751-7, 2013 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-23125371

RESUMEN

An accurate, comprehensive, non-redundant and up-to-date bibliography is a crucial component of any Model Organism Database (MOD). Principally, the bibliography provides a set of references that are specific to the field served by the MOD. Moreover, it serves as a backbone to which all curated biological data can be attributed. Here, we describe the organization and main features of the bibliography in FlyBase (flybase.org), the MOD for Drosophila melanogaster. We present an overview of the current content of the bibliography, the pipeline for identifying and adding new references, the presentation of data within Reference Reports and effective methods for searching and retrieving bibliographic data. We highlight recent improvements in these areas and describe the advantages of using the FlyBase bibliography over alternative literature resources. Although this article is focused on bibliographic data, many of the features and tools described are applicable to browsing and querying other datasets in FlyBase.


Asunto(s)
Bibliografías como Asunto , Bases de Datos Genéticas , Drosophila melanogaster/genética , Animales , Drosophila/genética , Internet
7.
PLoS Genet ; 7(12): e1002408, 2011 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-22194697

RESUMEN

The ribosome is critical for all aspects of cell growth due to its essential role in protein synthesis. Paradoxically, many Ribosomal proteins (Rps) act as tumour suppressors in Drosophila and vertebrates. To examine how reductions in Rps could lead to tissue overgrowth, we took advantage of the observation that an RpS6 mutant dominantly suppresses the small rough eye phenotype in a cyclin E hypomorphic mutant (cycE(JP)). We demonstrated that the suppression of cycE(JP) by the RpS6 mutant is not a consequence of restoring CycE protein levels or activity in the eye imaginal tissue. Rather, the use of UAS-RpS6 RNAi transgenics revealed that the suppression of cycE(JP) is exerted via a mechanism extrinsic to the eye, whereby reduced Rp levels in the prothoracic gland decreases the activity of ecdysone, the steroid hormone, delaying developmental timing and hence allowing time for tissue and organ overgrowth. These data provide for the first time a rationale to explain the counter-intuitive organ overgrowth phenotypes observed for certain members of the Minute class of Drosophila Rp mutants. They also demonstrate how Rp mutants can affect growth and development cell non-autonomously.


Asunto(s)
Ciclina E/metabolismo , Drosophila melanogaster/crecimiento & desarrollo , Drosophila melanogaster/genética , Ecdisona/metabolismo , Ojo/crecimiento & desarrollo , Proteína S6 Ribosómica/genética , Animales , Animales Modificados Genéticamente , Proliferación Celular , Ciclina E/genética , Drosophila melanogaster/metabolismo , Glándulas Endocrinas/metabolismo , Ojo/metabolismo , Regulación del Desarrollo de la Expresión Génica , Organogénesis/genética , Fenotipo , Interferencia de ARN , Proteína S6 Ribosómica/metabolismo
8.
MicroPubl Biol ; 20242024.
Artículo en Inglés | MEDLINE | ID: mdl-38741935

RESUMEN

The conserved family of alpha-ketoacid dehydrogenase complexes (AKDHCs) catalyze essential reactions in central metabolism and their dysregulation is implicated in several human diseases. Drosophila melanogaster provides an excellent model system to study the genetics and functions of these complexes. However, a systematic account of Drosophila AKDHCs and their composition has been lacking. Here, I identify and classify the genes encoding all Drosophila AKDHC subunits, update their functional annotations and integrate this work into the FlyBase database.

9.
Genetics ; 227(1)2024 05 07.
Artículo en Inglés | MEDLINE | ID: mdl-38301657

RESUMEN

FlyBase (flybase.org) is a model organism database and knowledge base about Drosophila melanogaster, commonly known as the fruit fly. Researchers from around the world rely on the genetic, genomic, and functional information available in FlyBase, as well as its tools to view and interrogate these data. In this article, we describe the latest developments and updates to FlyBase. These include the introduction of single-cell RNA sequencing data, improved content and display of functional information, updated orthology pipelines, new chemical reports, and enhancements to our outreach resources.


Asunto(s)
Bases de Datos Genéticas , Drosophila melanogaster , Animales , Drosophila melanogaster/genética , Genes de Insecto , Genoma de los Insectos , Genómica/métodos
10.
Curr Protoc ; 3(4): e731, 2023 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-37014762

RESUMEN

FlyBase (www.flybase.org) is the primary online database of genetic, genomic, and functional information about Drosophila melanogaster. The long and rich history of Drosophila research, combined with recent surges in genomic-scale and high-throughput technologies, means that FlyBase now houses a huge quantity of data. Researchers need to be able to query these data rapidly and intuitively, and the QuickSearch tool has been designed to meet these needs. This tool is conveniently located on the FlyBase homepage and is organized into a series of simple tabbed interfaces that cover the major data and annotation classes within the database. This article describes the functionality of all aspects of the QuickSearch tool. With this knowledge, FlyBase users will be equipped to take full advantage of all QuickSearch features and thereby gain improved access to data relevant to their research. © 2023 The Authors. Current Protocols published by Wiley Periodicals LLC. Basic Protocol 1: Using the "Search FlyBase" tab of QuickSearch Basic Protocol 2: Using the "Data Class" tab of QuickSearch Basic Protocol 3: Using the "References" tab of QuickSearch Basic Protocol 4: Using the "Gene Groups" tab of QuickSearch Basic Protocol 5: Using the "Pathways" tab of QuickSearch Basic Protocol 6: Using the "GO" tab of QuickSearch Basic Protocol 7: Using the "Protein Domains" tab of QuickSearch Basic Protocol 8: Using the "Expression" tab of QuickSearch Basic Protocol 9: Using the "GAL4 etc" tab of QuickSearch Basic Protocol 10: Using the "Phenotype" tab of QuickSearch Basic Protocol 11: Using the "Human Disease" tab of QuickSearch Basic Protocol 12: Using the "Homologs" tab of QuickSearch Support Protocol 1: Managing FlyBase hit lists.


Asunto(s)
Drosophila melanogaster , Genoma de los Insectos , Animales , Humanos , Drosophila melanogaster/genética , Genes de Insecto , Bases de Datos Genéticas , Drosophila/genética
11.
Genetics ; 224(1)2023 05 04.
Artículo en Inglés | MEDLINE | ID: mdl-36866529

RESUMEN

The Gene Ontology (GO) knowledgebase (http://geneontology.org) is a comprehensive resource concerning the functions of genes and gene products (proteins and noncoding RNAs). GO annotations cover genes from organisms across the tree of life as well as viruses, though most gene function knowledge currently derives from experiments carried out in a relatively small number of model organisms. Here, we provide an updated overview of the GO knowledgebase, as well as the efforts of the broad, international consortium of scientists that develops, maintains, and updates the GO knowledgebase. The GO knowledgebase consists of three components: (1) the GO-a computational knowledge structure describing the functional characteristics of genes; (2) GO annotations-evidence-supported statements asserting that a specific gene product has a particular functional characteristic; and (3) GO Causal Activity Models (GO-CAMs)-mechanistic models of molecular "pathways" (GO biological processes) created by linking multiple GO annotations using defined relations. Each of these components is continually expanded, revised, and updated in response to newly published discoveries and receives extensive QA checks, reviews, and user feedback. For each of these components, we provide a description of the current contents, recent developments to keep the knowledgebase up to date with new discoveries, and guidance on how users can best make use of the data that we provide. We conclude with future directions for the project.


Asunto(s)
Bases de Datos Genéticas , Proteínas , Ontología de Genes , Proteínas/genética , Anotación de Secuencia Molecular , Biología Computacional
12.
BMC Bioinformatics ; 13: 16, 2012 Jan 26.
Artículo en Inglés | MEDLINE | ID: mdl-22280404

RESUMEN

BACKGROUND: Curation of information from bioscience literature into biological knowledge databases is a crucial way of capturing experimental information in a computable form. During the biocuration process, a critical first step is to identify from all published literature the papers that contain results for a specific data type the curator is interested in annotating. This step normally requires curators to manually examine many papers to ascertain which few contain information of interest and thus, is usually time consuming. We developed an automatic method for identifying papers containing these curation data types among a large pool of published scientific papers based on the machine learning method Support Vector Machine (SVM). This classification system is completely automatic and can be readily applied to diverse experimental data types. It has been in use in production for automatic categorization of 10 different experimental datatypes in the biocuration process at WormBase for the past two years and it is in the process of being adopted in the biocuration process at FlyBase and the Saccharomyces Genome Database (SGD). We anticipate that this method can be readily adopted by various databases in the biocuration community and thereby greatly reducing time spent on an otherwise laborious and demanding task. We also developed a simple, readily automated procedure to utilize training papers of similar data types from different bodies of literature such as C. elegans and D. melanogaster to identify papers with any of these data types for a single database. This approach has great significance because for some data types, especially those of low occurrence, a single corpus often does not have enough training papers to achieve satisfactory performance. RESULTS: We successfully tested the method on ten data types from WormBase, fifteen data types from FlyBase and three data types from Mouse Genomics Informatics (MGI). It is being used in the curation work flow at WormBase for automatic association of newly published papers with ten data types including RNAi, antibody, phenotype, gene regulation, mutant allele sequence, gene expression, gene product interaction, overexpression phenotype, gene interaction, and gene structure correction. CONCLUSIONS: Our methods are applicable to a variety of data types with training set containing several hundreds to a few thousand documents. It is completely automatic and, thus can be readily incorporated to different workflow at different literature-based databases. We believe that the work presented here can contribute greatly to the tremendous task of automating the important yet labor-intensive biocuration effort.


Asunto(s)
Inteligencia Artificial , Bases de Datos Factuales , Bases de Datos Genéticas , Animales , Automatización , Caenorhabditis elegans/genética , Drosophila melanogaster/genética , Genómica , Ratones/genética , Publicaciones , Máquina de Vectores de Soporte
13.
MicroPubl Biol ; 20222022.
Artículo en Inglés | MEDLINE | ID: mdl-35789696

RESUMEN

Transfer RNAs (tRNAs) are ubiquitous adapter molecules that link specific codons in messenger RNA (mRNA) with their corresponding amino acids during protein synthesis. The tRNA genes of Drosophila have been investigated for over half a century but have lacked systematic identification and nomenclature. Here, we review and integrate data within FlyBase and the Genomic tRNA Database (GtRNAdb) to identify the full complement of tRNA genes in the D. melanogaster nuclear and mitochondrial genomes. We apply a logical and informative nomenclature to all tRNA genes, and provide an overview of their characteristics and genomic features.

14.
BMC Bioinformatics ; 12: 175, 2011 May 19.
Artículo en Inglés | MEDLINE | ID: mdl-21595960

RESUMEN

BACKGROUND: Journal articles and databases are two major modes of communication in the biological sciences, and thus integrating these critical resources is of urgent importance to increase the pace of discovery. Projects focused on bridging the gap between journals and databases have been on the rise over the last five years and have resulted in the development of automated tools that can recognize entities within a document and link those entities to a relevant database. Unfortunately, automated tools cannot resolve ambiguities that arise from one term being used to signify entities that are quite distinct from one another. Instead, resolving these ambiguities requires some manual oversight. Finding the right balance between the speed and portability of automation and the accuracy and flexibility of manual effort is a crucial goal to making text markup a successful venture. RESULTS: We have established a journal article mark-up pipeline that links GENETICS journal articles and the model organism database (MOD) WormBase. This pipeline uses a lexicon built with entities from the database as a first step. The entity markup pipeline results in links from over nine classes of objects including genes, proteins, alleles, phenotypes and anatomical terms. New entities and ambiguities are discovered and resolved by a database curator through a manual quality control (QC) step, along with help from authors via a web form that is provided to them by the journal. New entities discovered through this pipeline are immediately sent to an appropriate curator at the database. Ambiguous entities that do not automatically resolve to one link are resolved by hand ensuring an accurate link. This pipeline has been extended to other databases, namely Saccharomyces Genome Database (SGD) and FlyBase, and has been implemented in marking up a paper with links to multiple databases. CONCLUSIONS: Our semi-automated pipeline hyperlinks articles published in GENETICS to model organism databases such as WormBase. Our pipeline results in interactive articles that are data rich with high accuracy. The use of a manual quality control step sets this pipeline apart from other hyperlinking tools and results in benefits to authors, journals, readers and databases.


Asunto(s)
Bases de Datos Factuales , Publicaciones Periódicas como Asunto , Animales , Biología/métodos , Biología/tendencias , Caenorhabditis elegans/genética , Bases de Datos Genéticas , Internet , Control de Calidad
15.
Nucleic Acids Res ; 37(Database issue): D555-9, 2009 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-18948289

RESUMEN

FlyBase (http://flybase.org) is a database of Drosophila genetic and genomic information. Gene Ontology (GO) terms are used to describe three attributes of wild-type gene products: their molecular function, the biological processes in which they play a role, and their subcellular location. This article describes recent changes to the FlyBase GO annotation strategy that are improving the quality of the GO annotation data. Many of these changes stem from our participation in the GO Reference Genome Annotation Project--a multi-database collaboration producing comprehensive GO annotation sets for 12 diverse species.


Asunto(s)
Bases de Datos Genéticas , Proteínas de Drosophila/genética , Drosophila/genética , Genes de Insecto , Animales , Genoma de los Insectos , Genómica , Vocabulario Controlado
16.
Front Physiol ; 12: 648481, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-33815151

RESUMEN

UDP-glycosyltransferases (UGTs) are important conjugation enzymes found in all kingdoms of life, catalyzing a sugar conjugation with small lipophilic compounds and playing a crucial role in detoxification and homeostasis. The UGT gene family is defined by a signature motif in the C-terminal domain where the uridine diphosphate (UDP)-sugar donor binds. UGTs have been identified in a number of insect genomes over the last decade and much progress has been achieved in characterizing their expression patterns and molecular functions. Here, we present an update of the complete repertoire of UGT genes in Drosophila melanogaster and provide a brief overview of the latest research in this model insect. A total of 35 UGT genes are found in the D. melanogaster genome, localized to chromosomes 2 and 3 with a high degree of gene duplications on the chromosome arm 3R. All D. melanogaster UGT genes have now been named in FlyBase according to the unified UGT nomenclature guidelines. A phylogenetic analysis of UGT genes shows lineage-specific gene duplications. Analysis of anatomical and induced gene expression patterns demonstrate that some UGT genes are differentially expressed in various tissues or after environmental treatments. Extended searches of UGT orthologs from 18 additional Drosophila species reveal a diversity of UGT gene numbers and composition. The roles of Drosophila UGTs identified to date are briefly reviewed, and include xenobiotic metabolism, nicotine resistance, olfaction, cold tolerance, sclerotization, pigmentation, and immunity. Together, the updated genomic information and research overview provided herein will aid further research in this developing field.

17.
MicroPubl Biol ; 20212021 Jun 23.
Artículo en Inglés | MEDLINE | ID: mdl-34189422

RESUMEN

The neprilysin (M13) family of metalloendopeptidases comprises highly conserved ectoenzymes that cleave and thereby inactivate many physiologically relevant peptides in the extracellular space. Impaired neprilysin activity is associated with numerous human diseases. Here, we present a comprehensive list and classification of M13 family members in Drosophila melanogaster. Seven Neprilysin (Nep) genes encode active peptidases, while 21 Neprilysin-like (Nepl) genes encode proteins predicted to be catalytically inactive. RNAseq data demonstrate that all 28 genes are expressed during development, often in a tissue-specific pattern. Most Nep proteins possess a transmembrane domain, whereas almost all Nepl proteins are predicted to be secreted.

18.
Database (Oxford) ; 20202020 01 01.
Artículo en Inglés | MEDLINE | ID: mdl-31960022

RESUMEN

Brief summaries describing the function of each gene's product(s) are of great value to the research community, especially when interpreting genome-wide studies that reveal changes to hundreds of genes. However, manually writing such summaries, even for a single species, is a daunting task; for example, the Drosophila melanogaster genome contains almost 14 000 protein-coding genes. One solution is to use computational methods to generate summaries, but this often fails to capture the key functions or express them eloquently. Here, we describe how we solicited help from the research community to generate manually written summaries of D. melanogaster gene function. Based on the data within the FlyBase database, we developed a computational pipeline to identify researchers who have worked extensively on each gene. We e-mailed these researchers to ask them to draft a brief summary of the main function(s) of the gene's product, which we edited for consistency to produce a 'gene snapshot'. This approach yielded 1800 gene snapshot submissions within a 3-month period. We discuss the general utility of this strategy for other databases that capture data from the research literature. Database URL: https://flybase.org/.


Asunto(s)
Recolección de Datos/métodos , Bases de Datos Genéticas , Drosophila melanogaster/genética , Genoma de los Insectos/genética , Animales , Programas Informáticos
19.
Fly (Austin) ; 14(1-4): 49-61, 2020.
Artículo en Inglés | MEDLINE | ID: mdl-31933406

RESUMEN

DNA synthesis during replication or repair is a fundamental cellular process that is catalyzed by a set of evolutionary conserved polymerases. Despite a large body of research, the DNA polymerases of Drosophila melanogaster have not yet been systematically reviewed, leading to inconsistencies in their nomenclature, shortcomings in their functional (Gene Ontology, GO) annotations and an under-appreciation of the extent of their characterization. Here, we describe the complete set of DNA polymerases in D. melanogaster, applying nomenclature already in widespread use in other species, and improving their functional annotation. A total of 19 genes encode the proteins comprising three replicative polymerases (alpha-primase, delta, epsilon), five translesion/repair polymerases (zeta, eta, iota, Rev1, theta) and the mitochondrial polymerase (gamma). We also provide an overview of the biochemical and genetic characterization of these factors in D. melanogaster. This work, together with the incorporation of the improved nomenclature and GO annotation into key biological databases, including FlyBase and UniProtKB, will greatly facilitate access to information about these important proteins.


Asunto(s)
ADN Polimerasa Dirigida por ADN/metabolismo , Proteínas de Drosophila/metabolismo , Drosophila melanogaster/enzimología , Regulación Enzimológica de la Expresión Génica/fisiología , Animales , ADN Polimerasa Dirigida por ADN/genética , Proteínas de Drosophila/genética
20.
Database (Oxford) ; 20192019 01 01.
Artículo en Inglés | MEDLINE | ID: mdl-30689844

RESUMEN

The catalytic activities of enzymes can be described using Gene Ontology (GO) terms and Enzyme Commission (EC) numbers. These annotations are available from numerous biological databases and are routinely accessed by researchers and bioinformaticians to direct their work. However, enzyme data may not be congruent between different resources, while the origin, quality and genomic coverage of these data within any one resource are often unclear. GO/EC annotations are assigned either manually by expert curators or inferred computationally, and there is potential for errors in both types of annotation. If such errors remain unchecked, false positive annotations may be propagated across multiple resources, significantly degrading the quality and usefulness of these data. Similarly, the absence of annotations (false negatives) from any one resource can lead to incorrect inferences or conclusions. We are systematically reviewing and enhancing the functional annotation of the enzymes of Drosophila melanogaster, focusing on improvements within the FlyBase (www.flybase.org) database. We have reviewed four major enzyme groups to date: oxidoreductases, lyases, isomerases and ligases. Herein, we describe our review workflow, the improvement in the quality and coverage of enzyme annotations within FlyBase and the wider impact of our work on other related databases.


Asunto(s)
Bases de Datos Genéticas , Proteínas de Drosophila/genética , Drosophila melanogaster , Enzimas/genética , Genes de Insecto/genética , Anotación de Secuencia Molecular/métodos , Animales , Drosophila melanogaster/enzimología , Drosophila melanogaster/genética , Ontología de Genes , Genómica
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA