RESUMO
Life sciences are yielding huge data sets that underpin scientific discoveries fundamental to improvement in human health, agriculture and the environment. In support of these discoveries, a plethora of databases and tools are deployed, in technically complex and diverse implementations, across a spectrum of scientific disciplines. The corpus of documentation of these resources is fragmented across the Web, with much redundancy, and has lacked a common standard of information. The outcome is that scientists must often struggle to find, understand, compare and use the best resources for the task at hand.Here we present a community-driven curation effort, supported by ELIXIR-the European infrastructure for biological information-that aspires to a comprehensive and consistent registry of information about bioinformatics resources. The sustainable upkeep of this Tools and Data Services Registry is assured by a curation effort driven by and tailored to local needs, and shared amongst a network of engaged partners.As of November 2015, the registry includes 1785 resources, with depositions from 126 individual registrations including 52 institutional providers and 74 individuals. With community support, the registry can become a standard for dissemination of information about bioinformatics resources: we welcome everyone to join us in this common endeavour. The registry is freely available at https://bio.tools.
Assuntos
Biologia Computacional , Sistema de Registros , Curadoria de Dados , SoftwareRESUMO
BACKGROUND: Sharing of data about variation and the associated phenotypes is a critical need, yet variant information can be arbitrarily complex, making a single standard vocabulary elusive and re-formatting difficult. Complex standards have proven too time-consuming to implement. RESULTS: The GEN2PHEN project addressed these difficulties by developing a comprehensive data model for capturing biomedical observations, Observ-OM, and building the VarioML format around it. VarioML pairs a simplified open specification for describing variants, with a toolkit for adapting the specification into one's own research workflow. Straightforward variant data can be captured, federated, and exchanged with no overhead; more complex data can be described, without loss of compatibility. The open specification enables push-button submission to gene variant databases (LSDBs) e.g., the Leiden Open Variation Database, using the Cafe Variome data publishing service, while VarioML bidirectionally transforms data between XML and web-application code formats, opening up new possibilities for open source web applications building on shared data. A Java implementation toolkit makes VarioML easily integrated into biomedical applications. VarioML is designed primarily for LSDB data submission and transfer scenarios, but can also be used as a standard variation data format for JSON and XML document databases and user interface components. CONCLUSIONS: VarioML is a set of tools and practices improving the availability, quality, and comprehensibility of human variation information. It enables researchers, diagnostic laboratories, and clinics to share that information with ease, clarity, and without ambiguity.
Assuntos
Bases de Dados Genéticas , Doença/genética , Variação Genética , Disseminação de Informação/métodos , Sistemas Computacionais , HumanosRESUMO
The filamentous fungus Aspergillus niger is widely exploited by the fermentation industry for the production of enzymes and organic acids, particularly citric acid. We sequenced the 33.9-megabase genome of A. niger CBS 513.88, the ancestor of currently used enzyme production strains. A high level of synteny was observed with other aspergilli sequenced. Strong function predictions were made for 6,506 of the 14,165 open reading frames identified. A detailed description of the components of the protein secretion pathway was made and striking differences in the hydrolytic enzyme spectra of aspergilli were observed. A reconstructed metabolic network comprising 1,069 unique reactions illustrates the versatile metabolism of A. niger. Noteworthy is the large number of major facilitator superfamily transporters and fungal zinc binuclear cluster transcription factors, and the presence of putative gene clusters for fumonisin and ochratoxin A synthesis.
Assuntos
Aspergillus niger/genética , Mapeamento Cromossômico , Cromossomos Fúngicos/genética , Genoma Fúngico/genética , Proteínas de Plantas/genética , Análise de Sequência de DNA/métodos , Sequência de Bases , Dados de Sequência MolecularRESUMO
The resistance of Candida albicans to many stresses is dependent on the stress-activated protein kinase (SAPK) Hog1. Hence we have explored the role of Hog1 in the regulation of transcriptional responses to stress. DNA microarrays were used to characterize the global transcriptional responses of HOG1 and hog1 cells to three stress conditions that activate the Hog1 SAPK: osmotic stress, oxidative stress, and heavy metal stress. This revealed both stress-specific transcriptional responses and a core transcriptional response to stress in C. albicans. The core transcriptional response was characterized by a subset of genes that responded in a stereotypical manner to all of the stresses analyzed. Inactivation of HOG1 significantly attenuated transcriptional responses to osmotic and heavy metal stresses, but not to oxidative stress, and this was reflected in the role of Hog1 in the regulation of C. albicans core stress genes. Instead, the Cap1 transcription factor plays a key role in the oxidative stress regulation of C. albicans core stress genes. Our data show that the SAPK network in C. albicans has diverged from corresponding networks in model yeasts and that the C. albicans SAPK pathway functions in parallel with other pathways to regulate the core transcriptional response to stress.
Assuntos
Candida albicans/enzimologia , Proteínas Fúngicas/fisiologia , Regulação Fúngica da Expressão Gênica , Proteínas Quinases Ativadas por Mitógeno/fisiologia , RNA Mensageiro/metabolismo , Fatores de Transcrição de Zíper de Leucina Básica , Candida albicans/genética , Proteínas de Ciclo Celular/metabolismo , Análise por Conglomerados , Proteínas Fúngicas/genética , Proteínas Fúngicas/metabolismo , Deleção de Genes , Perfilação da Expressão Gênica , Genoma Fúngico , Metais Pesados/metabolismo , Proteínas Quinases Ativadas por Mitógeno/genética , Modelos Biológicos , Pressão Osmótica , Estresse Oxidativo , Saccharomycetales/metabolismo , Schizosaccharomyces/metabolismo , Transdução de SinaisRESUMO
BACKGROUND: The number of sequenced fungal genomes is ever increasing, with about 200 genomes already fully sequenced or in progress. Only a small percentage of those genomes have been comprehensively studied, for example using techniques from functional genomics. Comparative analysis has proven to be a useful strategy for enhancing our understanding of evolutionary biology and of the less well understood genomes. However, the data required for these analyses tends to be distributed in various heterogeneous data sources, making systematic comparative studies a cumbersome task. Furthermore, comparative analyses benefit from close integration of derived data sets that cluster genes or organisms in a way that eases the expression of requests that clarify points of similarity or difference between species. DESCRIPTION: To support systematic comparative analyses of fungal genomes we have developed the e-Fungi database, which integrates a variety of data for more than 30 fungal genomes. Publicly available genome data, functional annotations, and pathway information has been integrated into a single data repository and complemented with results of comparative analyses, such as MCL and OrthoMCL cluster analysis, and predictions of signaling proteins and the sub-cellular localisation of proteins. To access the data, a library of analysis tasks is available through a web interface. The analysis tasks are motivated by recent comparative genomics studies, and aim to support the study of evolutionary biology as well as community efforts for improving the annotation of genomes. Web services for each query are also available, enabling the tasks to be incorporated into workflows. CONCLUSION: The e-Fungi database provides fungal biologists with a resource for comparative studies of a large range of fungal genomes. Its analysis library supports the comparative study of genome data, functional annotation, and results of large scale analyses over all the genomes stored in the database. The database is accessible at http://www.e-fungi.org.uk, as is the WSDL for the web services.
Assuntos
Bases de Dados Genéticas , Genoma Fúngico/genética , Biologia Computacional/métodos , Sistemas de Gerenciamento de Base de Dados , Internet , Interface Usuário-ComputadorRESUMO
The Notch receptor mediates a short-range signal that regulates many cell fate decisions. The misregulation of Notch has been linked to cancer and to developmental disorders. Upon binding to its ligands, Delta (Dl) or Serrate (Ser), the Notch ectodomain is shed by the action of an ADAM protease. The Notch intracellular domain is subsequently released proteolytically from the membrane by Presenilin and translocates to the nucleus to activate the transcription factor, Suppressor of Hairless. We show in Drosophila that Notch signaling is limited by the activity of two Nedd4 family HECT domain proteins, Suppressor of deltex [Su(dx)] and DNedd4. We rule out models by which Su(dx) downregulates Notch through modulating Deltex or by limiting the adherens junction accumulation of Notch. Instead, we show that Su(dx) regulates the postendocytic sorting of Notch within the early endosome to an Hrs- and ubiquitin-enriched subdomain en route to the late endosome. We propose a model in which endocytic sorting of Notch mediates a decision between its activation and downregulation. Such intersections between trafficking routes may provide key points at which other signals can modulate Notch activity in both normal development and in the pathological misactivation of Notch.
Assuntos
Endossomos/fisiologia , Regulação da Expressão Gênica , Proteínas de Membrana/metabolismo , Transdução de Sinais/fisiologia , Ubiquitina-Proteína Ligases/metabolismo , Animais , Drosophila , Proteínas de Drosophila/metabolismo , Complexos Endossomais de Distribuição Requeridos para Transporte , Imuno-Histoquímica , Proteínas de Membrana/fisiologia , Modelos Biológicos , Ubiquitina-Proteína Ligases Nedd4 , Transporte Proteico/fisiologia , Receptores Notch , Ubiquitina-Proteína Ligases/fisiologiaRESUMO
As with most machines, the integral safety mechanism on firearms is vital to injury or fatality free operation. Presently, there is little or no standardisation in the design of these mechanisms. In this investigation, five existing designs found on both military and commercial rifles were evaluated ergonomically to determine the most effective characteristics for incorporation into future designs. The designs were evaluated experimentally on ease of use, visual effectiveness and operational impact. Three groups, representing a total of 30 subjects with widely varying experience, were selected. Results strongly suggest that safeties whose actuators are mounted within easy reach of the trigger finger are preferred and have the least operational impact. Subjects also preferred and were more adept at recognising safety status when the indicator was located on the receiver/barrel along the normal line of sight. Subjects more often correctly identified safety status when the indicator utilised colouring, was clearly marked and/or was in the normal line of sight. The results of this research prove that ergonomics can contribute to the understanding of firearm safety dynamics. The two essential components of safety mechanism design identified in this investigation, unambiguous status visibility and impact-free operation, are also likely to have implications in non-firearm safety mechanism design. This is particularly true for devices whose inadvertent operation can cause injury, as well as systems in which operational effectiveness can be jeopardised when attentiveness or operational control are lost in the process of actuating a poorly designed safety mechanism.
Assuntos
Ergonomia , Armas de Fogo , Equipamentos de Proteção , Desenho de EquipamentoRESUMO
The recent proliferation of genome sequencing in diverse fungal species has provided the first opportunity for comparative genome analysis across a eukaryotic kingdom. Here, we report a comparative study of 34 complete fungal genome sequences, representing a broad diversity of Ascomycete, Basidiomycete, and Zygomycete species. We have clustered all predicted protein-encoding gene sequences from these species to provide a means of investigating gene innovations, gene family expansions, protein family diversification, and the conservation of essential gene functions-empirically determined in Saccharomyces cerevisiae-among the fungi. The results are presented with reference to a phylogeny of the 34 fungal species, based on 29 universally conserved protein-encoding gene sequences. We contrast this phylogeny with one based on gene presence and absence and show that, while the two phylogenies are largely in agreement, there are differences in the positioning of some species. We have investigated levels of gene duplication and demonstrate that this varies greatly between fungal species, although there are instances of coduplication in distantly related fungi. We have also investigated the extent of orthology for protein families and demonstrate unexpectedly high levels of diversity among genes involved in lipid metabolism. These analyses have been collated in the e-Fungi data warehouse, providing an online resource for comparative genomic analysis of the fungi.
Assuntos
Fungos/genética , Genes Fúngicos , Variação Genética , Genoma Fúngico , Análise de Sequência de DNA , Evolução Biológica , Duplicação Gênica , FilogeniaRESUMO
BACKGROUND: Cell growth underlies many key cellular and developmental processes, yet a limited number of studies have been carried out on cell-growth regulation. Comprehensive studies at the transcriptional, proteomic and metabolic levels under defined controlled conditions are currently lacking. RESULTS: Metabolic control analysis is being exploited in a systems biology study of the eukaryotic cell. Using chemostat culture, we have measured the impact of changes in flux (growth rate) on the transcriptome, proteome, endometabolome and exometabolome of the yeast Saccharomyces cerevisiae. Each functional genomic level shows clear growth-rate-associated trends and discriminates between carbon-sufficient and carbon-limited conditions. Genes consistently and significantly upregulated with increasing growth rate are frequently essential and encode evolutionarily conserved proteins of known function that participate in many protein-protein interactions. In contrast, more unknown, and fewer essential, genes are downregulated with increasing growth rate; their protein products rarely interact with one another. A large proportion of yeast genes under positive growth-rate control share orthologs with other eukaryotes, including humans. Significantly, transcription of genes encoding components of the TOR complex (a major controller of eukaryotic cell growth) is not subject to growth-rate regulation. Moreover, integrative studies reveal the extent and importance of post-transcriptional control, patterns of control of metabolic fluxes at the level of enzyme synthesis, and the relevance of specific enzymatic reactions in the control of metabolic fluxes during cell growth. CONCLUSION: This work constitutes a first comprehensive systems biology study on growth-rate control in the eukaryotic cell. The results have direct implications for advanced studies on cell growth, in vivo regulation of metabolic fluxes for comprehensive metabolic engineering, and for the design of genome-scale systems biology models of the eukaryotic cell.
Assuntos
Células Eucarióticas/fisiologia , Regulação Fúngica da Expressão Gênica , Saccharomyces cerevisiae/citologia , Saccharomyces cerevisiae/fisiologia , Biologia de Sistemas/métodos , Transcrição Gênica , Carbono/metabolismo , Técnicas de Cultura de Células , Perfilação da Expressão Gênica , Humanos , Proteínas Quinases/genética , Proteínas Quinases/metabolismo , Transdução de Sinais , Serina-Treonina Quinases TORRESUMO
Global studies of protein-protein interactions are crucial to both elucidating gene function and producing an integrated view of the workings of living cells. High-throughput studies of the yeast interactome have been performed using both genetic and biochemical screens. Despite their size, the overlap between these experimental datasets is very limited. This could be due to each approach sampling only a small fraction of the total interactome. Alternatively, a large proportion of the data from these screens may represent false-positive interactions. We have used the Genome Information Management System (GIMS) to integrate interactome datasets with transcriptome and protein annotation data and have found significant evidence that the proportion of false-positive results is high. Not all high-throughput datasets are similarly contaminated, and the tandem affinity purification (TAP) approach appears to yield a high proportion of reliable interactions for which corroborating evidence is available. From our integrative analyses, we have generated a set of verified interactome data for yeast.
RESUMO
Comprehensive protein protein interaction maps promise to reveal many aspects of the complex regulatory network underlying cellular function. Recently, large-scale approaches have predicted many new protein interactions in yeast. To measure their accuracy and potential as well as to identify biases, strengths and weaknesses, we compare the methods with each other and with a reference set of previously reported protein interactions.
Assuntos
Proteínas/metabolismo , Viés , Bases de Dados Factuais , Reações Falso-Positivas , Proteínas Fúngicas/genética , Proteínas Fúngicas/metabolismo , Espectrometria de Massas , Ligação Proteica , Proteínas/genética , RNA Mensageiro/análise , RNA Mensageiro/genética , Padrões de Referência , Sensibilidade e Especificidade , Técnicas do Sistema de Duplo-HíbridoRESUMO
In Drosophila, Suppressor of deltex (Su(dx)) mutations display a wing vein gap phenotype resembling that of Notch gain of function alleles. The Su(dx) protein may therefore act as a negative regulator of Notch but its activity on actual Notch signalling levels has not been demonstrated. Here we show that Su(dx) does regulate the level of Notch signalling in vivo, upstream of Notch target genes and in different developmental contexts, including a previously unknown role in leg joint formation. Overexpression of Su(dx) was capable of blocking both the endogenous activity of Notch and the ectopic Notch signalling induced by the overexpression of Deltex, an intracellular Notch binding protein. In addition, using the conditional phenotype of the Su(dx)(sp) allele, we show that loss of Su(dx) activity is rapidly followed by an up-regulation of E(spl)mbeta expression, the immediate target of Notch signal activation during wing vein development. While Su(dx) adult wing vein phenotypes are quite mild, only affecting the distal tips of the veins, we show that the initial consequence of loss of Su(dx) activity is more severe than previously thought. Using a time-course experiment we show that the phenotype is buffered by feedback regulation illustrating how signalling networks can make development robust to perturbation.
Assuntos
Proteínas de Drosophila , Drosophila/crescimento & desenvolvimento , Drosophila/genética , Genes de Insetos , Ligases/genética , Proteínas de Membrana/genética , Ubiquitina-Proteína Ligases , Alelos , Animais , Animais Geneticamente Modificados , Regulação para Baixo , Drosophila/metabolismo , Extremidades/crescimento & desenvolvimento , Retroalimentação , Regulação da Expressão Gênica no Desenvolvimento , Articulações/crescimento & desenvolvimento , Proteínas de Membrana/metabolismo , Mutação , Fenótipo , Receptores Notch , Transdução de Sinais , Asas de Animais/crescimento & desenvolvimentoRESUMO
Effective analyses in functional genomics require access to many kinds of biological data. For example, the analysis of upregulated genes in a microarray experiment might be aided by information concerning protein interactions or proteins' cellular locations. However, such information is often stored in different formats at different sites, in ways that may not be amenable to integrated analysis. The Genome Information Management System (GIMS) is an object database that integrates genomic data with data on the transcriptome, protein-protein interactions, metabolic pathways and annotations, such as gene ontology terms and identifiers. The resulting system supports the running of analyses over this integrated data resource, and provides comprehensive facilities for handling and interrelating the results of these analyses. GIMS has been used to store Saccharomyces cerevisiae data, and we demonstrate how the integrated storage of diverse types of data can be beneficial for analysis, using combinations of complex queries. As an example, we describe how GIMS has been used to analyse a collection of aryl alcohol dehydrogenase gene deletion mutants. The GIMS database can be accessed remotely using a Java application that can be downloaded from http://img.cs.man.ac.uk/gims.