Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 7 de 7
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
1.
PLoS One ; 3(6): e2300, 2008 Jun 04.
Artículo en Inglés | MEDLINE | ID: mdl-18523684

RESUMEN

Fungi and oomycetes are the causal agents of many of the most serious diseases of plants. Here we report a detailed comparative analysis of the genome sequences of thirty-six species of fungi and oomycetes, including seven plant pathogenic species, that aims to explore the common genetic features associated with plant disease-causing species. The predicted translational products of each genome have been clustered into groups of potential orthologues using Markov Chain Clustering and the data integrated into the e-Fungi object-oriented data warehouse (http://www.e-fungi.org.uk/). Analysis of the species distribution of members of these clusters has identified proteins that are specific to filamentous fungal species and a group of proteins found only in plant pathogens. By comparing the gene inventories of filamentous, ascomycetous phytopathogenic and free-living species of fungi, we have identified a set of gene families that appear to have expanded during the evolution of phytopathogens and may therefore serve important roles in plant disease. We have also characterised the predicted set of secreted proteins encoded by each genome and identified a set of protein families which are significantly over-represented in the secretomes of plant pathogenic fungi, including putative effector proteins that might perturb host cell biology during plant infection. The results demonstrate the potential of comparative genome analysis for exploring the evolution of eukaryotic microbial pathogenesis.


Asunto(s)
Hongos/genética , Genoma Fúngico , Saccharomyces cerevisiae/genética , Evolución Biológica , Especificidad de la Especie
2.
Brief Bioinform ; 9(2): 174-88, 2008 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-18281347

RESUMEN

Proteomics, the study of the protein complement of a biological system, is generating increasing quantities of data from rapidly developing technologies employed in a variety of different experimental workflows. Experimental processes, e.g. for comparative 2D gel studies or LC-MS/MS analyses of complex protein mixtures, involve a number of steps: from experimental design, through wet and dry lab operations, to publication of data in repositories and finally to data annotation and maintenance. The presence of inaccuracies throughout the processing pipeline, however, results in data that can be untrustworthy, thus offsetting the benefits of high-throughput technology. While researchers and practitioners are generally aware of some of the information quality issues associated with public proteomics data, there are few accepted criteria and guidelines for dealing with them. In this article, we highlight factors that impact on the quality of experimental data and review current approaches to information quality management in proteomics. Data quality issues are considered throughout the lifecycle of a proteomics experiment, from experiment design and technique selection, through data analysis, to archiving and sharing.


Asunto(s)
Almacenamiento y Recuperación de la Información , Proteómica , Control de Calidad , Sistemas de Administración de Bases de Datos , Electroforesis en Gel Bidimensional , Almacenamiento y Recuperación de la Información/métodos , Almacenamiento y Recuperación de la Información/normas , Espectrometría de Masas , Proteínas/análisis , Proteómica/instrumentación , Proteómica/métodos , Proteómica/normas , Programas Informáticos
3.
BMC Genomics ; 8: 426, 2007 Nov 20.
Artículo en Inglés | MEDLINE | ID: mdl-18028535

RESUMEN

BACKGROUND: The number of sequenced fungal genomes is ever increasing, with about 200 genomes already fully sequenced or in progress. Only a small percentage of those genomes have been comprehensively studied, for example using techniques from functional genomics. Comparative analysis has proven to be a useful strategy for enhancing our understanding of evolutionary biology and of the less well understood genomes. However, the data required for these analyses tends to be distributed in various heterogeneous data sources, making systematic comparative studies a cumbersome task. Furthermore, comparative analyses benefit from close integration of derived data sets that cluster genes or organisms in a way that eases the expression of requests that clarify points of similarity or difference between species. DESCRIPTION: To support systematic comparative analyses of fungal genomes we have developed the e-Fungi database, which integrates a variety of data for more than 30 fungal genomes. Publicly available genome data, functional annotations, and pathway information has been integrated into a single data repository and complemented with results of comparative analyses, such as MCL and OrthoMCL cluster analysis, and predictions of signaling proteins and the sub-cellular localisation of proteins. To access the data, a library of analysis tasks is available through a web interface. The analysis tasks are motivated by recent comparative genomics studies, and aim to support the study of evolutionary biology as well as community efforts for improving the annotation of genomes. Web services for each query are also available, enabling the tasks to be incorporated into workflows. CONCLUSION: The e-Fungi database provides fungal biologists with a resource for comparative studies of a large range of fungal genomes. Its analysis library supports the comparative study of genome data, functional annotation, and results of large scale analyses over all the genomes stored in the database. The database is accessible at http://www.e-fungi.org.uk, as is the WSDL for the web services.


Asunto(s)
Bases de Datos Genéticas , Genoma Fúngico/genética , Biología Computacional/métodos , Sistemas de Administración de Bases de Datos , Internet , Interfaz Usuario-Computador
4.
Genome Res ; 17(12): 1809-22, 2007 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-17984228

RESUMEN

The recent proliferation of genome sequencing in diverse fungal species has provided the first opportunity for comparative genome analysis across a eukaryotic kingdom. Here, we report a comparative study of 34 complete fungal genome sequences, representing a broad diversity of Ascomycete, Basidiomycete, and Zygomycete species. We have clustered all predicted protein-encoding gene sequences from these species to provide a means of investigating gene innovations, gene family expansions, protein family diversification, and the conservation of essential gene functions-empirically determined in Saccharomyces cerevisiae-among the fungi. The results are presented with reference to a phylogeny of the 34 fungal species, based on 29 universally conserved protein-encoding gene sequences. We contrast this phylogeny with one based on gene presence and absence and show that, while the two phylogenies are largely in agreement, there are differences in the positioning of some species. We have investigated levels of gene duplication and demonstrate that this varies greatly between fungal species, although there are instances of coduplication in distantly related fungi. We have also investigated the extent of orthology for protein families and demonstrate unexpectedly high levels of diversity among genes involved in lipid metabolism. These analyses have been collated in the e-Fungi data warehouse, providing an online resource for comparative genomic analysis of the fungi.


Asunto(s)
Hongos/genética , Genes Fúngicos , Variación Genética , Genoma Fúngico , Análisis de Secuencia de ADN , Evolución Biológica , Duplicación de Gen , Filogenia
5.
Nucleic Acids Res ; 35(16): 5625-33, 2007.
Artículo en Inglés | MEDLINE | ID: mdl-17709344

RESUMEN

It is increasingly common to combine Microarray and Quantitative Trait Loci data to aid the search for candidate genes responsible for phenotypic variation. Workflows provide a means of systematically processing these large datasets and also represent a framework for the re-use and the explicit declaration of experimental methods. In this article, we highlight the issues facing the manual analysis of microarray and QTL data for the discovery of candidate genes underlying complex phenotypes. We show how automated approaches provide a systematic means to investigate genotype-phenotype correlations. This methodology was applied to a use case of resistance to African trypanosomiasis in the mouse. Pathways represented in the results identified Daxx as one of the candidate genes within the Tir1 QTL region. Subsequent re-sequencing in Daxx identified a deletion of an amino acid, identified in susceptible mouse strains, in the Daxx-p53 protein-binding region. This supports recent experimental evidence that apoptosis could be playing a role in the trypanosomiasis resistance phenotype. Workflows developed in this investigation, including a guide to loading and executing them with example data, are available at http://workflows.mygrid.org.uk/repository/myGrid/PaulFisher/.


Asunto(s)
Perfilación de la Expresión Génica , Predisposición Genética a la Enfermedad , Sitios de Carácter Cuantitativo , Tripanosomiasis Africana/genética , Animales , Secuencia de Bases , Proteínas Portadoras/genética , Proteínas Co-Represoras , Genotipo , Inmunidad Innata/genética , Péptidos y Proteínas de Señalización Intracelular/genética , Ratones , Chaperonas Moleculares , Datos de Secuencia Molecular , Proteínas Nucleares/genética , Análisis de Secuencia por Matrices de Oligonucleótidos , Fenotipo , Alineación de Secuencia , Programas Informáticos , Tripanosomiasis Africana/metabolismo
6.
BMC Bioinformatics ; 7: 532, 2006 Dec 14.
Artículo en Inglés | MEDLINE | ID: mdl-17169146

RESUMEN

BACKGROUND: The proliferation of data repositories in bioinformatics has resulted in the development of numerous interfaces that allow scientists to browse, search and analyse the data that they contain. Interfaces typically support repository access by means of web pages, but other means are also used, such as desktop applications and command line tools. Interfaces often duplicate functionality amongst each other, and this implies that associated development activities are repeated in different laboratories. Interfaces developed by public laboratories are often created with limited developer resources. In such environments, reducing the time spent on creating user interfaces allows for a better deployment of resources for specialised tasks, such as data integration or analysis. Laboratories maintaining data resources are challenged to reconcile requirements for software that is reliable, functional and flexible with limitations on software development resources. RESULTS: This paper proposes a model-driven approach for the partial generation of user interfaces for searching and browsing bioinformatics data repositories. Inspired by the Model Driven Architecture (MDA) of the Object Management Group (OMG), we have developed a system that generates interfaces designed for use with bioinformatics resources. This approach helps laboratory domain experts decrease the amount of time they have to spend dealing with the repetitive aspects of user interface development. As a result, the amount of time they can spend on gathering requirements and helping develop specialised features increases. The resulting system is known as Pierre, and has been validated through its application to use cases in the life sciences, including the PEDRoDB proteomics database and the e-Fungi data warehouse. CONCLUSION: MDAs focus on generating software from models that describe aspects of service capabilities, and can be applied to support rapid development of repository interfaces in bioinformatics. The Pierre MDA is capable of supporting common database access requirements with a variety of auto-generated interfaces and across a variety of repositories. With Pierre, four kinds of interfaces are generated: web, stand-alone application, text-menu, and command line. The kinds of repositories with which Pierre interfaces have been used are relational, XML and object databases.


Asunto(s)
Biología Computacional/métodos , Bases de Datos Factuales , Modelos Biológicos , Diseño de Software , Biología Computacional/tendencias , Bases de Datos Factuales/tendencias
7.
Yeast ; 20(15): 1291-306, 2003 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-14618567

RESUMEN

Effective analyses in functional genomics require access to many kinds of biological data. For example, the analysis of upregulated genes in a microarray experiment might be aided by information concerning protein interactions or proteins' cellular locations. However, such information is often stored in different formats at different sites, in ways that may not be amenable to integrated analysis. The Genome Information Management System (GIMS) is an object database that integrates genomic data with data on the transcriptome, protein-protein interactions, metabolic pathways and annotations, such as gene ontology terms and identifiers. The resulting system supports the running of analyses over this integrated data resource, and provides comprehensive facilities for handling and interrelating the results of these analyses. GIMS has been used to store Saccharomyces cerevisiae data, and we demonstrate how the integrated storage of diverse types of data can be beneficial for analysis, using combinations of complex queries. As an example, we describe how GIMS has been used to analyse a collection of aryl alcohol dehydrogenase gene deletion mutants. The GIMS database can be accessed remotely using a Java application that can be downloaded from http://img.cs.man.ac.uk/gims.


Asunto(s)
Sistemas de Administración de Bases de Datos , Bases de Datos Genéticas , Genómica/métodos , Saccharomyces cerevisiae/genética , Biología Computacional/métodos , Almacenamiento y Recuperación de la Información/métodos , Análisis de Secuencia por Matrices de Oligonucleótidos/métodos , Mapeo de Interacción de Proteínas/métodos , Saccharomyces cerevisiae/fisiología , Diseño de Software
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...