Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 11 de 11
Filtrar
1.
PLoS Comput Biol ; 18(6): e1010194, 2022 06.
Artículo en Inglés | MEDLINE | ID: mdl-35687595

RESUMEN

Atlantic salmon (Salmo salar) is the most valuable farmed fish globally and there is much interest in optimizing its genetics and rearing conditions for growth and feed efficiency. Marine feed ingredients must be replaced to meet global demand, with challenges for fish health and sustainability. Metabolic models can address this by connecting genomes to metabolism, which converts nutrients in the feed to energy and biomass, but such models are currently not available for major aquaculture species such as salmon. We present SALARECON, a model focusing on energy, amino acid, and nucleotide metabolism that links the Atlantic salmon genome to metabolic fluxes and growth. It performs well in standardized tests and captures expected metabolic (in)capabilities. We show that it can explain observed hypoxic growth in terms of metabolic fluxes and apply it to aquaculture by simulating growth with commercial feed ingredients. Predicted limiting amino acids and feed efficiencies agree with data, and the model suggests that marine feed efficiency can be achieved by supplementing a few amino acids to plant- and insect-based feeds. SALARECON is a high-quality model that makes it possible to simulate Atlantic salmon metabolism and growth. It can be used to explain Atlantic salmon physiology and address key challenges in aquaculture such as development of sustainable feeds.


Asunto(s)
Alimentación Animal , Salmo salar , Aminoácidos/genética , Alimentación Animal/análisis , Animales , Acuicultura , Salmo salar/genética
2.
Bioinformatics ; 34(8): 1401-1403, 2018 04 15.
Artículo en Inglés | MEDLINE | ID: mdl-29186322

RESUMEN

Summary: To unlock the full potential of genome data and to enhance data interoperability and reusability of genome annotations we have developed SAPP, a Semantic Annotation Platform with Provenance. SAPP is designed as an infrastructure supporting FAIR de novo computational genomics but can also be used to process and analyze existing genome annotations. SAPP automatically predicts, tracks and stores structural and functional annotations and associated dataset- and element-wise provenance in a Linked Data format, thereby enabling information mining and retrieval with Semantic Web technologies. This greatly reduces the administrative burden of handling multiple analysis tools and versions thereof and facilitates multi-level large scale comparative analysis. Availability and implementation: SAPP is written in JAVA and freely available at https://gitlab.com/sapp and runs on Unix-like operating systems. The documentation, examples and a tutorial are available at https://sapp.gitlab.io. Contact: jasperkoehorst@gmail.com or peter.schaap@wur.nl.


Asunto(s)
Genómica/métodos , Anotación de Secuencia Molecular , Programas Informáticos , Semántica
3.
BMC Bioinformatics ; 19(1): 403, 2018 Nov 06.
Artículo en Inglés | MEDLINE | ID: mdl-30400817

RESUMEN

BACKGROUND: Systems biology takes a holistic approach by handling biomolecules and their interactions as big systems. Network based approach has emerged as a natural way to model these systems with the idea of representing biomolecules as nodes and their interactions as edges. Very often the input data come from various sorts of omics analyses. Those resulting networks sometimes describe a wide range of aspects, for example different experiment conditions, species, tissue types, stimulating factors, mutants, or simply distinct interaction features of the same network produced by different algorithms. For these scenarios, synchronous visualization of more than one distinct network is an excellent mean to explore all the relevant networks efficiently. In addition, complementary analysis methods are needed and they should work in a workflow manner in order to gain maximal biological insights. RESULTS: In order to address the aforementioned needs, we have developed a Synchronous Network Data Integration (SyNDI) framework. This framework contains SyncVis, a Cytoscape application for user-friendly synchronous and simultaneous visualization of multiple biological networks, and it is seamlessly integrated with other bioinformatics tools via the Galaxy platform. We demonstrated the functionality and usability of the framework with three biological examples - we analyzed the distinct connectivity of plasma metabolites in networks associated with high or low latent cardiovascular disease risk; deeper insights were obtained from a few similar inflammatory response pathways in Staphylococcus aureus infection common to human and mouse; and regulatory motifs which have not been reported associated with transcriptional adaptations of Mycobacterium tuberculosis were identified. CONCLUSIONS: Our SyNDI framework couples synchronous network visualization seamlessly with additional bioinformatics tools. The user can easily tailor the framework for his/her needs by adding new tools and datasets to the Galaxy platform.


Asunto(s)
Biología Computacional/métodos , Regulación Bacteriana de la Expresión Génica , Redes Reguladoras de Genes , Mycobacterium tuberculosis/genética , Programas Informáticos , Staphylococcus aureus/genética , Algoritmos , Animales , Humanos , Ratones , Modelos Biológicos
4.
Int J Mol Sci ; 19(2)2018 Jan 24.
Artículo en Inglés | MEDLINE | ID: mdl-29364195

RESUMEN

Tuberculosis remains one of the deadliest diseases. Emergence of drug-resistant and multidrug-resistant M. tuberculosis strains makes treating tuberculosis increasingly challenging. In order to develop novel intervention strategies, detailed understanding of the molecular mechanisms behind the success of this pathogen is required. Here, we review recent literature to provide a systems level overview of the molecular and cellular components involved in divalent metal homeostasis and their role in regulating the three main virulence strategies of M. tuberculosis: immune modulation, dormancy and phagosomal rupture. We provide a visual and modular overview of these components and their regulation. Our analysis identified a single regulatory cascade for these three virulence strategies that respond to limited availability of divalent metals in the phagosome.


Asunto(s)
Interacciones Huésped-Patógeno , Mycobacterium tuberculosis/fisiología , Tuberculosis/microbiología , Cationes Bivalentes/metabolismo , Ambiente , Regulación Bacteriana de la Expresión Génica , Interacción Gen-Ambiente , Interacciones Huésped-Patógeno/inmunología , Humanos , Inmunomodulación , Tuberculosis Latente/inmunología , Tuberculosis Latente/microbiología , Macrófagos/inmunología , Macrófagos/metabolismo , Macrófagos/microbiología , Metales/metabolismo , Mycobacterium tuberculosis/patogenicidad , Oxidación-Reducción , Fagosomas , Transducción de Señal , Tuberculosis/tratamiento farmacológico , Tuberculosis/inmunología , Tuberculosis/patología , Virulencia
5.
BMC Genomics ; 13: 535, 2012 Oct 06.
Artículo en Inglés | MEDLINE | ID: mdl-23039964

RESUMEN

BACKGROUND: Although many diseases have been well characterized at the molecular level, the underlying mechanisms are often unknown. Nearly half of all human genes remain poorly studied, yet these genes may contribute to a number of disease processes. Genes involved in common biological processes and diseases are often co-expressed. Using known disease-associated genes in a co-expression analysis may help identify and prioritize novel candidate genes for further study. RESULTS: We have created an online tool, called GeneFriends, which identifies co-expressed genes in over 1,000 mouse microarray datasets. GeneFriends can be used to assign putative functions to poorly studied genes. Using a seed list of disease-associated genes and a guilt-by-association method, GeneFriends allows users to quickly identify novel genes and transcription factors associated with a disease or process. We tested GeneFriends using seed lists for aging, cancer, and mitochondrial complex I disease. We identified several candidate genes that have previously been predicted as relevant targets. Some of the genes identified are already being tested in clinical trials, indicating the effectiveness of this approach. Co-expressed transcription factors were investigated, identifying C/ebp genes as candidate regulators of aging. Furthermore, several novel candidate genes, that may be suitable for experimental or clinical follow-up, were identified. Two of the novel candidates of unknown function that were co-expressed with cancer-associated genes were selected for experimental validation. Knock-down of their human homologs (C1ORF112 and C12ORF48) in HeLa cells slowed growth, indicating that these genes of unknown function, identified by GeneFriends, may be involved in cancer. CONCLUSIONS: GeneFriends is a resource for biologists to identify and prioritize novel candidate genes involved in biological processes and complex diseases. It is an intuitive online resource that will help drive experimentation. GeneFriends is available online at: http://genefriends.org/.


Asunto(s)
Envejecimiento/genética , Biología Computacional/métodos , Bases de Datos Genéticas , Análisis de Secuencia por Matrices de Oligonucleótidos , Animales , Complejo I de Transporte de Electrón/deficiencia , Complejo I de Transporte de Electrón/genética , Humanos , Internet , Ratones , Enfermedades Mitocondriales/genética , Neoplasias/genética
6.
Front Genet ; 10: 1366, 2019.
Artículo en Inglés | MEDLINE | ID: mdl-32117417

RESUMEN

NG-Tax 2.0 is a semantic framework for FAIR high-throughput analysis and classification of marker gene amplicon sequences including bacterial and archaeal 16S ribosomal RNA (rRNA), eukaryotic 18S rRNA and ribosomal intergenic transcribed spacer sequences. It can directly use single or merged reads, paired-end reads and unmerged paired-end reads from long range fragments as input to generate de novo amplicon sequence variants (ASV). Using the RDF data model, ASV's can be automatically stored in a graph database as objects that link ASV sequences with the full data-wise and element-wise provenance, thereby achieving the level of interoperability required to utilize such data to its full potential. The graph database can be directly queried, allowing for comparative analyses of over thousands of samples and is connected with an interactive Rshiny toolbox for analysis and visualization of (meta) data. Additionally, NG-Tax 2.0 exports an extended BIOM 1.0 (JSON) file as starting point for further analyses by other means. The extended BIOM file contains new attribute types to include information about the command arguments used, the sequences of the ASVs formed, classification confidence scores and is backwards compatible. The performance of NG-Tax 2.0 was compared with DADA2, using the plugin in the QIIME 2 analysis pipeline. Fourteen 16S rRNA gene amplicon mock community samples were obtained from the literature and evaluated. Precision of NG-Tax 2.0 was significantly higher with an average of 0.95 vs 0.58 for QIIME2-DADA2 while recall was comparable with an average of 0.85 and 0.77, respectively. NG-Tax 2.0 is written in Java. The code, the ontology, a Galaxy platform implementation, the analysis toolbox, tutorials and example SPARQL queries are freely available at http://wurssb.gitlab.io/ngtax under the MIT License.

7.
Metabolites ; 9(2)2019 Feb 06.
Artículo en Inglés | MEDLINE | ID: mdl-30736318

RESUMEN

Genome-scale metabolic models (GEMs) are manually curated repositories describing the metabolic capabilities of an organism. GEMs have been successfully used in different research areas, ranging from systems medicine to biotechnology. However, the different naming conventions (namespaces) of databases used to build GEMs limit model reusability and prevent the integration of existing models. This problem is known in the GEM community, but its extent has not been analyzed in depth. In this study, we investigate the name ambiguity and the multiplicity of non-systematic identifiers and we highlight the (in)consistency in their use in 11 biochemical databases of biochemical reactions and the problems that arise when mapping between different namespaces and databases. We found that such inconsistencies can be as high as 83.1%, thus emphasizing the need for strategies to deal with these issues. Currently, manual verification of the mappings appears to be the only solution to remove inconsistencies when combining models. Finally, we discuss several possible approaches to facilitate (future) unambiguous mapping.

8.
Sci Data ; 6(1): 254, 2019 11 04.
Artículo en Inglés | MEDLINE | ID: mdl-31685817

RESUMEN

The RDF data model facilitates integration of diverse data available in structured and semi-structured formats. To obtain a coherent RDF graph the chosen ontology must be consistently applied. However, addition of new diverse data causes the ontology to evolve, which could lead to accumulation of unintended erroneous composites. Thus, there is a need for a gate keeping system that compares the intended content described in the ontology with the actual content of the resource. The Empusa code generator facilitates creation of composite RDF resources from disparate sources. Empusa can convert a schema into an associated application programming interface (API), that can be used to perform data consistency checks and generates Markdown documentation to make persistent URLs resolvable. Using Empusa consistency is ensured within and between the ontology and the content of the resource. As an illustration of the potential of Empusa, we present the Genome Biology Ontology Language (GBOL). GBOL uses and extends current ontologies to provide a formal representation of genomic entities, along with their properties, relations and provenance.


Asunto(s)
Ontología de Genes , Genoma , Programas Informáticos , Animales , Humanos , Anotación de Secuencia Molecular
9.
Sci Rep ; 6: 38699, 2016 12 06.
Artículo en Inglés | MEDLINE | ID: mdl-27922098

RESUMEN

Pseudomonas is a highly versatile genus containing species that can be harmful to humans and plants while others are widely used for bioengineering and bioremediation. We analysed 432 sequenced Pseudomonas strains by integrating results from a large scale functional comparison using protein domains with data from six metabolic models, nearly a thousand transcriptome measurements and four large scale transposon mutagenesis experiments. Through heterogeneous data integration we linked gene essentiality, persistence and expression variability. The pan-genome of Pseudomonas is closed indicating a limited role of horizontal gene transfer in the evolutionary history of this genus. A large fraction of essential genes are highly persistent, still non essential genes represent a considerable fraction of the core-genome. Our results emphasize the power of integrating large scale comparative functional genomics with heterogeneous data for exploring bacterial diversity and versatility.


Asunto(s)
Biología Computacional , Metabolismo Energético , Regulación Bacteriana de la Expresión Génica , Genoma Bacteriano , Genómica , Pseudomonas/genética , Pseudomonas/metabolismo , Biología Computacional/métodos , Genómica/métodos , Humanos , Anotación de Secuencia Molecular , Filogenia , Pseudomonas/clasificación , Flujo de Trabajo
10.
J Biomed Semantics ; 6: 39, 2015.
Artículo en Inglés | MEDLINE | ID: mdl-26500754

RESUMEN

BACKGROUND: Semantic web technologies have a tremendous potential for the integration of heterogeneous data sets. Therefore, an increasing number of widely used biological resources are becoming available in the RDF data model. There are however, no tools available that provide structural overviews of these resources. Such structural overviews are essential to efficiently query these resources and to assess their structural integrity and design, thereby strengthening their use and potential. RESULTS: Here we present RDF2Graph, a tool that automatically recovers the structure of an RDF resource. The generated overview allows to create complex queries on these resources and to structurally validate newly created resources. CONCLUSION: RDF2Graph facilitates the creation of complex queries thereby enabling access to knowledge stored across multiple RDF resources. RDF2Graph facilitates creation of high quality resources and resource descriptions, which in turn increases usability of the semantic web technologies.

11.
BMC Syst Biol ; 8: 111, 2014 Sep 26.
Artículo en Inglés | MEDLINE | ID: mdl-25279447

RESUMEN

BACKGROUND: Different methods have been developed to infer regulatory networks from heterogeneous omics datasets and to construct co-expression networks. Each algorithm produces different networks and efforts have been devoted to automatically integrate them into consensus sets. However each separate set has an intrinsic value that is diluted and partly lost when building a consensus network. Here we present a methodology to generate co-expression networks and, instead of a consensus network, we propose an integration framework where the different networks are kept and analysed with additional tools to efficiently combine the information extracted from each network. RESULTS: We developed a workflow to efficiently analyse information generated by different inference and prediction methods. Our methodology relies on providing the user the means to simultaneously visualise and analyse the coexisting networks generated by different algorithms, heterogeneous datasets, and a suite of analysis tools. As a show case, we have analysed the gene co-expression networks of Mycobacterium tuberculosis generated using over 600 expression experiments. Regarding DNA damage repair, we identified SigC as a key control element, 12 new targets for LexA, an updated LexA binding motif, and a potential mismatch repair system. We expanded the DevR regulon with 27 genes while identifying 9 targets wrongly assigned to this regulon. We discovered 10 new genes linked to zinc uptake and a new regulatory mechanism for ZuR. The use of co-expression networks to perform system level analysis allows the development of custom made methodologies. As show cases we implemented a pipeline to integrate ChIP-seq data and another method to uncover multiple regulatory layers. CONCLUSIONS: Our workflow is based on representing the multiple types of information as network representations and presenting these networks in a synchronous framework that allows their simultaneous visualization while keeping specific associations from the different networks. By simultaneously exploring these networks and metadata, we gained insights into regulatory mechanisms in M. tuberculosis that could not be obtained through the separate analysis of each data type.


Asunto(s)
Algoritmos , Regulación Bacteriana de la Expresión Génica/fisiología , Redes Reguladoras de Genes/fisiología , Modelos Biológicos , Mycobacterium tuberculosis/metabolismo , Inmunoprecipitación de Cromatina , Regulación Bacteriana de la Expresión Génica/genética , Redes Reguladoras de Genes/genética , Mycobacterium tuberculosis/genética , Análisis de Secuencia de ARN
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA