Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 60
Filtrar
Más filtros

Bases de datos
País/Región como asunto
Tipo del documento
Intervalo de año de publicación
1.
Nat Methods ; 20(3): 400-402, 2023 03.
Artículo en Inglés | MEDLINE | ID: mdl-36759590

RESUMEN

The design of biocatalytic reaction systems is highly complex owing to the dependency of the estimated kinetic parameters on the enzyme, the reaction conditions, and the modeling method. Consequently, reproducibility of enzymatic experiments and reusability of enzymatic data are challenging. We developed the XML-based markup language EnzymeML to enable storage and exchange of enzymatic data such as reaction conditions, the time course of the substrate and the product, kinetic parameters and the kinetic model, thus making enzymatic data findable, accessible, interoperable and reusable (FAIR). The feasibility and usefulness of the EnzymeML toolbox is demonstrated in six scenarios, for which data and metadata of different enzymatic reactions are collected and analyzed. EnzymeML serves as a seamless communication channel between experimental platforms, electronic lab notebooks, tools for modeling of enzyme kinetics, publication platforms and enzymatic reaction databases. EnzymeML is open and transparent, and invites the community to contribute. All documents and codes are freely available at https://enzymeml.org .


Asunto(s)
Manejo de Datos , Metadatos , Reproducibilidad de los Resultados , Bases de Datos Factuales , Cinética
2.
Mol Syst Biol ; 16(8): e9110, 2020 08.
Artículo en Inglés | MEDLINE | ID: mdl-32845085

RESUMEN

Systems biology has experienced dramatic growth in the number, size, and complexity of computational models. To reproduce simulation results and reuse models, researchers must exchange unambiguous model descriptions. We review the latest edition of the Systems Biology Markup Language (SBML), a format designed for this purpose. A community of modelers and software authors developed SBML Level 3 over the past decade. Its modular form consists of a core suited to representing reaction-based models and packages that extend the core with features suited to other model types including constraint-based models, reaction-diffusion models, logical network models, and rule-based models. The format leverages two decades of SBML and a rich software ecosystem that transformed how systems biologists build and interact with models. More recently, the rise of multiscale models of whole cells and organs, and new data sources such as single-cell measurements and live imaging, has precipitated new ways of integrating data with models. We provide our perspectives on the challenges presented by these developments and how SBML Level 3 provides the foundation needed to support this evolution.


Asunto(s)
Biología de Sistemas/métodos , Animales , Humanos , Modelos Logísticos , Modelos Biológicos , Programas Informáticos
3.
Biochem J ; 477(23): 4559-4580, 2020 12 11.
Artículo en Inglés | MEDLINE | ID: mdl-33290527

RESUMEN

The number of 'small' molecules that may be of interest to chemical biologists - chemical space - is enormous, but the fraction that have ever been made is tiny. Most strategies are discriminative, i.e. have involved 'forward' problems (have molecule, establish properties). However, we normally wish to solve the much harder generative or inverse problem (describe desired properties, find molecule). 'Deep' (machine) learning based on large-scale neural networks underpins technologies such as computer vision, natural language processing, driverless cars, and world-leading performance in games such as Go; it can also be applied to the solution of inverse problems in chemical biology. In particular, recent developments in deep learning admit the in silico generation of candidate molecular structures and the prediction of their properties, thereby allowing one to navigate (bio)chemical space intelligently. These methods are revolutionary but require an understanding of both (bio)chemistry and computer science to be exploited to best advantage. We give a high-level (non-mathematical) background to the deep learning revolution, and set out the crucial issue for chemical biology and informatics as a two-way mapping from the discrete nature of individual molecules to the continuous but high-dimensional latent representation that may best reflect chemical space. A variety of architectures can do this; we focus on a particular type known as variational autoencoders. We then provide some examples of recent successes of these kinds of approach, and a look towards the future.


Asunto(s)
Quimioinformática , Simulación por Computador , Aprendizaje Profundo
4.
Metab Eng ; 60: 168-182, 2020 07.
Artículo en Inglés | MEDLINE | ID: mdl-32335188

RESUMEN

Bio-based production of industrial chemicals using synthetic biology can provide alternative green routes from renewable resources, allowing for cleaner production processes. To efficiently produce chemicals on-demand through microbial strain engineering, biomanufacturing foundries have developed automated pipelines that are largely compound agnostic in their time to delivery. Here we benchmark the capabilities of a biomanufacturing pipeline to enable rapid prototyping of microbial cell factories for the production of chemically diverse industrially relevant material building blocks. Over 85 days the pipeline was able to produce 17 potential material monomers and key intermediates by combining 160 genetic parts into 115 unique biosynthetic pathways. To explore the scale-up potential of our prototype production strains, we optimized the enantioselective production of mandelic acid and hydroxymandelic acid, achieving gram-scale production in fed-batch fermenters. The high success rate in the rapid design and prototyping of microbially-produced material building blocks reveals the potential role of biofoundries in leading the transition to sustainable materials production.


Asunto(s)
Bacterias/metabolismo , Microbiología Industrial/métodos , Ingeniería Metabólica/métodos , Benchmarking , Vías Biosintéticas , Industria Química , Simulación por Computador , Fermentación , Ácidos Mandélicos/metabolismo , Estereoisomerismo
5.
PLoS Biol ; 15(6): e2001414, 2017 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-28662064

RESUMEN

In many disciplines, data are highly decentralized across thousands of online databases (repositories, registries, and knowledgebases). Wringing value from such databases depends on the discipline of data science and on the humble bricks and mortar that make integration possible; identifiers are a core component of this integration infrastructure. Drawing on our experience and on work by other groups, we outline 10 lessons we have learned about the identifier qualities and best practices that facilitate large-scale data integration. Specifically, we propose actions that identifier practitioners (database providers) should take in the design, provision and reuse of identifiers. We also outline the important considerations for those referencing identifiers in various circumstances, including by authors and data generators. While the importance and relevance of each lesson will vary by context, there is a need for increased awareness about how to avoid and manage common identifier problems, especially those related to persistence and web-accessibility/resolvability. We focus strongly on web-based identifiers in the life sciences; however, the principles are broadly relevant to other disciplines.


Asunto(s)
Disciplinas de las Ciencias Biológicas/métodos , Biología Computacional/métodos , Minería de Datos/métodos , Diseño de Software , Programas Informáticos , Disciplinas de las Ciencias Biológicas/estadística & datos numéricos , Disciplinas de las Ciencias Biológicas/tendencias , Biología Computacional/tendencias , Minería de Datos/estadística & datos numéricos , Minería de Datos/tendencias , Bases de Datos Factuales/estadística & datos numéricos , Bases de Datos Factuales/tendencias , Predicción , Humanos , Internet
6.
Molecules ; 25(15)2020 Jul 29.
Artículo en Inglés | MEDLINE | ID: mdl-32751155

RESUMEN

Molecular similarity is an elusive but core "unsupervised" cheminformatics concept, yet different "fingerprint" encodings of molecular structures return very different similarity values, even when using the same similarity metric. Each encoding may be of value when applied to other problems with objective or target functions, implying that a priori none are "better" than the others, nor than encoding-free metrics such as maximum common substructure (MCSS). We here introduce a novel approach to molecular similarity, in the form of a variational autoencoder (VAE). This learns the joint distribution p(z|x) where z is a latent vector and x are the (same) input/output data. It takes the form of a "bowtie"-shaped artificial neural network. In the middle is a "bottleneck layer" or latent vector in which inputs are transformed into, and represented as, a vector of numbers (encoding), with a reverse process (decoding) seeking to return the SMILES string that was the input. We train a VAE on over six million druglike molecules and natural products (including over one million in the final holdout set). The VAE vector distances provide a rapid and novel metric for molecular similarity that is both easily and rapidly calculated. We describe the method and its application to a typical similarity problem in cheminformatics.


Asunto(s)
Quimioinformática/métodos , Modelos Moleculares , Estructura Molecular , Algoritmos , Descubrimiento de Drogas
7.
Bioinformatics ; 34(13): 2327-2329, 2018 07 01.
Artículo en Inglés | MEDLINE | ID: mdl-29949952

RESUMEN

Motivation: Synthetic biology is typified by developing novel genetic constructs from the assembly of reusable synthetic DNA parts, which contain one or more features such as promoters, ribosome binding sites, coding sequences and terminators. PartsGenie is introduced to facilitate the computational design of such synthetic biology parts, bridging the gap between optimization tools for the design of novel parts, the representation of such parts in community-developed data standards such as Synthetic Biology Open Language, and their sharing in journal-recommended data repositories. Consisting of a drag-and-drop web interface, a number of DNA optimization algorithms, and an interface to the well-used data repository JBEI ICE, PartsGenie facilitates the design, optimization and dissemination of reusable synthetic biology parts through an integrated application. Availability and implementation: PartsGenie is freely available at https://parts.synbiochem.co.uk.


Asunto(s)
ADN/análisis , Programas Informáticos , Biología Sintética , Algoritmos , ADN/química
8.
Bioinformatics ; 34(12): 2153-2154, 2018 06 15.
Artículo en Inglés | MEDLINE | ID: mdl-29425325

RESUMEN

Summary: Synthetic biology applies the principles of engineering to biology in order to create biological functionalities not seen before in nature. One of the most exciting applications of synthetic biology is the design of new organisms with the ability to produce valuable chemicals including pharmaceuticals and biomaterials in a greener; sustainable fashion. Selecting the right enzymes to catalyze each reaction step in order to produce a desired target compound is, however, not trivial. Here, we present Selenzyme, a free online enzyme selection tool for metabolic pathway design. The user is guided through several decision steps in order to shortlist the best candidates for a given pathway step. The tool graphically presents key information about enzymes based on existing databases and tools such as: similarity of sequences and of catalyzed reactions; phylogenetic distance between source organism and intended host species; multiple alignment highlighting conserved regions, predicted catalytic site, and active regions and relevant properties such as predicted solubility and transmembrane regions. Selenzyme provides bespoke sequence selection for automated workflows in biofoundries. Availability and implementation: The tool is integrated as part of the pathway design stage into the design-build-test-learn SYNBIOCHEM pipeline. The Selenzyme web server is available at http://selenzyme.synbiochem.co.uk. Supplementary information: Supplementary data are available at Bioinformatics online.


Asunto(s)
Redes y Vías Metabólicas , Programas Informáticos , Biología Sintética/métodos , Bases de Datos Factuales , Enzimas/genética , Internet , Filogenia
10.
PLoS Biol ; 13(12): e1002310, 2015 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-26633141

RESUMEN

Synthetic Biology Open Language (SBOL) Visual is a graphical standard for genetic engineering. It consists of symbols representing DNA subsequences, including regulatory elements and DNA assembly features. These symbols can be used to draw illustrations for communication and instruction, and as image assets for computer-aided design. SBOL Visual is a community standard, freely available for personal, academic, and commercial use (Creative Commons CC0 license). We provide prototypical symbol images that have been used in scientific publications and software tools. We encourage users to use and modify them freely, and to join the SBOL Visual community: http://www.sbolstandard.org/visual.


Asunto(s)
Cromatina/química , ADN/química , Ingeniería Genética/métodos , Modelos Genéticos , Simbolismo , Animales , Cromatina/metabolismo , Ensamble y Desensamble de Cromatina , Diseño Asistido por Computadora , Conducta Cooperativa , ADN/metabolismo , Bases de Datos de Ácidos Nucleicos , Ingeniería Genética/normas , Ingeniería Genética/tendencias , Humanos , Internet , Motivos de Nucleótidos , Publicaciones , Secuencias Reguladoras de Ácidos Nucleicos , Programas Informáticos
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA