Pesquisa | Biblioteca Virtual em Saúde

Toward the Reconciliation of Inconsistent Molecular Structures from Biochemical Databases.

Eriksen, Casper Asbjørn; Andersen, Jakob Lykke; Fagerberg, Rolf; Merkle, Daniel.

J Comput Biol ; 31(6): 498-512, 2024 Jun.

Artigo em Inglês | MEDLINE | ID: mdl-38758924

RESUMO

Information on the structure of molecules, retrieved via biochemical databases, plays a pivotal role in various disciplines, including metabolomics, systems biology, and drug discovery. No such database can be complete and it is often necessary to incorporate data from several sources. However, the molecular structure for a given compound is not necessarily consistent between databases. This article presents StructRecon, a novel tool for resolving unique molecular structures from database identifiers. Currently, identifiers from BiGG, ChEBI, Escherichia coli Metabolome Database (ECMDB), MetaNetX, and PubChem are supported. StructRecon traverses the cross-links between entries in different databases to construct what we call identifier graphs. The goal of these graphs is to offer a more complete view of the total information available on a given compound across all the supported databases. To reconcile discrepancies met during the traversal of the databases, we develop an extensible model for molecular structure supporting multiple independent levels of detail, which allows standardization of the structure to be applied iteratively. In some cases, our standardization approach results in multiple candidate structures for a given compound, in which case a random walk-based algorithm is used to select the most likely structure among incompatible alternatives. As a case study, we applied StructRecon to the EColiCore2 model. We found at least one structure for 98.66% of its compounds, which is more than twice as many as possible when using the databases in more standard ways not considering the complex network of cross-database references captured by our identifier graphs. StructRecon is open-source and modular, which enables support for more databases in the future.

Assuntos

Escherichia coli , Escherichia coli/genética , Bases de Dados Factuais , Software , Estrutura Molecular , Algoritmos , Metabolômica/métodos , Bases de Dados de Compostos Químicos , Biologia Computacional/métodos , Metaboloma

An open source computational workflow for the discovery of autocatalytic networks in abiotic reactions.

Arya, Aayush; Ray, Jessica; Sharma, Siddhant; Cruz Simbron, Romulo; Lozano, Alejandro; Smith, Harrison B; Andersen, Jakob Lykke; Chen, Huan; Meringer, Markus; Cleaves, Henderson James.

Chem Sci ; 13(17): 4838-4853, 2022 May 04.

Artigo em Inglês | MEDLINE | ID: mdl-35655880

RESUMO

A central question in origins of life research is how non-entailed chemical processes, which simply dissipate chemical energy because they can do so due to immediate reaction kinetics and thermodynamics, enabled the origin of highly-entailed ones, in which concatenated kinetically and thermodynamically favorable processes enhanced some processes over others. Some degree of molecular complexity likely had to be supplied by environmental processes to produce entailed self-replicating processes. The origin of entailment, therefore, must connect to fundamental chemistry that builds molecular complexity. We present here an open-source chemoinformatic workflow to model abiological chemistry to discover such entailment. This pipeline automates generation of chemical reaction networks and their analysis to discover novel compounds and autocatalytic processes. We demonstrate this pipeline's capabilities against a well-studied model system by vetting it against experimental data. This workflow can enable rapid identification of products of complex chemistries and their underlying synthetic relationships to help identify autocatalysis, and potentially self-organization, in such systems. The algorithms used in this study are open-source and reconfigurable by other user-developed workflows.

RESUMO

Assuntos

RESUMO

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA