Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 14 de 14
Filter
Add more filters











Publication year range
1.
ACS Synth Biol ; 13(9): 3051-3055, 2024 Sep 20.
Article in English | MEDLINE | ID: mdl-39230953

ABSTRACT

The progress and utility of synthetic biology is currently hindered by the lengthy process of studying literature and replicating poorly documented work. Reconstruction of crucial design information through post hoc curation is highly noisy and error-prone. To combat this, author participation during the curation process is crucial. To encourage author participation without overburdening them, an ML-assisted curation tool called SeqImprove has been developed. Using named entity recognition, called entity normalization, and sequence matching, SeqImprove creates machine-accessible sequence data and metadata annotations, which authors can then review and edit before submitting a final sequence file. SeqImprove makes it easier for authors to submit sequence data that is FAIR (findable, accessible, interoperable, and reusable).


Subject(s)
Machine Learning , Synthetic Biology , Synthetic Biology/methods , Software , Gene Regulatory Networks/genetics , Data Curation/methods
2.
Brief Funct Genomics ; 2024 Aug 24.
Article in English | MEDLINE | ID: mdl-39183066

ABSTRACT

Transcriptomics is the study of RNA transcripts, the portion of the genome that is transcribed, in a specific cell, tissue, or organism. Transcriptomics provides insight into gene expression patterns, regulation, and the underlying mechanisms of cellular processes. Community transcriptomics takes this a step further by studying the RNA transcripts from environmental assemblies of organisms, with the intention of better understanding the interactions between members of the community. Community transcriptomics requires successful extraction of RNA from a diverse set of organisms and subsequent analysis via mapping those reads to a reference genome or de novo assembly of the reads. Both, extraction protocols and the analysis steps can pose hurdles for community transcriptomics. This review covers advances in transcriptomic techniques and assesses the viability of applying them to community transcriptomics.

3.
Nat Commun ; 14(1): 2953, 2023 May 23.
Article in English | MEDLINE | ID: mdl-37221178
4.
ACS Synth Biol ; 12(4): 1364-1370, 2023 04 21.
Article in English | MEDLINE | ID: mdl-36995948

ABSTRACT

Accelerating the development of synthetic biology applications requires reproducible experimental findings. Different standards and repositories exist to exchange experimental data and metadata. However, the associated software tools often do not support a uniform data capture, encoding, and exchange of information. A connection between digital repositories is required to prevent siloing and loss of information. To this end, we developed the Experimental Data Connector (XDC). It captures experimental data and related metadata by encoding it in standard formats and storing the converted data in digital repositories. Experimental data is then uploaded to Flapjack and the metadata to SynBioHub in a consistent manner linking these repositories. This produces complete connected experimental data sets that are exchangeable. The information is captured using a single template Excel Workbook, which can be integrated into existing experimental workflow automation processes and semiautomated capture of results.


Subject(s)
Metadata , Software , Synthetic Biology/methods , Workflow , Automation
5.
ACS Synth Biol ; 12(1): 340-346, 2023 01 20.
Article in English | MEDLINE | ID: mdl-36595709

ABSTRACT

Standards support synthetic biology research by enabling the exchange of component information. However, using formal representations, such as the Synthetic Biology Open Language (SBOL), typically requires either a thorough understanding of these standards or a suite of tools developed in concurrence with the ontologies. Since these tools may be a barrier for use by many practitioners, the Excel-SBOL Converter was developed to facilitate the use of SBOL and integration into existing workflows. The converter consists of two Python libraries: one that converts Excel templates to SBOL and another that converts SBOL to an Excel workbook. Both libraries can be used either directly or via a SynBioHub plugin.


Subject(s)
Programming Languages , Synthetic Biology , Language , Reference Standards , Workflow , Software
6.
ACS Synth Biol ; 11(2): 990-995, 2022 02 18.
Article in English | MEDLINE | ID: mdl-35060706

ABSTRACT

The ability to search for a part by its sequence is crucial for a large repository of parts. Prior to this work, however, this was not possible on SynBioHub. Sequence-based search is now integrated into SynBioHub, allowing users to find a part by a sequence provided in plain text or a supported file format. This sequence-based search feature is accessible to users via SynBioHub's web interface, or programmatically through its API. The core implementation of the tool uses VSEARCH, an open source, global alignment search tool, and it is integrated into SBOLExplorer, an open source distributed search engine used by SynBioHub. We present a new approach to scoring part similarity using SBOLExplorer, which takes into account both the popularity and percentage match of parts.


Subject(s)
Software , Synthetic Biology , Internet
7.
ACS Synth Biol ; 10(11): 3200-3204, 2021 11 19.
Article in English | MEDLINE | ID: mdl-34757736

ABSTRACT

Much progress has been made in developing tools to generate component-based design representations of biological systems from standard libraries of parts. Most biological designs, however, are still specified at the sequence level. Consequently, there exists a need for a tool that can be used to automatically infer component-based design representations from sequences, particularly in cases when those sequences have minimal levels of annotation. Such a tool would assist computational synthetic biologists in bridging the gap between the outputs of sequence editors and the inputs to more sophisticated design tools, and it would facilitate their development of automated workflows for design curation and quality control. Accordingly, we introduce Synthetic Biology Curation Tools (SYNBICT), a Python tool suite for automation-assisted annotation, curation, and functional inference for genetic designs. We have validated SYNBICT by applying it to genetic designs in the DARPA Synergistic Discovery & Design (SD2) program and the International Genetically Engineered Machines (iGEM) 2018 distribution. Most notably, SYNBICT is more automated and parallelizable than manual design editors, and it can be applied to interpret existing designs instead of only generating new ones.


Subject(s)
Synthetic Biology/methods , Automation/methods , Computational Biology/methods , Models, Biological , Quality Control , Software , Workflow
8.
ACS Synth Biol ; 10(10): 2532-2540, 2021 10 15.
Article in English | MEDLINE | ID: mdl-34606710

ABSTRACT

In synthetic biology, combinational circuits are used to program cells for various new applications like biosensors, drug delivery systems, and biofuels. Similar to asynchronous electronic circuits, some combinational genetic circuits may show unwanted switching variations (glitches) caused by multiple input changes. Depending on the biological circuit, glitches can cause irreversible effects and jeopardize the circuit's functionality. This paper presents a stochastic analysis to predict glitch propensities for three implementations of a genetic circuit with known glitching behavior. The analysis uses STochastic Approximate Model-checker for INfinite-state Analysis (STAMINA), a tool for stochastic verification. The STAMINA results were validated by comparison to stochastic simulation in iBioSim resulting in further improvements of STAMINA. This paper demonstrates that stochastic verification can be utilized by genetic designers to evaluate design choices and input restrictions to achieve a desired reliability of operation.


Subject(s)
Gene Regulatory Networks , Synthetic Biology/methods , Biosensing Techniques , Models, Theoretical , Probability , Stochastic Processes
9.
ACS Synth Biol ; 10(10): 2592-2606, 2021 10 15.
Article in English | MEDLINE | ID: mdl-34546707

ABSTRACT

As an engineering endeavor, synthetic biology requires effective sharing of genetic design information that can be reused in the construction of new designs. While there are a number of large community repositories of design information, curation of this information has been limited. This in turn limits the ways in which design information can be put to use. The aim of this work was to improve this situation by creating a curated library of parts from the International Genetically Engineered Machines (iGEM) registry data set. To this end, an analysis of the Synthetic Biology Open Language (SBOL) version of the iGEM registry was carried out using four different approaches-simple statistics, SnapGene autoannotation, SYNBICT autoannotation, and expert analysis-the results of which are presented herein. Key challenges encountered include the use of free text, insufficient part provenance, part duplication, lack of part removal, and insufficient continuous curation. On the basis of these analyses, the focus has shifted from the creation of a curated iGEM part library to instead the extraction of a set of lessons, which are presented here. These lessons can be exploited to facilitate the creation and curation of other part libraries using a simpler and less labor intensive process.


Subject(s)
Datasets as Topic , Synthetic Biology/methods , Automation , Programming Languages
10.
ACS Synth Biol ; 10(9): 2276-2285, 2021 09 17.
Article in English | MEDLINE | ID: mdl-34387462

ABSTRACT

The Synthetic Biology Knowledge System (SBKS) is an instance of the SynBioHub repository that includes text and data information that has been mined from papers published in ACS Synthetic Biology. This paper describes the SBKS curation framework that is being developed to construct the knowledge stored in this repository. The text mining pipeline performs automatic annotation of the articles using natural language processing techniques to identify salient content such as key terms, relationships between terms, and main topics. The data mining pipeline performs automatic annotation of the sequences extracted from the supplemental documents with the genetic parts used in them. Together these two pipelines link genetic parts to papers describing the context in which they are used. Ultimately, SBKS will reduce the time necessary for synthetic biologists to find the information necessary to complete their designs.


Subject(s)
Synthetic Biology , User-Computer Interface , Animals , Cell Line , Data Mining , Humans
11.
ACS Synth Biol ; 10(8): 2111-2115, 2021 08 20.
Article in English | MEDLINE | ID: mdl-34324811

ABSTRACT

VisBOL is a web-based visualization tool used to depict genetic circuit designs. This tool depicts simple DNA circuits adequately, but it has become increasingly outdated as new versions of SBOL Visual were released. This paper introduces VisBOL2, a heavily redesigned version of VisBOL that makes a number of improvements to the original VisBOL, including proper functional interaction rendering, dynamic viewing, a more maintainable code base, and modularity that facilitates compatibility with other software tools. This modularity is demonstrated by incorporating VisBOL2 into a sequence visualization plugin for SynBioHub.


Subject(s)
Gene Regulatory Networks , Models, Genetic , Software , Synthetic Biology
12.
ACS Synth Biol ; 9(5): 1216-1220, 2020 05 15.
Article in English | MEDLINE | ID: mdl-32275821

ABSTRACT

SynBioHub is a repository for synthetic genetic designs represented in the Synthetic Biology Open Language (SBOL). To integrate SynBioHub into more synthetic biology workflows, its data processing capabilities need to be expanded. To this end, a plugin interface has been developed. Plugins can be developed for data submission, visualization, and download. This framework was tested by the development of three example plugins, one of each type as follows: one allowing the submission of SnapGene files, one visualizing the course of different genetic parts, and one preparing plasmid maps for download.


Subject(s)
Software , Synthetic Biology
13.
Bioprocess Biosyst Eng ; 42(4): 657-663, 2019 Apr.
Article in English | MEDLINE | ID: mdl-30617419

ABSTRACT

The biologics sector has amassed a wealth of data in the past three decades, in line with the bioprocess development and manufacturing guidelines, and analysis of these data with precision is expected to reveal behavioural patterns in cell populations that can be used for making predictions on how future culture processes might behave. The historical bioprocessing data likely comprise experiments conducted using different cell lines, to produce different products and may be years apart; the situation causing inter-batch variability and missing data points to human- and instrument-associated technical oversights. These unavoidable complications necessitate the introduction of a pre-processing step prior to data mining. This study investigated the efficiency of mean imputation and multivariate regression for filling in the missing information in historical bio-manufacturing datasets, and evaluated their performance by symbolic regression models and Bayesian non-parametric models in subsequent data processing. Mean substitution was shown to be a simple and efficient imputation method for relatively smooth, non-dynamical datasets, and regression imputation was effective whilst maintaining the existing standard deviation and shape of the distribution in dynamical datasets with less than 30% missing data. The nature of the missing information, whether Missing Completely At Random, Missing At Random or Missing Not At Random, emerged as the key feature for selecting the imputation method.


Subject(s)
Biological Products , Databases, Factual , Electronic Data Processing , Heuristics , Models, Theoretical
14.
Pediatr Res ; 80(5): 675-680, 2016 11.
Article in English | MEDLINE | ID: mdl-27603562

ABSTRACT

BACKGROUND: Near-infrared spectroscopy (NIRS) may assist with characterization of a hemodynamically significant patent ductus arteriosus (hsPDA) by measuring cerebral and renal saturation (Csat and Rsat) levels. We hypothesized that Csat and Rsat in preterm infants with an hsPDA would be decreased compared to those with no PDA or nonsignificant PDA. METHODS: This non a-priori designed study retrospectively investigated clinical and ECHO characteristics of preterm infants <29 wk gestation who underwent routine NIRS monitoring. Logistic regression assessed association between NIRS measures and an hsPDA by ECHO. RESULTS: Of 47 infants, 21 had a confirmed hsPDA by ECHO, 14 had a nonsignificant PDA, and 12 had no ECHO performed due to low clinical suspicion for PDA. Logistic regression adjusted for gestational age found that lower Rsat was associated with an hsPDA by ECHO (OR 0.9, 95% CI 0.83-0.98, P = 0.01). Using ROC curves, Rsat < 66% identified an hsPDA with a sensitivity of 81% and specificity of 77%, while Csat was not significant. CONCLUSION: Low Rsat < 66% was associated with the presence of an hsPDA in the preterm infant. Csat may be preserved if cerebral autoregulation is largely intact. Bedside NIRS monitoring may reasonably increase suspicion for a significant PDA in the preterm infant.


Subject(s)
Ductus Arteriosus, Patent/diagnostic imaging , Spectroscopy, Near-Infrared , Ductus Arteriosus, Patent/diagnosis , Female , Gestational Age , Hemodynamics/physiology , Humans , Infant, Newborn , Infant, Premature/physiology , Intensive Care Units, Neonatal , Intensive Care, Neonatal , Male , Monitoring, Physiologic , ROC Curve , Regression Analysis , Retrospective Studies
SELECTION OF CITATIONS
SEARCH DETAIL