Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 24
Filter
1.
Biotechnol Bioeng ; 117(7): 2295-2299, 2020 07.
Article in English | MEDLINE | ID: mdl-32181887

ABSTRACT

In cell line development the identification of stable Chinese hamster ovary cells for production is a critical but onerous task. The stability trial focus upon high-level attributes can mask profound underlying cellular changes, leading to unstable clones mistakenly being chosen for production. The challenge is to assay underlying cell pathways and subsystems without pushing up cell line development costs. ChemStress® cell function profiling is a simple, multiwell plate-based assay that uses a panel of active chemicals to mimic known bioprocess stresses and challenge key pathways. After 3 days of static culture on the plate, functional responses are assayed, for example, titer and growth. Here this approach is used to monitor 131 clones as they change over real stability trials. A novel stability metric is defined over the data to identify stable clones that remain unperturbed across many components of cell function. This allows stability trials to look beneath the titer to identify clones that are internally more stable.


Subject(s)
Clone Cells/cytology , Animals , Biotechnology , CHO Cells , Cell Culture Techniques , Clone Cells/metabolism , Cricetulus , Phenotype
2.
BMC Biol ; 9: 70, 2011 Oct 24.
Article in English | MEDLINE | ID: mdl-22023736

ABSTRACT

BACKGROUND: The uptake of drugs into cells has traditionally been considered to be predominantly via passive diffusion through the bilayer portion of the cell membrane. The recent recognition that drug uptake is mostly carrier-mediated raises the question of which drugs use which carriers. RESULTS: To answer this, we have constructed a chemical genomics platform built upon the yeast gene deletion collection, using competition experiments in batch fermenters and robotic automation of cytotoxicity screens, including protection by 'natural' substrates. Using these, we tested 26 different drugs and identified the carriers required for 18 of the drugs to gain entry into yeast cells. CONCLUSIONS: As well as providing a useful platform technology, these results further substantiate the notion that the cellular uptake of pharmaceutical drugs normally occurs via carrier-mediated transport and indicates that establishing the identity and tissue distribution of such carriers should be a major consideration in the design of safe and effective drugs.


Subject(s)
Cell Membrane/metabolism , Genomics/methods , Pharmaceutical Preparations/metabolism , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae/metabolism , Biological Transport , Canavanine/metabolism , Cell Membrane Permeability , Drug Evaluation, Preclinical , Gene Deletion , Genome-Wide Association Study , Humans , Membrane Transport Proteins/metabolism , Polymerase Chain Reaction
3.
Bioinformatics ; 26(19): 2486-7, 2010 Oct 01.
Article in English | MEDLINE | ID: mdl-20709690

ABSTRACT

UNLABELLED: Text mining from the biomedical literature is of increasing importance, yet it is not easy for the bioinformatics community to create and run text mining workflows due to the lack of accessibility and interoperability of the text mining resources. The U-Compare system provides a wide range of bio text mining resources in a highly interoperable workflow environment where workflows can very easily be created, executed, evaluated and visualized without coding. We have linked U-Compare to Taverna, a generic workflow system, to expose text mining functionality to the bioinformatics community. AVAILABILITY: http://u-compare.org/taverna.html, http://u-compare.org.


Subject(s)
Data Mining/methods , Computational Biology , Databases, Factual , User-Computer Interface , Workflow
4.
Bioengineering (Basel) ; 6(1)2019 Mar 21.
Article in English | MEDLINE | ID: mdl-30901908

ABSTRACT

Escherichia coli strains have been modified in a variety of ways to enhance the production of different recombinant proteins, targeting membrane protein expression, proteins with disulphide bonds, and more recently, proteins which require N-linked glycosylation. The addition of glycans to proteins remains a relatively inefficient process and here we aimed to combine genetic modifications within central carbon metabolic pathways in order to increase glycan precursor pools, prior to transfer onto polypeptide backbones. Using a lectin screen that detects cell surface representation of glycans, together with Western blot analyses using an O-antigen ligase mutant strain, the enhanced uptake and phosphorylation of sugars (ptsA) from the media combined with conservation of carbon through the glyoxylate shunt (icl) improved glycosylation efficiency of a bacterial protein AcrA by 69% and over 100% in an engineered human protein IFN-α2b. Unexpectedly, overexpression of a gene involved in the production of DXP from pyruvate (dxs), which was previously seen to have a positive impact on glycosylation, was detrimental to process efficiency and the possible reasons for this are discussed.

5.
Methods Mol Biol ; 2049: 285-314, 2019.
Article in English | MEDLINE | ID: mdl-31602618

ABSTRACT

Computational systems biology involves integrating heterogeneous datasets in order to generate models. These models can assist with understanding and prediction of biological phenomena. Generating datasets and integrating them into models involves a wide range of scientific expertise. As a result these datasets are often collected by one set of researchers, and exchanged with others researchers for constructing the models. For this process to run smoothly the data and models must be FAIR-findable, accessible, interoperable, and reusable. In order for data and models to be FAIR they must be structured in consistent and predictable ways, and described sufficiently for other researchers to understand them. Furthermore, these data and models must be shared with other researchers, with appropriately controlled sharing permissions, before and after publication. In this chapter we explore the different data and model standards that assist with structuring, describing, and sharing. We also highlight the popular standards and sharing databases within computational systems biology.


Subject(s)
Data Management/methods , Systems Biology/methods , Computational Biology , Databases, Factual
6.
PLoS One ; 12(7): e0179130, 2017.
Article in English | MEDLINE | ID: mdl-28708831

ABSTRACT

Biologists and biochemists have at their disposal a number of excellent, publicly available data resources such as UniProt, KEGG, and NCBI Taxonomy, which catalogue biological entities. Despite the usefulness of these resources, they remain fundamentally unconnected. While links may appear between entries across these databases, users are typically only able to follow such links by manual browsing or through specialised workflows. Although many of the resources provide web-service interfaces for computational access, performing federated queries across databases remains a non-trivial but essential activity in interdisciplinary systems and synthetic biology programmes. What is needed are integrated repositories to catalogue both biological entities and-crucially-the relationships between them. Such a resource should be extensible, such that newly discovered relationships-for example, those between novel, synthetic enzymes and non-natural products-can be added over time. With the introduction of graph databases, the barrier to the rapid generation, extension and querying of such a resource has been lowered considerably. With a particular focus on metabolic engineering as an illustrative application domain, biochem4j, freely available at http://biochem4j.org, is introduced to provide an integrated, queryable database that warehouses chemical, reaction, enzyme and taxonomic data from a range of reliable resources. The biochem4j framework establishes a starting point for the flexible integration and exploitation of an ever-wider range of biological data sources, from public databases to laboratory-specific experimental datasets, for the benefit of systems biologists, biosystems engineers and the wider community of molecular biologists and biological chemists.


Subject(s)
Databases, Factual , User-Computer Interface , Computational Biology , Internet
7.
J Mol Biol ; 345(1): 187-99, 2005 Jan 07.
Article in English | MEDLINE | ID: mdl-15567421

ABSTRACT

Methods for predicting protein function from structure are becoming more important as the rate at which structures are solved increases more rapidly than experimental knowledge. As a result, protein structures now frequently lack functional annotations. The majority of methods for predicting protein function are reliant upon identifying a similar protein and transferring its annotations to the query protein. This method fails when a similar protein cannot be identified, or when any similar proteins identified also lack reliable annotations. Here, we describe a method that can assign function from structure without the use of algorithms reliant upon alignments. Using simple attributes that can be calculated from any crystal structure, such as secondary structure content, amino acid propensities, surface properties and ligands, we describe each enzyme in a non-redundant set. The set is split according to Enzyme Classification (EC) number. We combine the predictions of one-class versus one-class support vector machine models to make overall assignments of EC number to an accuracy of 35% with the top-ranked prediction, rising to 60% accuracy with the top two ranks. In doing so we demonstrate the utility of simple structural attributes in protein function prediction and shed light on the link between structure and function. We apply our methods to predict the function of every currently unclassified protein in the Protein Data Bank.


Subject(s)
Algorithms , Enzymes , Protein Conformation , Amino Acids , Databases, Protein , Enzymes/chemistry , Enzymes/classification , Enzymes/metabolism , Sequence Alignment
8.
Metabolomics ; 12: 109, 2016.
Article in English | MEDLINE | ID: mdl-27358602

ABSTRACT

INTRODUCTION: The human genome-scale metabolic reconstruction details all known metabolic reactions occurring in humans, and thereby holds substantial promise for studying complex diseases and phenotypes. Capturing the whole human metabolic reconstruction is an on-going task and since the last community effort generated a consensus reconstruction, several updates have been developed. OBJECTIVES: We report a new consensus version, Recon 2.2, which integrates various alternative versions with significant additional updates. In addition to re-establishing a consensus reconstruction, further key objectives included providing more comprehensive annotation of metabolites and genes, ensuring full mass and charge balance in all reactions, and developing a model that correctly predicts ATP production on a range of carbon sources. METHODS: Recon 2.2 has been developed through a combination of manual curation and automated error checking. Specific and significant manual updates include a respecification of fatty acid metabolism, oxidative phosphorylation and a coupling of the electron transport chain to ATP synthase activity. All metabolites have definitive chemical formulae and charges specified, and these are used to ensure full mass and charge reaction balancing through an automated linear programming approach. Additionally, improved integration with transcriptomics and proteomics data has been facilitated with the updated curation of relationships between genes, proteins and reactions. RESULTS: Recon 2.2 now represents the most predictive model of human metabolism to date as demonstrated here. Extensive manual curation has increased the reconstruction size to 5324 metabolites, 7785 reactions and 1675 associated genes, which now are mapped to a single standard. The focus upon mass and charge balancing of all reactions, along with better representation of energy generation, has produced a flux model that correctly predicts ATP yield on different carbon sources. CONCLUSION: Through these updates we have achieved the most complete and best annotated consensus human metabolic reconstruction available, thereby increasing the ability of this resource to provide novel insights into normal and disease states in human. The model is freely available from the Biomodels database (http://identifiers.org/biomodels.db/MODEL1603150001).

9.
J Mol Biol ; 330(4): 771-83, 2003 Jul 18.
Article in English | MEDLINE | ID: mdl-12850146

ABSTRACT

The ability to predict protein function from structure is becoming increasingly important as the number of structures resolved is growing more rapidly than our capacity to study function. Current methods for predicting protein function are mostly reliant on identifying a similar protein of known function. For proteins that are highly dissimilar or are only similar to proteins also lacking functional annotations, these methods fail. Here, we show that protein function can be predicted as enzymatic or not without resorting to alignments. We describe 1178 high-resolution proteins in a structurally non-redundant subset of the Protein Data Bank using simple features such as secondary-structure content, amino acid propensities, surface properties and ligands. The subset is split into two functional groupings, enzymes and non-enzymes. We use the support vector machine-learning algorithm to develop models that are capable of assigning the protein class. Validation of the method shows that the function can be predicted to an accuracy of 77% using 52 features to describe each protein. An adaptive search of possible subsets of features produces a simplified model based on 36 features that predicts at an accuracy of 80%. We compare the method to sequence-based methods that also avoid calculating alignments and predict a recently released set of unrelated proteins. The most useful features for distinguishing enzymes from non-enzymes are secondary-structure content, amino acid frequencies, number of disulphide bonds and size of the largest cleft. This method is applicable to any structure as it does not require the identification of sequence or structural similarity to a protein of known function.


Subject(s)
Enzymes/chemistry , Algorithms , Aspartic Acid/chemistry , Computational Biology , Databases as Topic , Genome , Ligands , Phenylalanine/chemistry , Protein Conformation , Protein Structure, Secondary , Proteome , Software
10.
Curr Med Chem ; 11(16): 2135-42, 2004 Aug.
Article in English | MEDLINE | ID: mdl-15279553

ABSTRACT

Tremendous progress in DNA sequencing has yielded the genomes of a host of important organisms. The utilisation of these resources requires understanding of the function of each gene. Standard methods of functional assignment involve sequence alignment to a gene of known function; however such methods often fail to find any significant matches. Here we discuss a number of recent alternative methods that may be of use when sequence alignment fails. Function can be defined in a number of ways including E.C. number and MIPS and KEGG functional classes. Phylogenetic profiles show the pattern of presence or absence of a protein between genomes. Protein-protein interactions can be identified by searching for interacting pairs of proteins that are fused to a single protein chain in another organism. The gene neighbour method uses the observation that if the genes that encode two proteins are close on a chromosome, the proteins tend to be functionally related. More general methods use sequence properties such as amino acid composition, mean hydrophobicity, predicted secondary structure and post-translational modification sites. Data mining methods devise rules in the form of IF... THEN statements that make predictions of function using sequence based attributes, predicted secondary structure and sequence similarity. Finally, structural features can be used, after modelling the structure of a protein from its sequence or solving its structure. Protein fold class can be strongly indicative of function, while other structural features, such as secondary structure content, cleft size and 3D structural motifs are also useful.


Subject(s)
Proteins/chemistry , Sequence Analysis, Protein , Computer Simulation , Protein Conformation , Protein Folding , Structure-Activity Relationship
11.
Addiction ; 109(12): 1994-2002, 2014 Dec.
Article in English | MEDLINE | ID: mdl-24957220

ABSTRACT

BACKGROUND AND AIMS: Effective use of alcohol duty to reduce consumption and harm depends partly on retailers passing duty increases on to consumers via price increases, also known as 'pass-through'. The aim of this analysis is to provide evidence of UK excise duty and sales tax (VAT) pass-through rates for alcohol products at different price points. SETTING: March 2008 to August 2011, United Kingdom. DESIGN AND MEASUREMENTS: Panel data quantile regression estimating the effects of three duty changes, two VAT changes and one combined duty and VAT change on UK alcohol prices, using product-level supermarket price data for 254 alcohol products available weekly. Products were analysed in four categories: beers, ciders/ready to drink (RTDs), spirits and wines. FINDINGS: Within all four categories there exists considerable heterogeneity in the level of duty pass-through for cheaper versus expensive products. Price increases for the cheapest 15% of products fall below duty rises (undershifting), while products sold above the median price are overshifted (price increases are higher than duty increases). The level of undershifting is greatest for beer [0.85 (0.79, 0.92)] and spirits [0.86 (0.83, 0.89)]. Undershifting affects approximately 67% of total beer sales and 38% of total spirits sales. CONCLUSIONS: Alcohol retailers in the United Kingdom appear to respond to increases in alcohol tax by undershifting their cheaper products (raising prices below the level of the tax increase) and overshifting their more expensive products (raising prices beyond the level of the tax increase). This is likely to impact negatively on tax policy effectiveness, because high-risk groups favour cheaper alcohol and undershifting is likely to produce smaller consumption reductions.


Subject(s)
Alcohol Drinking/economics , Alcohol Drinking/prevention & control , Alcoholic Beverages/economics , Commerce/economics , Commerce/legislation & jurisprudence , Taxes/economics , Taxes/legislation & jurisprudence , Costs and Cost Analysis/economics , Costs and Cost Analysis/legislation & jurisprudence , Humans , United Kingdom
12.
Drug Discov Today ; 18(5-6): 218-39, 2013 Mar.
Article in English | MEDLINE | ID: mdl-23207804

ABSTRACT

A recent paper in this journal sought to counter evidence for the role of transport proteins in effecting drug uptake into cells, and questions that transporters can recognize drug molecules in addition to their endogenous substrates. However, there is abundant evidence that both drugs and proteins are highly promiscuous. Most proteins bind to many drugs and most drugs bind to multiple proteins (on average more than six), including transporters (mutations in these can determine resistance); most drugs are known to recognise at least one transporter. In this response, we alert readers to the relevant evidence that exists or is required. This needs to be acquired in cells that contain the relevant proteins, and we highlight an experimental system for simultaneous genome-wide assessment of carrier-mediated uptake in a eukaryotic cell (yeast).


Subject(s)
Membrane Transport Proteins/metabolism , Pharmaceutical Preparations/metabolism , Biological Transport , Humans , Yeasts/genetics , Yeasts/metabolism
13.
Nat Biotechnol ; 31(5): 419-25, 2013 May.
Article in English | MEDLINE | ID: mdl-23455439

ABSTRACT

Multiple models of human metabolism have been reconstructed, but each represents only a subset of our knowledge. Here we describe Recon 2, a community-driven, consensus 'metabolic reconstruction', which is the most comprehensive representation of human metabolism that is applicable to computational modeling. Compared with its predecessors, the reconstruction has improved topological and functional features, including ∼2× more reactions and ∼1.7× more unique metabolites. Using Recon 2 we predicted changes in metabolite biomarkers for 49 inborn errors of metabolism with 77% accuracy when compared to experimental data. Mapping metabolomic data and drug information onto Recon 2 demonstrates its potential for integrating and analyzing diverse data types. Using protein expression data, we automatically generated a compendium of 65 cell type-specific models, providing a basis for manual curation or investigation of cell-specific metabolic properties. Recon 2 will facilitate many future biomedical studies and is freely available at http://humanmetabolism.org/.


Subject(s)
Databases, Protein , Metabolome/physiology , Models, Biological , Proteome/metabolism , Computer Simulation , Humans
15.
Drug Discov Today ; 16(15-16): 704-14, 2011 Aug.
Article in English | MEDLINE | ID: mdl-21624498

ABSTRACT

All cells necessarily contain tens, if not hundreds, of carriers for nutrients and intermediary metabolites, and the human genome codes for more than 1000 carriers of various kinds. Here, we illustrate using a typical literature example the widespread but erroneous nature of the assumption that the 'background' or 'passive' permeability to drugs occurs in the absence of carriers. Comparison of the rate of drug transport in natural versus artificial membranes shows discrepancies in absolute magnitudes of 100-fold or more, with the carrier-containing cells showing the greater permeability. Expression profiling data show exactly which carriers are expressed in which tissues. The recognition that drugs necessarily require carriers for uptake into cells provides many opportunities for improving the effectiveness of the drug discovery process.


Subject(s)
Cell Membrane/metabolism , Membrane Transport Proteins/metabolism , Pharmaceutical Preparations/metabolism , Animals , Biological Transport , Cell Membrane Permeability , Drug Design , Gene Expression Profiling , Humans , Membranes, Artificial
16.
Metabolomics ; 7(1): 94-101, 2011 Mar.
Article in English | MEDLINE | ID: mdl-21687783

ABSTRACT

Text mining methods have added considerably to our capacity to extract biological knowledge from the literature. Recently the field of systems biology has begun to model and simulate metabolic networks, requiring knowledge of the set of molecules involved. While genomics and proteomics technologies are able to supply the macromolecular parts list, the metabolites are less easily assembled. Most metabolites are known and reported through the scientific literature, rather than through large-scale experimental surveys. Thus it is important to recover them from the literature. Here we present a novel tool to automatically identify metabolite names in the literature, and associate structures where possible, to define the reported yeast metabolome. With ten-fold cross validation on a manually annotated corpus, our recognition tool generates an f-score of 78.49 (precision of 83.02) and demonstrates greater suitability in identifying metabolite names than other existing recognition tools for general chemical molecules. The metabolite recognition tool has been applied to the literature covering an important model organism, the yeast Saccharomyces cerevisiae, to define its reported metabolome. By coupling to ChemSpider, a major chemical database, we have identified structures for much of the reported metabolome and, where structure identification fails, been able to suggest extensions to ChemSpider. Our manually annotated gold-standard data on 296 abstracts are available as supplementary materials. Metabolite names and, where appropriate, structures are also available as supplementary materials. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s11306-010-0251-6) contains supplementary material, which is available to authorized users.

18.
BMC Syst Biol ; 4: 114, 2010 Aug 16.
Article in English | MEDLINE | ID: mdl-20712863

ABSTRACT

BACKGROUND: Genome-scale metabolic reconstructions have been recognised as a valuable tool for a variety of applications ranging from metabolic engineering to evolutionary studies. However, the reconstruction of such networks remains an arduous process requiring a high level of human intervention. This process is further complicated by occurrences of missing or conflicting information and the absence of common annotation standards between different data sources. RESULTS: In this article, we report a semi-automated methodology aimed at streamlining the process of metabolic network reconstruction by enabling the integration of different genome-wide databases of metabolic reactions. We present results obtained by applying this methodology to the metabolic network of the plant Arabidopsis thaliana. A systematic comparison of compounds and reactions between two genome-wide databases allowed us to obtain a high-quality core consensus reconstruction, which was validated for stoichiometric consistency. A lower level of consensus led to a larger reconstruction, which has a lower quality standard but provides a baseline for further manual curation. CONCLUSION: This semi-automated methodology may be applied to other organisms and help to streamline the process of genome-scale network reconstruction in order to accelerate the transfer of such models to applications.


Subject(s)
Databases, Factual , Genomics/methods , Metabolic Networks and Pathways , Arabidopsis/genetics , Arabidopsis/metabolism , Genome , Reproducibility of Results
19.
BMC Syst Biol ; 4: 145, 2010 Oct 28.
Article in English | MEDLINE | ID: mdl-21029416

ABSTRACT

BACKGROUND: To date, several genome-scale network reconstructions have been used to describe the metabolism of the yeast Saccharomyces cerevisiae, each differing in scope and content. The recent community-driven reconstruction, while rigorously evidenced and well annotated, under-represented metabolite transport, lipid metabolism and other pathways, and was not amenable to constraint-based analyses because of lack of pathway connectivity. RESULTS: We have expanded the yeast network reconstruction to incorporate many new reactions from the literature and represented these in a well-annotated and standards-compliant manner. The new reconstruction comprises 1102 unique metabolic reactions involving 924 unique metabolites--significantly larger in scope than any previous reconstruction. The representation of lipid metabolism in particular has improved, with 234 out of 268 enzymes linked to lipid metabolism now present in at least one reaction. Connectivity is emphatically improved, with more than 90% of metabolites now reachable from the growth medium constituents. The present updates allow constraint-based analyses to be performed; viability predictions of single knockouts are comparable to results from in vivo experiments and to those of previous reconstructions. CONCLUSIONS: We report the development of the most complete reconstruction of yeast metabolism to date that is based upon reliable literature evidence and richly annotated according to MIRIAM standards. The reconstruction is available in the Systems Biology Markup Language (SBML) and via a publicly accessible database http://www.comp-sys-bio.org/yeastnet/.


Subject(s)
Genome, Fungal , Metabolomics/methods , Models, Biological , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae/metabolism , Lipid Metabolism , Molecular Sequence Annotation , Software
20.
Drug Discov Today ; 14(1-2): 31-40, 2009 Jan.
Article in English | MEDLINE | ID: mdl-19049901

ABSTRACT

Present drug screening libraries are constrained by biophysical properties that predict desirable pharmacokinetics and structural descriptors of 'drug-likeness' or 'lead-likeness'. Recent surveys, however, indicate that to enter cells most drugs require solute carriers that normally transport the naturally occurring intermediary metabolites and many drugs are likely to interact similarly. The existence of increasingly comprehensive summaries of the human metabolome allows the assessment of the concept of 'metabolite-likeness'. We compare the similarity of known drugs and library compounds to naturally occurring metabolites (endogenites) using relevant cheminformatics molecular descriptor spaces in which known drugs are more akin to such endogenites than are most library compounds.


Subject(s)
Drug Design , Pharmaceutical Preparations/chemistry , Small Molecule Libraries , Biological Transport , Databases, Factual , Humans , Metabolome , Pharmaceutical Preparations/metabolism , Structure-Activity Relationship
SELECTION OF CITATIONS
SEARCH DETAIL