Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 19 de 19
Filtrar
1.
Nat Methods ; 2024 Oct 15.
Artículo en Inglés | MEDLINE | ID: mdl-39407016

RESUMEN

Spatial transcriptomics produces high-dimensional gene expression measurements with spatial context. Obtaining a biologically meaningful low-dimensional representation of such data is crucial for effective interpretation and downstream analysis. Here, we present Spatial Transcriptomics Analysis with topic Modeling to uncover spatial Patterns (STAMP), an interpretable spatially aware dimension reduction method built on a deep generative model that returns biologically relevant, low-dimensional spatial topics and associated gene modules. STAMP can analyze data ranging from a single section to multiple sections and from different technologies to time-series data, returning topics matching known biological domains and associated gene modules containing established markers highly ranked within. In a lung cancer sample, STAMP delineated cell states with supporting markers at a higher resolution than the original annotation and uncovered cancer-associated fibroblasts concentrated on the tumor edge's exterior. In time-series data of mouse embryonic development, STAMP disentangled the erythro-myeloid hematopoiesis and hepatocytes developmental trajectories within the liver. STAMP is highly scalable and can handle more than 500,000 cells.

2.
Nat Methods ; 21(9): 1658-1667, 2024 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-38907114

RESUMEN

Advances in spatial omics technologies now allow multiple types of data to be acquired from the same tissue slice. To realize the full potential of such data, we need spatially informed methods for data integration. Here, we introduce SpatialGlue, a graph neural network model with a dual-attention mechanism that deciphers spatial domains by intra-omics integration of spatial location and omics measurement followed by cross-omics integration. We demonstrated SpatialGlue on data acquired from different tissue types using different technologies, including spatial epigenome-transcriptome and transcriptome-proteome modalities. Compared to other methods, SpatialGlue captured more anatomical details and more accurately resolved spatial domains such as the cortex layers of the brain. Our method also identified cell types like spleen macrophage subsets located at three different zones that were not available in the original data annotations. SpatialGlue scales well with data size and can be used to integrate three modalities. Our spatial multi-omics analysis tool combines the information from complementary omics modalities to obtain a holistic view of cellular and tissue properties.


Asunto(s)
Transcriptoma , Animales , Redes Neurales de la Computación , Ratones , Humanos , Encéfalo/metabolismo , Proteoma , Proteómica/métodos , Genómica/métodos , Epigenoma , Biología Computacional/métodos , Bazo/metabolismo , Bazo/citología , Multiómica
3.
Nucleic Acids Res ; 50(D1): D596-D602, 2022 01 07.
Artículo en Inglés | MEDLINE | ID: mdl-34791375

RESUMEN

The ability to study cellular heterogeneity at single cell resolution is making single-cell sequencing increasingly popular. However, there is no publicly available resource that offers an integrated cell atlas with harmonized metadata that users can integrate new data with. Here, we present DISCO (https://www.immunesinglecell.org/), a database of Deeply Integrated Single-Cell Omics data. The current release of DISCO integrates more than 18 million cells from 4593 samples, covering 107 tissues/cell lines/organoids, 158 diseases, and 20 platforms. We standardized the associated metadata with a controlled vocabulary and ontology system. To allow large scale integration of single-cell data, we developed FastIntegration, a fast and high-capacity version of Seurat Integration. We also developed CELLiD, an atlas guided automatic cell type identification tool. Employing these two tools on the assembled data, we constructed one global atlas and 27 sub-atlases for different tissues, diseases, and cell types. DISCO provides three online tools, namely Online FastIntegration, Online CELLiD, and CellMapper, for users to integrate, annotate, and project uploaded single-cell RNA-seq data onto a selected atlas. Collectively, DISCO is a versatile platform for users to explore published single-cell data and efficiently perform integrated analysis with their own data.


Asunto(s)
Linaje de la Célula/genética , Bases de Datos Genéticas , Enfermedades Genéticas Congénitas/genética , Especificidad de Órganos/genética , Programas Informáticos , Enfermedades Genéticas Congénitas/clasificación , Humanos , RNA-Seq , Análisis de la Célula Individual
4.
Biotechnol Bioeng ; 118(11): 4305-4316, 2021 11.
Artículo en Inglés | MEDLINE | ID: mdl-34289087

RESUMEN

A robust monoclonal antibody (mAb) bioprocess requires physiological parameters such as temperature, pH, or dissolved oxygen to be well-controlled as even small variations in them could potentially impact the final product quality. For instance, pH substantially affects N-glycosylation, protein aggregation, and charge variant profiles, as well as mAb productivity. However, relatively less is known about how pH jointly influences product quality and titer. In this study, we investigated the effect of pH on culture performance, product titer, and quality profiles by applying longitudinal multi-omics profiling, including transcriptomics, proteomics, metabolomics, and glycomics, at three different culture pH set points. The subsequent systematic analysis of multi-omics data showed that pH set points differentially regulated various intracellular pathways including intracellular vesicular trafficking, cell cycle, and apoptosis, thereby resulting in differences in specific productivity, product titer, and quality profiles. In addition, a time-dependent variation in mAb N-glycosylation profiles, independent of pH, was identified to be mainly due to the accumulation of mAb proteins in the endoplasmic reticulum disrupting cellular homeostasis over culture time. Overall, this multi-omics-based study provides an in-depth understanding of the intracellular processes in mAb-producing CHO cell line under varied pH conditions, and could serve as a baseline for enabling the quality optimization and control of mAb production.


Asunto(s)
Anticuerpos Monoclonales/biosíntesis , Técnicas de Cultivo de Célula , Ciclo Celular , Metabolómica , Oxígeno/metabolismo , Animales , Células CHO , Cricetulus , Glicosilación , Concentración de Iones de Hidrógeno
5.
Curr Genomics ; 19(8): 712-722, 2018 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-30532650

RESUMEN

In nature, microbes do not exist in isolation but co-exist in a variety of ecological and biological environments and on various host organisms. Due to their close proximity, these microbes interact among themselves, and also with the hosts in both positive and negative manners. Moreover, these interactions may modulate dynamically upon external stimulus as well as internal community changes. This demands systematic techniques such as mathematical modeling to understand the intrinsic community behavior. Here, we reviewed various approaches for metabolic modeling of microbial communities. If detailed species-specific information is available, segregated models of individual organisms can be constructed and connected via metabolite exchanges; otherwise, the community may be represented as a lumped network of metabolic reactions. The constructed models can then be simulated to help fill knowledge gaps, and generate testable hypotheses for designing new experiments. More importantly, such community models have been developed to study microbial interactions in various niches such as host microbiome, biogeochemical and bioremediation, waste water treatment and synthetic consortia. As such, the metabolic modeling efforts have allowed us to gain new insights into the natural and synthetic microbial communities, and design interventions to achieve specific goals. Finally, potential directions for future development in metabolic modeling of microbial communities were also discussed.

6.
Methods ; 102: 26-35, 2016 06 01.
Artículo en Inglés | MEDLINE | ID: mdl-26850284

RESUMEN

In this study, we analyzed multi-omics data and subsets thereof to establish reference codon usage biases for codon optimization in synthetic gene design. Specifically, publicly available genomic, transcriptomic, proteomic and translatomic data for microbial and mammalian expression hosts, Escherichia coli, Saccharomyces cerevisiae, Pichia pastoris and Chinese hamster ovary (CHO) cells, were compiled to derive their individual codon and codon pair frequencies. Then, host dependent and -omics specific codon biases were generated and compared by principal component analysis and hierarchical clustering. Interestingly, our results indicated the similar codon bias patterns of the highly expressed transcripts, highly abundant proteins, and efficiently translated mRNA in microbial cells, despite the general lack of correlation between mRNA and protein expression levels. However, for CHO cells, the codon bias patterns among various -omics subsets are not distinguishable, forming one cluster. Thus, we further investigated the effect of different input codon biases on codon optimized sequences using the codon context (CC) and individual codon usage (ICU) design parameters, via in silico case study on the expression of human IFNγ sequence in CHO cells. The results supported that CC is more robust design parameter than ICU for improved heterologous gene design.


Asunto(s)
Codón , Genómica/métodos , Mamíferos/genética , Animales , Células CHO , Simulación por Computador , Cricetulus , Minería de Datos/métodos , Escherichia coli/genética , Pichia/genética , Análisis de Componente Principal , Saccharomyces cerevisiae/genética
7.
Biotechnol Lett ; 38(12): 2137-2143, 2016 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-27613154

RESUMEN

OBJECTIVES: To evaluate different codon optimization parameters on the Saccharomyces cerevisiae-derived mating factor α prepro-leader sequence (MFLS) to improve Candida antarctica lipase B (CAL-B) secretory production in Pichia pastoris. RESULTS: Codon optimization based on the individual codon usage (ICU) and codon context (CC) design parameters enhanced secretory production of CAL-B to 7 U/ml and 12 U/ml, respectively. Only 3 U/ml was obtained with the wild type sequence while the sequence optimized using both ICU and CC objectives showed intermediate performance of 10 U/ml. These results clearly show that CC is the most relevant parameter for the codon optimization of MFLS in P. pastoris, and there is no synergistic effect achieved by considering both ICU and CC together. CONCLUSION: The CC optimized MFLS increased secretory protein production of CAL-B in P. pastoris by fourfold.


Asunto(s)
Codón/genética , Pichia/genética , Pichia/metabolismo , Proteínas Recombinantes/genética , Proteínas Recombinantes/metabolismo , Saccharomyces cerevisiae/genética , Factor de Apareamiento/genética , Biología Sintética
8.
Nat Commun ; 15(1): 5600, 2024 Jul 03.
Artículo en Inglés | MEDLINE | ID: mdl-38961061

RESUMEN

ezSingleCell is an interactive and easy-to-use application for analysing various single-cell and spatial omics data types without requiring prior programing knowledge. It combines the best-performing publicly available methods for in-depth data analysis, integration, and interactive data visualization. ezSingleCell consists of five modules, each designed to be a comprehensive workflow for one data type or task. In addition, ezSingleCell allows crosstalk between different modules within a unified interface. Acceptable input data can be in a variety of formats while the output consists of publication ready figures and tables. In-depth manuals and video tutorials are available to guide users on the analysis workflows and parameter adjustments to suit their study aims. ezSingleCell's streamlined interface can analyse a standard scRNA-seq dataset of 3000 cells in less than five minutes. ezSingleCell is available in two forms: an installation-free web application ( https://immunesinglecell.org/ezsc/ ) or a software package with a shinyApp interface ( https://github.com/JinmiaoChenLab/ezSingleCell2 ) for offline analysis.


Asunto(s)
Análisis de la Célula Individual , Programas Informáticos , Análisis de la Célula Individual/métodos , Humanos , Flujo de Trabajo , Biología Computacional/métodos , Interfaz Usuario-Computador , RNA-Seq/métodos
9.
Genome Med ; 16(1): 12, 2024 01 12.
Artículo en Inglés | MEDLINE | ID: mdl-38217035

RESUMEN

Optimal integration of transcriptomics data and associated spatial information is essential towards fully exploiting spatial transcriptomics to dissect tissue heterogeneity and map out inter-cellular communications. We present SEDR, which uses a deep autoencoder coupled with a masked self-supervised learning mechanism to construct a low-dimensional latent representation of gene expression, which is then simultaneously embedded with the corresponding spatial information through a variational graph autoencoder. SEDR achieved higher clustering performance on manually annotated 10 × Visium datasets and better scalability on high-resolution spatial transcriptomics datasets than existing methods. Additionally, we show SEDR's ability to impute and denoise gene expression (URL: https://github.com/JinmiaoChenLab/SEDR/ ).


Asunto(s)
Comunicación Celular , Perfilación de la Expresión Génica , Humanos , Análisis por Conglomerados
10.
Front Immunol ; 14: 1127879, 2023.
Artículo en Inglés | MEDLINE | ID: mdl-37006302

RESUMEN

Introduction: Ageing in the human bone marrow is associated with immune function decline that results in the elderly being vulnerable to illnesses. A comprehensive healthy bone marrow consensus atlas can serve as a reference to study the immunological changes associated with ageing, and to identify and study abnormal cell states. Methods: We collected publicly available single cell transcriptomic data of 145 healthy samples encompassing a wide spectrum of ages ranging from 2 to 84 years old to construct our human bone marrow atlas. The final atlas has 673,750 cells and 54 annotated cell types. Results: We first characterised the changes in cell population sizes with respect to age and the corresponding changes in gene expression and pathways. Overall, we found significant age-associated changes in the lymphoid lineage cells. The naïve CD8+ T cell population showed significant shrinkage with ageing while the effector/memory CD4+ T cells increased in proportion. We also found an age-correlated decline in the common lymphoid progenitor population, in line with the commonly observed myeloid skew in haematopoiesis among the elderly. We then employed our cell type-specific ageing gene signatures to develop a machine learning model that predicts the biological age of bone marrow samples, which we then applied to healthy individuals and those with blood diseases. Finally, we demonstrated how to identify abnormal cell states by mapping disease samples onto the atlas. We accurately identified abnormal plasma cells and erythroblasts in multiple myeloma samples, and abnormal cells in acute myeloid leukaemia samples. Discussion: The bone marrow is the site of haematopoiesis, a highly important bodily process. We believe that our healthy bone marrow atlas is a valuable reference for studying bone marrow processes and bone marrow-related diseases. It can be mined for novel discoveries, as well as serve as a reference scaffold for mapping samples to identify and investigate abnormal cells.


Asunto(s)
Enfermedades de la Médula Ósea , Médula Ósea , Humanos , Anciano , Preescolar , Niño , Adolescente , Adulto Joven , Adulto , Persona de Mediana Edad , Anciano de 80 o más Años , Envejecimiento/genética , Senescencia Celular , Linfocitos T
11.
Nat Commun ; 14(1): 1155, 2023 03 01.
Artículo en Inglés | MEDLINE | ID: mdl-36859400

RESUMEN

Spatial transcriptomics technologies generate gene expression profiles with spatial context, requiring spatially informed analysis tools for three key tasks, spatial clustering, multisample integration, and cell-type deconvolution. We present GraphST, a graph self-supervised contrastive learning method that fully exploits spatial transcriptomics data to outperform existing methods. It combines graph neural networks with self-supervised contrastive learning to learn informative and discriminative spot representations by minimizing the embedding distance between spatially adjacent spots and vice versa. We demonstrated GraphST on multiple tissue types and technology platforms. GraphST achieved 10% higher clustering accuracy and better delineated fine-grained tissue structures in brain and embryo tissues. GraphST is also the only method that can jointly analyze multiple tissue slices in vertical or horizontal integration while correcting batch effects. Lastly, GraphST demonstrated superior cell-type deconvolution to capture spatial niches like lymph node germinal centers and exhausted tumor infiltrating T cells in breast tumor tissue.


Asunto(s)
Perfilación de la Expresión Génica , Transcriptoma , Encéfalo , Análisis por Conglomerados , Centro Germinal
12.
J Invest Dermatol ; 143(6): 1031-1041.e8, 2023 06.
Artículo en Inglés | MEDLINE | ID: mdl-36566875

RESUMEN

Zika virus (ZIKV) became a public health concern when it re-emerged in 2015 owing to its ability to cause congenital deformities in the fetus and neurological complications in adults. Despite extensive data on protection, the interplay of protective and pathogenic adaptive immune responses toward ZIKV infection remains poorly understood. In this study, using a T-cell‒deficient mouse model that retains persistent ZIKV viral titers in the blood and organs, we show that the adoptive transfer of CD8+ T cells led to a significant reduction in viral load. This mouse model reveals that ZIKV can induce grossly visible auricular dermatitis and blepharitis, mediated by ZIKV-specific CD8+ T cells. Single-cell RNA sequencing of these causative CD8+ T cells from the ears shows an overactivated and elevated cytotoxic signature in mice with severe symptoms. Our results strongly suggest a role for CD8+ T-cell‒associated pathologies after ZIKV infection in CD4+ T-cell‒immunodeficient patients.


Asunto(s)
Blefaritis , Dermatitis , Infección por el Virus Zika , Virus Zika , Ratones , Animales , Linfocitos T CD4-Positivos , Linfocitos T CD8-positivos , Modelos Animales de Enfermedad
13.
Genome Biol ; 21(1): 12, 2020 01 16.
Artículo en Inglés | MEDLINE | ID: mdl-31948481

RESUMEN

BACKGROUND: Large-scale single-cell transcriptomic datasets generated using different technologies contain batch-specific systematic variations that present a challenge to batch-effect removal and data integration. With continued growth expected in scRNA-seq data, achieving effective batch integration with available computational resources is crucial. Here, we perform an in-depth benchmark study on available batch correction methods to determine the most suitable method for batch-effect removal. RESULTS: We compare 14 methods in terms of computational runtime, the ability to handle large datasets, and batch-effect correction efficacy while preserving cell type purity. Five scenarios are designed for the study: identical cell types with different technologies, non-identical cell types, multiple batches, big data, and simulated data. Performance is evaluated using four benchmarking metrics including kBET, LISI, ASW, and ARI. We also investigate the use of batch-corrected data to study differential gene expression. CONCLUSION: Based on our results, Harmony, LIGER, and Seurat 3 are the recommended methods for batch integration. Due to its significantly shorter runtime, Harmony is recommended as the first method to try, with the other methods as viable alternatives.


Asunto(s)
RNA-Seq/métodos , Análisis de la Célula Individual/métodos , Algoritmos , Animales , Benchmarking , Macrodatos , Humanos , Ratones
14.
Biotechnol J ; 13(3): e1700229, 2018 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-29027766

RESUMEN

Kinetic modeling is the most suitable framework to describe the dynamic behavior of mammalian cell culture although its industrial application is still in its infancy. Herein, the authors reviewed mammalian bioprocess relevant kinetic models, and found that the simple unstructured-unsegregated approach utilizing empirical Monod-type kinetics based on limiting substrates and inhibitory metabolites is commonly used due to its traceability and simple formalism. Notably, the available kinetic models are typically small to moderate in size, and the development of large-scale models is severely hampered by the scarcity of kinetic data and limitations in current parameter estimation methods. The recent availability of abundant high-throughput multi-omics datasets from mammalian cell cultures have now paved the way to improve parameterization of kinetic models, and integrate regulatory, signaling, and product quality related intracellular events, as well as cellular metabolism within the modeling framework. Ultimately, the authors foresee that multi-scale modeling is the way forward in building predictive kinetic models of mammalian cell culture to advance biomanufacturing.


Asunto(s)
Técnicas de Cultivo de Célula/métodos , Mamíferos/genética , Modelos Biológicos , Animales , Cinética , Transducción de Señal/genética
15.
J Biotechnol ; 283: 97-104, 2018 Oct 10.
Artículo en Inglés | MEDLINE | ID: mdl-30076878

RESUMEN

We explored the effects of media and clonal variation on the lactate shift which can be considered as one of the desirable features in CHO cell culture. Various culture profiles with the specific growth and antibody production rates under three different media conditions in two CHO producing clones were evaluated by resorting to multivariate statistical analysis. In most cases, glutamine depletion coincided with lactate consumption, suggesting that glutaminolysis rather than glycolysis was the preferred pathway for the pyruvate supply toward lactate production. With respect to the lactate shift, high performing medium showed higher glutamate uptake, higher aspartate secretion and lower serine uptake compared to other media conditions. In addition, clone itself exhibited the desired lactate consumption more consistently accompanying with distinguishing phenotype. The clone exhibiting lactate shift produced lesser lactate in exponential phase but two-fold higher non-toxic alanine, thus leading to better culture environment. Thus, we understand the balanced selection of clone and media composition enables cells to utilize the metabolic pathways for the desired lactate shift.


Asunto(s)
Técnicas de Cultivo de Célula/métodos , Medios de Cultivo/química , Glutamina/metabolismo , Ácido Láctico/química , Alanina , Animales , Ácido Aspártico/metabolismo , Células CHO , Proliferación Celular , Cricetinae , Cricetulus , Redes y Vías Metabólicas , Fenotipo , Serina/metabolismo
16.
Methods Mol Biol ; 1472: 13-34, 2017.
Artículo en Inglés | MEDLINE | ID: mdl-27671929

RESUMEN

Codon optimization has been widely used for designing native or synthetic genes to enhance their expression in heterologous host organisms. We recently developed Codon Optimization On-Line (COOL) which is a web-based tool to provide multi-objective codon optimization functionality for synthetic gene design. COOL provides a simple and flexible interface for customizing codon optimization based on several design parameters such as individual codon usage, codon pairing, and codon adaptation index. User-defined sequences can also be compared against the COOL optimized ones to show the extent by which the user's sequences can be evaluated and further improved. The utility of COOL is demonstrated via a case study where the codon optimized sequence of an invertase enzyme is generated for the enhanced expression in E. coli.


Asunto(s)
Proteínas de Escherichia coli/genética , Genes Sintéticos , Programas Informáticos , beta-Fructofuranosidasa/genética , Clonación Molecular , Codón , Estabilidad de Enzimas , Escherichia coli/enzimología , Escherichia coli/genética , Expresión Génica , Sistemas en Línea , Ingeniería de Proteínas/métodos
17.
Metabolomics ; 12: 109, 2016.
Artículo en Inglés | MEDLINE | ID: mdl-27358602

RESUMEN

INTRODUCTION: The human genome-scale metabolic reconstruction details all known metabolic reactions occurring in humans, and thereby holds substantial promise for studying complex diseases and phenotypes. Capturing the whole human metabolic reconstruction is an on-going task and since the last community effort generated a consensus reconstruction, several updates have been developed. OBJECTIVES: We report a new consensus version, Recon 2.2, which integrates various alternative versions with significant additional updates. In addition to re-establishing a consensus reconstruction, further key objectives included providing more comprehensive annotation of metabolites and genes, ensuring full mass and charge balance in all reactions, and developing a model that correctly predicts ATP production on a range of carbon sources. METHODS: Recon 2.2 has been developed through a combination of manual curation and automated error checking. Specific and significant manual updates include a respecification of fatty acid metabolism, oxidative phosphorylation and a coupling of the electron transport chain to ATP synthase activity. All metabolites have definitive chemical formulae and charges specified, and these are used to ensure full mass and charge reaction balancing through an automated linear programming approach. Additionally, improved integration with transcriptomics and proteomics data has been facilitated with the updated curation of relationships between genes, proteins and reactions. RESULTS: Recon 2.2 now represents the most predictive model of human metabolism to date as demonstrated here. Extensive manual curation has increased the reconstruction size to 5324 metabolites, 7785 reactions and 1675 associated genes, which now are mapped to a single standard. The focus upon mass and charge balancing of all reactions, along with better representation of energy generation, has produced a flux model that correctly predicts ATP yield on different carbon sources. CONCLUSION: Through these updates we have achieved the most complete and best annotated consensus human metabolic reconstruction available, thereby increasing the ability of this resource to provide novel insights into normal and disease states in human. The model is freely available from the Biomodels database (http://identifiers.org/biomodels.db/MODEL1603150001).

18.
Cell Syst ; 3(5): 434-443.e8, 2016 11 23.
Artículo en Inglés | MEDLINE | ID: mdl-27883890

RESUMEN

Chinese hamster ovary (CHO) cells dominate biotherapeutic protein production and are widely used in mammalian cell line engineering research. To elucidate metabolic bottlenecks in protein production and to guide cell engineering and bioprocess optimization, we reconstructed the metabolic pathways in CHO and associated them with >1,700 genes in the Cricetulus griseus genome. The genome-scale metabolic model based on this reconstruction, iCHO1766, and cell-line-specific models for CHO-K1, CHO-S, and CHO-DG44 cells provide the biochemical basis of growth and recombinant protein production. The models accurately predict growth phenotypes and known auxotrophies in CHO cells. With the models, we quantify the protein synthesis capacity of CHO cells and demonstrate that common bioprocess treatments, such as histone deacetylase inhibitors, inefficiently increase product yield. However, our simulations show that the metabolic resources in CHO are more than three times more efficiently utilized for growth or recombinant protein synthesis following targeted efforts to engineer the CHO secretory pathway. This model will further accelerate CHO cell engineering and help optimize bioprocesses.


Asunto(s)
Genoma , Animales , Células CHO , Consenso , Cricetinae , Cricetulus , Humanos , Redes y Vías Metabólicas , Proteínas Recombinantes
19.
Enzyme Microb Technol ; 75-76: 57-63, 2015.
Artículo en Inglés | MEDLINE | ID: mdl-26047917

RESUMEN

Various isoforms of invertases from prokaryotes, fungi, and higher plants has been expressed in Escherichia coli, and codon optimisation is a widely-adopted strategy for improvement of heterologous enzyme expression. Successful synthetic gene design for recombinant protein expression can be done by matching its translational elongation rate against heterologous host organisms via codon optimization. Amongst the various design parameters considered for the gene synthesis, codon context bias has been relatively overlooked compared to individual codon usage which is commonly adopted in most of codon optimization tools. In addition, matching the rates of transcription and translation based on secondary structure may lead to enhanced protein folding. In this study, we evaluated codon context fitness as design criterion for improving the expression of thermostable invertase from Thermotoga maritima in Escherichia coli and explored the relevance of secondary structure regions for folding and expression. We designed three coding sequences by using (1) a commercial vendor optimized gene algorithm, (2) codon context for the whole gene, and (3) codon context based on the secondary structure regions. Then, the codon optimized sequences were transformed and expressed in E. coli. From the resultant enzyme activities and protein yield data, codon context fitness proved to have the highest activity as compared to the wild-type control and other criteria while secondary structure-based strategy is comparable to the control. Codon context bias was shown to be a relevant parameter for enhancing enzyme production in Escherichia coli by codon optimization. Thus, we can effectively design synthetic genes within heterologous host organisms using this criterion.


Asunto(s)
Escherichia coli/enzimología , Escherichia coli/genética , Genes Sintéticos , beta-Fructofuranosidasa/genética , Codón/genética , Estabilidad de Enzimas , Expresión Génica , Microbiología Industrial , Modelos Moleculares , Ingeniería de Proteínas , Estructura Secundaria de Proteína , Proteínas Recombinantes/química , Proteínas Recombinantes/genética , Proteínas Recombinantes/metabolismo , Temperatura , beta-Fructofuranosidasa/química , beta-Fructofuranosidasa/metabolismo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA