Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 18 de 18
Filter
1.
Nat Methods ; 2024 Jun 21.
Article in English | MEDLINE | ID: mdl-38907114

ABSTRACT

Advances in spatial omics technologies now allow multiple types of data to be acquired from the same tissue slice. To realize the full potential of such data, we need spatially informed methods for data integration. Here, we introduce SpatialGlue, a graph neural network model with a dual-attention mechanism that deciphers spatial domains by intra-omics integration of spatial location and omics measurement followed by cross-omics integration. We demonstrated SpatialGlue on data acquired from different tissue types using different technologies, including spatial epigenome-transcriptome and transcriptome-proteome modalities. Compared to other methods, SpatialGlue captured more anatomical details and more accurately resolved spatial domains such as the cortex layers of the brain. Our method also identified cell types like spleen macrophage subsets located at three different zones that were not available in the original data annotations. SpatialGlue scales well with data size and can be used to integrate three modalities. Our spatial multi-omics analysis tool combines the information from complementary omics modalities to obtain a holistic view of cellular and tissue properties.

2.
Nucleic Acids Res ; 50(D1): D596-D602, 2022 01 07.
Article in English | MEDLINE | ID: mdl-34791375

ABSTRACT

The ability to study cellular heterogeneity at single cell resolution is making single-cell sequencing increasingly popular. However, there is no publicly available resource that offers an integrated cell atlas with harmonized metadata that users can integrate new data with. Here, we present DISCO (https://www.immunesinglecell.org/), a database of Deeply Integrated Single-Cell Omics data. The current release of DISCO integrates more than 18 million cells from 4593 samples, covering 107 tissues/cell lines/organoids, 158 diseases, and 20 platforms. We standardized the associated metadata with a controlled vocabulary and ontology system. To allow large scale integration of single-cell data, we developed FastIntegration, a fast and high-capacity version of Seurat Integration. We also developed CELLiD, an atlas guided automatic cell type identification tool. Employing these two tools on the assembled data, we constructed one global atlas and 27 sub-atlases for different tissues, diseases, and cell types. DISCO provides three online tools, namely Online FastIntegration, Online CELLiD, and CellMapper, for users to integrate, annotate, and project uploaded single-cell RNA-seq data onto a selected atlas. Collectively, DISCO is a versatile platform for users to explore published single-cell data and efficiently perform integrated analysis with their own data.


Subject(s)
Cell Lineage/genetics , Databases, Genetic , Genetic Diseases, Inborn/genetics , Organ Specificity/genetics , Software , Genetic Diseases, Inborn/classification , Humans , RNA-Seq , Single-Cell Analysis
3.
Biotechnol Bioeng ; 118(11): 4305-4316, 2021 11.
Article in English | MEDLINE | ID: mdl-34289087

ABSTRACT

A robust monoclonal antibody (mAb) bioprocess requires physiological parameters such as temperature, pH, or dissolved oxygen to be well-controlled as even small variations in them could potentially impact the final product quality. For instance, pH substantially affects N-glycosylation, protein aggregation, and charge variant profiles, as well as mAb productivity. However, relatively less is known about how pH jointly influences product quality and titer. In this study, we investigated the effect of pH on culture performance, product titer, and quality profiles by applying longitudinal multi-omics profiling, including transcriptomics, proteomics, metabolomics, and glycomics, at three different culture pH set points. The subsequent systematic analysis of multi-omics data showed that pH set points differentially regulated various intracellular pathways including intracellular vesicular trafficking, cell cycle, and apoptosis, thereby resulting in differences in specific productivity, product titer, and quality profiles. In addition, a time-dependent variation in mAb N-glycosylation profiles, independent of pH, was identified to be mainly due to the accumulation of mAb proteins in the endoplasmic reticulum disrupting cellular homeostasis over culture time. Overall, this multi-omics-based study provides an in-depth understanding of the intracellular processes in mAb-producing CHO cell line under varied pH conditions, and could serve as a baseline for enabling the quality optimization and control of mAb production.


Subject(s)
Antibodies, Monoclonal/biosynthesis , Cell Culture Techniques , Cell Cycle , Metabolomics , Oxygen/metabolism , Animals , CHO Cells , Cricetulus , Glycosylation , Hydrogen-Ion Concentration
4.
Curr Genomics ; 19(8): 712-722, 2018 Dec.
Article in English | MEDLINE | ID: mdl-30532650

ABSTRACT

In nature, microbes do not exist in isolation but co-exist in a variety of ecological and biological environments and on various host organisms. Due to their close proximity, these microbes interact among themselves, and also with the hosts in both positive and negative manners. Moreover, these interactions may modulate dynamically upon external stimulus as well as internal community changes. This demands systematic techniques such as mathematical modeling to understand the intrinsic community behavior. Here, we reviewed various approaches for metabolic modeling of microbial communities. If detailed species-specific information is available, segregated models of individual organisms can be constructed and connected via metabolite exchanges; otherwise, the community may be represented as a lumped network of metabolic reactions. The constructed models can then be simulated to help fill knowledge gaps, and generate testable hypotheses for designing new experiments. More importantly, such community models have been developed to study microbial interactions in various niches such as host microbiome, biogeochemical and bioremediation, waste water treatment and synthetic consortia. As such, the metabolic modeling efforts have allowed us to gain new insights into the natural and synthetic microbial communities, and design interventions to achieve specific goals. Finally, potential directions for future development in metabolic modeling of microbial communities were also discussed.

5.
Methods ; 102: 26-35, 2016 06 01.
Article in English | MEDLINE | ID: mdl-26850284

ABSTRACT

In this study, we analyzed multi-omics data and subsets thereof to establish reference codon usage biases for codon optimization in synthetic gene design. Specifically, publicly available genomic, transcriptomic, proteomic and translatomic data for microbial and mammalian expression hosts, Escherichia coli, Saccharomyces cerevisiae, Pichia pastoris and Chinese hamster ovary (CHO) cells, were compiled to derive their individual codon and codon pair frequencies. Then, host dependent and -omics specific codon biases were generated and compared by principal component analysis and hierarchical clustering. Interestingly, our results indicated the similar codon bias patterns of the highly expressed transcripts, highly abundant proteins, and efficiently translated mRNA in microbial cells, despite the general lack of correlation between mRNA and protein expression levels. However, for CHO cells, the codon bias patterns among various -omics subsets are not distinguishable, forming one cluster. Thus, we further investigated the effect of different input codon biases on codon optimized sequences using the codon context (CC) and individual codon usage (ICU) design parameters, via in silico case study on the expression of human IFNγ sequence in CHO cells. The results supported that CC is more robust design parameter than ICU for improved heterologous gene design.


Subject(s)
Codon , Genomics/methods , Mammals/genetics , Animals , CHO Cells , Computer Simulation , Cricetulus , Data Mining/methods , Escherichia coli/genetics , Pichia/genetics , Principal Component Analysis , Saccharomyces cerevisiae/genetics
6.
Biotechnol Lett ; 38(12): 2137-2143, 2016 Dec.
Article in English | MEDLINE | ID: mdl-27613154

ABSTRACT

OBJECTIVES: To evaluate different codon optimization parameters on the Saccharomyces cerevisiae-derived mating factor α prepro-leader sequence (MFLS) to improve Candida antarctica lipase B (CAL-B) secretory production in Pichia pastoris. RESULTS: Codon optimization based on the individual codon usage (ICU) and codon context (CC) design parameters enhanced secretory production of CAL-B to 7 U/ml and 12 U/ml, respectively. Only 3 U/ml was obtained with the wild type sequence while the sequence optimized using both ICU and CC objectives showed intermediate performance of 10 U/ml. These results clearly show that CC is the most relevant parameter for the codon optimization of MFLS in P. pastoris, and there is no synergistic effect achieved by considering both ICU and CC together. CONCLUSION: The CC optimized MFLS increased secretory protein production of CAL-B in P. pastoris by fourfold.


Subject(s)
Codon/genetics , Pichia/genetics , Pichia/metabolism , Recombinant Proteins/genetics , Recombinant Proteins/metabolism , Saccharomyces cerevisiae/genetics , Mating Factor/genetics , Synthetic Biology
7.
Nat Commun ; 15(1): 5600, 2024 Jul 03.
Article in English | MEDLINE | ID: mdl-38961061

ABSTRACT

ezSingleCell is an interactive and easy-to-use application for analysing various single-cell and spatial omics data types without requiring prior programing knowledge. It combines the best-performing publicly available methods for in-depth data analysis, integration, and interactive data visualization. ezSingleCell consists of five modules, each designed to be a comprehensive workflow for one data type or task. In addition, ezSingleCell allows crosstalk between different modules within a unified interface. Acceptable input data can be in a variety of formats while the output consists of publication ready figures and tables. In-depth manuals and video tutorials are available to guide users on the analysis workflows and parameter adjustments to suit their study aims. ezSingleCell's streamlined interface can analyse a standard scRNA-seq dataset of 3000 cells in less than five minutes. ezSingleCell is available in two forms: an installation-free web application ( https://immunesinglecell.org/ezsc/ ) or a software package with a shinyApp interface ( https://github.com/JinmiaoChenLab/ezSingleCell2 ) for offline analysis.


Subject(s)
Single-Cell Analysis , Software , Single-Cell Analysis/methods , Humans , Workflow , Computational Biology/methods , User-Computer Interface , RNA-Seq/methods
8.
Genome Med ; 16(1): 12, 2024 Jan 12.
Article in English | MEDLINE | ID: mdl-38217035

ABSTRACT

Optimal integration of transcriptomics data and associated spatial information is essential towards fully exploiting spatial transcriptomics to dissect tissue heterogeneity and map out inter-cellular communications. We present SEDR, which uses a deep autoencoder coupled with a masked self-supervised learning mechanism to construct a low-dimensional latent representation of gene expression, which is then simultaneously embedded with the corresponding spatial information through a variational graph autoencoder. SEDR achieved higher clustering performance on manually annotated 10 × Visium datasets and better scalability on high-resolution spatial transcriptomics datasets than existing methods. Additionally, we show SEDR's ability to impute and denoise gene expression (URL: https://github.com/JinmiaoChenLab/SEDR/ ).


Subject(s)
Cell Communication , Gene Expression Profiling , Humans , Cluster Analysis
9.
Front Immunol ; 14: 1127879, 2023.
Article in English | MEDLINE | ID: mdl-37006302

ABSTRACT

Introduction: Ageing in the human bone marrow is associated with immune function decline that results in the elderly being vulnerable to illnesses. A comprehensive healthy bone marrow consensus atlas can serve as a reference to study the immunological changes associated with ageing, and to identify and study abnormal cell states. Methods: We collected publicly available single cell transcriptomic data of 145 healthy samples encompassing a wide spectrum of ages ranging from 2 to 84 years old to construct our human bone marrow atlas. The final atlas has 673,750 cells and 54 annotated cell types. Results: We first characterised the changes in cell population sizes with respect to age and the corresponding changes in gene expression and pathways. Overall, we found significant age-associated changes in the lymphoid lineage cells. The naïve CD8+ T cell population showed significant shrinkage with ageing while the effector/memory CD4+ T cells increased in proportion. We also found an age-correlated decline in the common lymphoid progenitor population, in line with the commonly observed myeloid skew in haematopoiesis among the elderly. We then employed our cell type-specific ageing gene signatures to develop a machine learning model that predicts the biological age of bone marrow samples, which we then applied to healthy individuals and those with blood diseases. Finally, we demonstrated how to identify abnormal cell states by mapping disease samples onto the atlas. We accurately identified abnormal plasma cells and erythroblasts in multiple myeloma samples, and abnormal cells in acute myeloid leukaemia samples. Discussion: The bone marrow is the site of haematopoiesis, a highly important bodily process. We believe that our healthy bone marrow atlas is a valuable reference for studying bone marrow processes and bone marrow-related diseases. It can be mined for novel discoveries, as well as serve as a reference scaffold for mapping samples to identify and investigate abnormal cells.


Subject(s)
Bone Marrow Diseases , Bone Marrow , Humans , Aged , Child, Preschool , Child , Adolescent , Young Adult , Adult , Middle Aged , Aged, 80 and over , Aging/genetics , Cellular Senescence , T-Lymphocytes
10.
Nat Commun ; 14(1): 1155, 2023 03 01.
Article in English | MEDLINE | ID: mdl-36859400

ABSTRACT

Spatial transcriptomics technologies generate gene expression profiles with spatial context, requiring spatially informed analysis tools for three key tasks, spatial clustering, multisample integration, and cell-type deconvolution. We present GraphST, a graph self-supervised contrastive learning method that fully exploits spatial transcriptomics data to outperform existing methods. It combines graph neural networks with self-supervised contrastive learning to learn informative and discriminative spot representations by minimizing the embedding distance between spatially adjacent spots and vice versa. We demonstrated GraphST on multiple tissue types and technology platforms. GraphST achieved 10% higher clustering accuracy and better delineated fine-grained tissue structures in brain and embryo tissues. GraphST is also the only method that can jointly analyze multiple tissue slices in vertical or horizontal integration while correcting batch effects. Lastly, GraphST demonstrated superior cell-type deconvolution to capture spatial niches like lymph node germinal centers and exhausted tumor infiltrating T cells in breast tumor tissue.


Subject(s)
Gene Expression Profiling , Transcriptome , Brain , Cluster Analysis , Germinal Center
11.
J Invest Dermatol ; 143(6): 1031-1041.e8, 2023 06.
Article in English | MEDLINE | ID: mdl-36566875

ABSTRACT

Zika virus (ZIKV) became a public health concern when it re-emerged in 2015 owing to its ability to cause congenital deformities in the fetus and neurological complications in adults. Despite extensive data on protection, the interplay of protective and pathogenic adaptive immune responses toward ZIKV infection remains poorly understood. In this study, using a T-cell‒deficient mouse model that retains persistent ZIKV viral titers in the blood and organs, we show that the adoptive transfer of CD8+ T cells led to a significant reduction in viral load. This mouse model reveals that ZIKV can induce grossly visible auricular dermatitis and blepharitis, mediated by ZIKV-specific CD8+ T cells. Single-cell RNA sequencing of these causative CD8+ T cells from the ears shows an overactivated and elevated cytotoxic signature in mice with severe symptoms. Our results strongly suggest a role for CD8+ T-cell‒associated pathologies after ZIKV infection in CD4+ T-cell‒immunodeficient patients.


Subject(s)
Blepharitis , Dermatitis , Zika Virus Infection , Zika Virus , Mice , Animals , CD4-Positive T-Lymphocytes , CD8-Positive T-Lymphocytes , Disease Models, Animal
12.
Genome Biol ; 21(1): 12, 2020 01 16.
Article in English | MEDLINE | ID: mdl-31948481

ABSTRACT

BACKGROUND: Large-scale single-cell transcriptomic datasets generated using different technologies contain batch-specific systematic variations that present a challenge to batch-effect removal and data integration. With continued growth expected in scRNA-seq data, achieving effective batch integration with available computational resources is crucial. Here, we perform an in-depth benchmark study on available batch correction methods to determine the most suitable method for batch-effect removal. RESULTS: We compare 14 methods in terms of computational runtime, the ability to handle large datasets, and batch-effect correction efficacy while preserving cell type purity. Five scenarios are designed for the study: identical cell types with different technologies, non-identical cell types, multiple batches, big data, and simulated data. Performance is evaluated using four benchmarking metrics including kBET, LISI, ASW, and ARI. We also investigate the use of batch-corrected data to study differential gene expression. CONCLUSION: Based on our results, Harmony, LIGER, and Seurat 3 are the recommended methods for batch integration. Due to its significantly shorter runtime, Harmony is recommended as the first method to try, with the other methods as viable alternatives.


Subject(s)
RNA-Seq/methods , Single-Cell Analysis/methods , Algorithms , Animals , Benchmarking , Big Data , Humans , Mice
13.
Biotechnol J ; 13(3): e1700229, 2018 Mar.
Article in English | MEDLINE | ID: mdl-29027766

ABSTRACT

Kinetic modeling is the most suitable framework to describe the dynamic behavior of mammalian cell culture although its industrial application is still in its infancy. Herein, the authors reviewed mammalian bioprocess relevant kinetic models, and found that the simple unstructured-unsegregated approach utilizing empirical Monod-type kinetics based on limiting substrates and inhibitory metabolites is commonly used due to its traceability and simple formalism. Notably, the available kinetic models are typically small to moderate in size, and the development of large-scale models is severely hampered by the scarcity of kinetic data and limitations in current parameter estimation methods. The recent availability of abundant high-throughput multi-omics datasets from mammalian cell cultures have now paved the way to improve parameterization of kinetic models, and integrate regulatory, signaling, and product quality related intracellular events, as well as cellular metabolism within the modeling framework. Ultimately, the authors foresee that multi-scale modeling is the way forward in building predictive kinetic models of mammalian cell culture to advance biomanufacturing.


Subject(s)
Cell Culture Techniques/methods , Mammals/genetics , Models, Biological , Animals , Kinetics , Signal Transduction/genetics
14.
J Biotechnol ; 283: 97-104, 2018 Oct 10.
Article in English | MEDLINE | ID: mdl-30076878

ABSTRACT

We explored the effects of media and clonal variation on the lactate shift which can be considered as one of the desirable features in CHO cell culture. Various culture profiles with the specific growth and antibody production rates under three different media conditions in two CHO producing clones were evaluated by resorting to multivariate statistical analysis. In most cases, glutamine depletion coincided with lactate consumption, suggesting that glutaminolysis rather than glycolysis was the preferred pathway for the pyruvate supply toward lactate production. With respect to the lactate shift, high performing medium showed higher glutamate uptake, higher aspartate secretion and lower serine uptake compared to other media conditions. In addition, clone itself exhibited the desired lactate consumption more consistently accompanying with distinguishing phenotype. The clone exhibiting lactate shift produced lesser lactate in exponential phase but two-fold higher non-toxic alanine, thus leading to better culture environment. Thus, we understand the balanced selection of clone and media composition enables cells to utilize the metabolic pathways for the desired lactate shift.


Subject(s)
Cell Culture Techniques/methods , Culture Media/chemistry , Glutamine/metabolism , Lactic Acid/chemistry , Alanine , Animals , Aspartic Acid/metabolism , CHO Cells , Cell Proliferation , Cricetinae , Cricetulus , Metabolic Networks and Pathways , Phenotype , Serine/metabolism
15.
Methods Mol Biol ; 1472: 13-34, 2017.
Article in English | MEDLINE | ID: mdl-27671929

ABSTRACT

Codon optimization has been widely used for designing native or synthetic genes to enhance their expression in heterologous host organisms. We recently developed Codon Optimization On-Line (COOL) which is a web-based tool to provide multi-objective codon optimization functionality for synthetic gene design. COOL provides a simple and flexible interface for customizing codon optimization based on several design parameters such as individual codon usage, codon pairing, and codon adaptation index. User-defined sequences can also be compared against the COOL optimized ones to show the extent by which the user's sequences can be evaluated and further improved. The utility of COOL is demonstrated via a case study where the codon optimized sequence of an invertase enzyme is generated for the enhanced expression in E. coli.


Subject(s)
Escherichia coli Proteins/genetics , Genes, Synthetic , Software , beta-Fructofuranosidase/genetics , Cloning, Molecular , Codon , Enzyme Stability , Escherichia coli/enzymology , Escherichia coli/genetics , Gene Expression , Online Systems , Protein Engineering/methods
16.
Metabolomics ; 12: 109, 2016.
Article in English | MEDLINE | ID: mdl-27358602

ABSTRACT

INTRODUCTION: The human genome-scale metabolic reconstruction details all known metabolic reactions occurring in humans, and thereby holds substantial promise for studying complex diseases and phenotypes. Capturing the whole human metabolic reconstruction is an on-going task and since the last community effort generated a consensus reconstruction, several updates have been developed. OBJECTIVES: We report a new consensus version, Recon 2.2, which integrates various alternative versions with significant additional updates. In addition to re-establishing a consensus reconstruction, further key objectives included providing more comprehensive annotation of metabolites and genes, ensuring full mass and charge balance in all reactions, and developing a model that correctly predicts ATP production on a range of carbon sources. METHODS: Recon 2.2 has been developed through a combination of manual curation and automated error checking. Specific and significant manual updates include a respecification of fatty acid metabolism, oxidative phosphorylation and a coupling of the electron transport chain to ATP synthase activity. All metabolites have definitive chemical formulae and charges specified, and these are used to ensure full mass and charge reaction balancing through an automated linear programming approach. Additionally, improved integration with transcriptomics and proteomics data has been facilitated with the updated curation of relationships between genes, proteins and reactions. RESULTS: Recon 2.2 now represents the most predictive model of human metabolism to date as demonstrated here. Extensive manual curation has increased the reconstruction size to 5324 metabolites, 7785 reactions and 1675 associated genes, which now are mapped to a single standard. The focus upon mass and charge balancing of all reactions, along with better representation of energy generation, has produced a flux model that correctly predicts ATP yield on different carbon sources. CONCLUSION: Through these updates we have achieved the most complete and best annotated consensus human metabolic reconstruction available, thereby increasing the ability of this resource to provide novel insights into normal and disease states in human. The model is freely available from the Biomodels database (http://identifiers.org/biomodels.db/MODEL1603150001).

17.
Cell Syst ; 3(5): 434-443.e8, 2016 11 23.
Article in English | MEDLINE | ID: mdl-27883890

ABSTRACT

Chinese hamster ovary (CHO) cells dominate biotherapeutic protein production and are widely used in mammalian cell line engineering research. To elucidate metabolic bottlenecks in protein production and to guide cell engineering and bioprocess optimization, we reconstructed the metabolic pathways in CHO and associated them with >1,700 genes in the Cricetulus griseus genome. The genome-scale metabolic model based on this reconstruction, iCHO1766, and cell-line-specific models for CHO-K1, CHO-S, and CHO-DG44 cells provide the biochemical basis of growth and recombinant protein production. The models accurately predict growth phenotypes and known auxotrophies in CHO cells. With the models, we quantify the protein synthesis capacity of CHO cells and demonstrate that common bioprocess treatments, such as histone deacetylase inhibitors, inefficiently increase product yield. However, our simulations show that the metabolic resources in CHO are more than three times more efficiently utilized for growth or recombinant protein synthesis following targeted efforts to engineer the CHO secretory pathway. This model will further accelerate CHO cell engineering and help optimize bioprocesses.


Subject(s)
Genome , Animals , CHO Cells , Consensus , Cricetinae , Cricetulus , Humans , Metabolic Networks and Pathways , Recombinant Proteins
18.
Enzyme Microb Technol ; 75-76: 57-63, 2015.
Article in English | MEDLINE | ID: mdl-26047917

ABSTRACT

Various isoforms of invertases from prokaryotes, fungi, and higher plants has been expressed in Escherichia coli, and codon optimisation is a widely-adopted strategy for improvement of heterologous enzyme expression. Successful synthetic gene design for recombinant protein expression can be done by matching its translational elongation rate against heterologous host organisms via codon optimization. Amongst the various design parameters considered for the gene synthesis, codon context bias has been relatively overlooked compared to individual codon usage which is commonly adopted in most of codon optimization tools. In addition, matching the rates of transcription and translation based on secondary structure may lead to enhanced protein folding. In this study, we evaluated codon context fitness as design criterion for improving the expression of thermostable invertase from Thermotoga maritima in Escherichia coli and explored the relevance of secondary structure regions for folding and expression. We designed three coding sequences by using (1) a commercial vendor optimized gene algorithm, (2) codon context for the whole gene, and (3) codon context based on the secondary structure regions. Then, the codon optimized sequences were transformed and expressed in E. coli. From the resultant enzyme activities and protein yield data, codon context fitness proved to have the highest activity as compared to the wild-type control and other criteria while secondary structure-based strategy is comparable to the control. Codon context bias was shown to be a relevant parameter for enhancing enzyme production in Escherichia coli by codon optimization. Thus, we can effectively design synthetic genes within heterologous host organisms using this criterion.


Subject(s)
Escherichia coli/enzymology , Escherichia coli/genetics , Genes, Synthetic , beta-Fructofuranosidase/genetics , Codon/genetics , Enzyme Stability , Gene Expression , Industrial Microbiology , Models, Molecular , Protein Engineering , Protein Structure, Secondary , Recombinant Proteins/chemistry , Recombinant Proteins/genetics , Recombinant Proteins/metabolism , Temperature , beta-Fructofuranosidase/chemistry , beta-Fructofuranosidase/metabolism
SELECTION OF CITATIONS
SEARCH DETAIL