Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 26
Filter
1.
Cell Syst ; 15(2): 193-203.e6, 2024 Feb 21.
Article in English | MEDLINE | ID: mdl-38340729

ABSTRACT

A strategy to obtain the greatest number of best-performing variants with least amount of experimental effort over the vast combinatorial mutational landscape would have enormous utility in boosting resource producibility for protein engineering. Toward this goal, we present a simple and effective machine learning-based strategy that outperforms other state-of-the-art methods. Our strategy integrates zero-shot prediction and multi-round sampling to direct active learning via experimenting with only a few predicted top variants. We find that four rounds of low-N pick-and-validate sampling of 12 variants for machine learning yielded the best accuracy of up to 92.6% in selecting the true top 1% variants in combinatorial mutant libraries, whereas two rounds of 24 variants can also be used. We demonstrate our strategy in successfully discovering high-performance protein variants from diverse families including the CRISPR-based genome editors, supporting its generalizable application for solving protein engineering tasks. A record of this paper's transparent peer review process is included in the supplemental information.


Subject(s)
Machine Learning , Protein Engineering , Humans , Mutation/genetics , Genome
3.
Nat Biomed Eng ; 8(3): 291-309, 2024 Mar.
Article in English | MEDLINE | ID: mdl-37996617

ABSTRACT

Mapping mutations and discovering cellular determinants that cause the spike protein of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) to induce infected cells to form syncytia would facilitate the development of strategies for blocking the formation of such cell-cell fusion. Here we describe high-throughput screening methods based on droplet microfluidics and the size-exclusion selection of syncytia, coupled with large-scale mutagenesis and genome-wide knockout screening via clustered regularly interspaced short palindromic repeats (CRISPR), for the large-scale identification of determinants of cell-cell fusion. We used the methods to perform deep mutational scans in spike-presenting cells to pinpoint mutable syncytium-enhancing substitutions in two regions of the spike protein (the fusion peptide proximal region and the furin-cleavage site). We also used a genome-wide CRISPR screen in cells expressing the receptor angiotensin-converting enzyme 2 to identify inhibitors of clathrin-mediated endocytosis that impede syncytium formation, which we validated in hamsters infected with SARS-CoV-2. Finding genetic and cellular determinants of the formation of syncytia may reveal insights into the physiological and pathological consequences of cell-cell fusion.


Subject(s)
COVID-19 , SARS-CoV-2 , Humans , High-Throughput Screening Assays , Spike Glycoprotein, Coronavirus/genetics , COVID-19/pathology , Giant Cells/metabolism , Giant Cells/pathology
4.
Cell Syst ; 14(12): 1103-1112.e6, 2023 12 20.
Article in English | MEDLINE | ID: mdl-38016465

ABSTRACT

The sequence in the 5' untranslated regions (UTRs) is known to affect mRNA translation rates. However, the underlying regulatory grammar remains elusive. Here, we propose MTtrans, a multi-task translation rate predictor capable of learning common sequence patterns from datasets across various experimental techniques. The core premise is that common motifs are more likely to be genuinely involved in translation control. MTtrans outperforms existing methods in both accuracy and the ability to capture transferable motifs across species, highlighting its strength in identifying evolutionarily conserved sequence motifs. Our independent fluorescence-activated cell sorting coupled with deep sequencing (FACS-seq) experiment validates the impact of most motifs identified by MTtrans. Additionally, we introduce "GRU-rewiring," a technique to interpret the hidden states of the recurrent units. Gated recurrent unit (GRU)-rewiring allows us to identify regulatory element-enriched positions and examine the local effects of 5' UTR mutations. MTtrans is a powerful tool for deciphering the translation regulatory motifs.


Subject(s)
Regulatory Sequences, Nucleic Acid , 5' Untranslated Regions/genetics , Conserved Sequence
5.
Cell Syst ; 14(5): 392-403.e4, 2023 05 17.
Article in English | MEDLINE | ID: mdl-37164010

ABSTRACT

Selecting the most suitable existing base editors and engineering new variants for installing specific base conversions with maximal efficiency and minimal undesired edits are pivotal for precise genome editing applications. Here, we present a platform for creating and analyzing a library of engineered base editor variants to enable head-to-head evaluation of their editing performance at scale. Our comprehensive comparison provides quantitative measures on each variant's editing efficiency, purity, motif preference, and bias in generating single and multiple base conversions, while uncovering undesired higher indel generation rate and noncanonical base conversion for some of the existing base editors. In addition to engineering the base editor protein, we further applied this platform to investigate a hitherto underexplored engineering route and created guide RNA scaffold variants that augment the editor's base-editing activity. With the unknown performance and compatibility of the growing number of engineered parts including deaminase, CRISPR-Cas enzyme, and guide RNA scaffold variants for assembling the expanding collection of base editor systems, our platform addresses the unmet need for an unbiased, scalable method to benchmark their editing outcomes and accelerate the engineering of next-generation precise genome editors.


Subject(s)
CRISPR-Cas Systems , Gene Editing , CRISPR-Cas Systems/genetics , Gene Editing/methods , Genome , Gene Library , RNA
7.
Essays Biochem ; 66(4): 305-318, 2022 09 16.
Article in English | MEDLINE | ID: mdl-35713228

ABSTRACT

Cancer stem cells (CSCs) are a subpopulation of tumor cells with self-renewal ability. Increasing evidence points to the critical roles of CSCs in tumorigenesis, metastasis, therapy resistance, and cancer relapse. As such, the elimination of CSCs improves cancer treatment outcomes. However, challenges remain due to limited understanding of the molecular mechanisms governing self-renewal and survival of CSCs. Clustered regularly interspaced short palindromic repeats (CRISPR)/Cas9 screening has been increasingly used to identify genetic determinants in cancers. In this primer, we discuss the progress made and emerging opportunities of coupling advanced CRISPR screening systems with CSC models to reveal the understudied vulnerabilities of CSCs.


Subject(s)
Early Detection of Cancer , Neoplasms , Humans , Neoplasms/genetics , Neoplasms/pathology , Neoplastic Stem Cells/pathology
8.
Nat Commun ; 13(1): 2219, 2022 04 25.
Article in English | MEDLINE | ID: mdl-35468907

ABSTRACT

The genome-editing Cas9 protein uses multiple amino-acid residues to bind the target DNA. Considering only the residues in proximity to the target DNA as potential sites to optimise Cas9's activity, the number of combinatorial variants to screen through is too massive for a wet-lab experiment. Here we generate and cross-validate ten in silico and experimental datasets of multi-domain combinatorial mutagenesis libraries for Cas9 engineering, and demonstrate that a machine learning-coupled engineering approach reduces the experimental screening burden by as high as 95% while enriching top-performing variants by ∼7.5-fold in comparison to the null model. Using this approach and followed by structure-guided engineering, we identify the N888R/A889Q variant conferring increased editing activity on the protospacer adjacent motif-relaxed KKH variant of Cas9 nuclease from Staphylococcus aureus (KKH-SaCas9) and its derived base editor in human cells. Our work validates a readily applicable workflow to enable resource-efficient high-throughput engineering of genome editor's activity.


Subject(s)
Bacterial Proteins , CRISPR-Cas Systems , Bacterial Proteins/metabolism , CRISPR-Cas Systems/genetics , DNA/metabolism , Humans , Machine Learning , Mutagenesis
9.
ACS Pharmacol Transl Sci ; 5(3): 149-155, 2022 Mar 11.
Article in English | MEDLINE | ID: mdl-35311017

ABSTRACT

As an important regulator of cell metabolism, proliferation, and survival, mTOR (mammalian target of rapamycin) signaling provides both a potential target for cancer treatment and a research tool for investigation of cell metabolism. One inhibitor for both mTORC1 and mTORC2 pathways, OSI-027, exhibited robust anticancer efficacy but induced side effects. Herein, we designed a photoactivatable OSI-027 prodrug, which allowed the release of OSI-027 after light irradiation to inhibit the mTOR signaling pathway, triggering autophagy and leading to cell death. This photoactivatable prodrug can provide novel strategies for mTOR-targeting cancer therapy and act as a new tool for investigating mTOR signaling and its related biological processes.

10.
Nucleic Acids Res ; 50(3): 1650-1660, 2022 02 22.
Article in English | MEDLINE | ID: mdl-35051997

ABSTRACT

The Cas9 nuclease from Staphylococcus aureus (SaCas9) holds great potential for use in gene therapy, and variants with increased fidelity have been engineered. However, we find that existing variants have not reached the greatest accuracy to discriminate base mismatches and exhibited much reduced activity when their mutations were grafted onto the KKH mutant of SaCas9 for editing an expanded set of DNA targets. We performed structure-guided combinatorial mutagenesis to re-engineer KKH-SaCas9 with enhanced accuracy. We uncover that introducing a Y239H mutation on KKH-SaCas9's REC domain substantially reduces off-target edits while retaining high on-target activity when added to a set of mutations on REC and RuvC domains that lessen its interactions with the target DNA strand. The Y239H mutation is modelled to have removed an interaction from the REC domain with the guide RNA backbone in the guide RNA-DNA heteroduplex structure. We further confirmed the greatly improved genome-wide editing accuracy and single-base mismatch discrimination of our engineered variants, named KKH-SaCas9-SAV1 and SAV2, in human cells. In addition to generating broadly useful KKH-SaCas9 variants with unprecedented accuracy, our findings demonstrate the feasibility for multi-domain combinatorial mutagenesis on SaCas9's DNA- and guide RNA- interacting residues to optimize its editing fidelity.


Subject(s)
CRISPR-Associated Protein 9/genetics , Gene Editing , Staphylococcus aureus , CRISPR-Cas Systems , Humans , Micrococcal Nuclease/genetics , RNA, Guide, Kinetoplastida , Staphylococcus aureus/genetics
11.
Cancer Res ; 81(24): 6219-6232, 2021 12 15.
Article in English | MEDLINE | ID: mdl-34666996

ABSTRACT

Systematic testing of existing drugs and their combinations is an attractive strategy to exploit approved drugs for repurposing and identifying the best actionable treatment options. To expedite the search among many possible drug combinations, we designed a combinatorial CRISPR-Cas9 screen to inhibit druggable targets. Coblockade of the N-methyl-d-aspartate receptor (NMDAR) with targets of first-line kinase inhibitors reduced hepatocellular carcinoma (HCC) cell growth. Clinically, HCC patients with low NMDAR1 expression showed better survival. The clinically approved NMDAR antagonist ifenprodil synergized with sorafenib to induce the unfolded protein response, trigger cell-cycle arrest, downregulate genes associated with WNT signaling and stemness, and reduce self-renewal ability of HCC cells. In multiple HCC patient-derived organoids and human tumor xenograft models, the drug combination, but neither single drug alone, markedly reduced tumor-initiating cancer cell frequency. Because ifenprodil has an established safety history for its use as a vasodilator in humans, our findings support the repurposing of this drug as an adjunct for HCC treatment to improve clinical outcome and reduce tumor recurrence. These results also validate an approach for readily discovering actionable combinations for cancer therapy. SIGNIFICANCE: Combinatorial CRISPR-Cas9 screening identifies actionable targets for HCC therapy, uncovering the potential of combining the clinically approved drugs ifenprodil and sorafenib as a new effective treatment regimen.


Subject(s)
Antineoplastic Combined Chemotherapy Protocols/pharmacology , Biomarkers, Tumor/metabolism , CRISPR-Cas Systems , Carcinoma, Hepatocellular/drug therapy , Gene Expression Regulation, Neoplastic/drug effects , Liver Neoplasms/drug therapy , Animals , Apoptosis , Biomarkers, Tumor/genetics , Carcinoma, Hepatocellular/metabolism , Carcinoma, Hepatocellular/pathology , Cell Proliferation , Humans , Liver Neoplasms/metabolism , Liver Neoplasms/pathology , Male , Mice , Mice, Inbred BALB C , Mice, Inbred C57BL , Mice, Inbred NOD , Mice, Nude , Mice, SCID , Piperidines/administration & dosage , Sorafenib/administration & dosage , Tumor Cells, Cultured , Xenograft Model Antitumor Assays
12.
Nat Commun ; 12(1): 4138, 2021 07 06.
Article in English | MEDLINE | ID: mdl-34230498

ABSTRACT

Despite significant clinical progress in cell and gene therapies, maximizing protein expression in order to enhance potency remains a major technical challenge. Here, we develop a high-throughput strategy to design, screen, and optimize 5' UTRs that enhance protein expression from a strong human cytomegalovirus (CMV) promoter. We first identify naturally occurring 5' UTRs with high translation efficiencies and use this information with in silico genetic algorithms to generate synthetic 5' UTRs. A total of ~12,000 5' UTRs are then screened using a recombinase-mediated integration strategy that greatly enhances the sensitivity of high-throughput screens by eliminating copy number and position effects that limit lentiviral approaches. Using this approach, we identify three synthetic 5' UTRs that outperform commonly used non-viral gene therapy plasmids in expressing protein payloads. In summary, we demonstrate that high-throughput screening of 5' UTR libraries with recombinase-mediated integration can identify genetic elements that enhance protein expression, which should have numerous applications for engineered cell and gene therapies.


Subject(s)
5' Untranslated Regions/genetics , Genetic Engineering , Genetic Therapy , Algorithms , Cell Line , Gene Expression , HEK293 Cells , High-Throughput Screening Assays , Humans , Plasmids , Promoter Regions, Genetic , Recombinases
14.
STAR Protoc ; 2(1): 100255, 2021 03 19.
Article in English | MEDLINE | ID: mdl-33490975

ABSTRACT

The CRISPR-Cas system coupled with Combinatorial Genetics En Masse (CombiGEM) enables systematic analysis of high-order genetic perturbations that are important for understanding biological processes and discovering therapeutic target combinations. Here, we present detailed steps and technical considerations for building multiplexed guide RNA libraries and carrying out a combinatorial CRISPR screen in mammalian cells. We also present an analytical pipeline, CombiPIPE, for mapping two- and three-way genetic interactions. For complete details on the use and execution of this protocol, please refer to Zhou et al. (2020).


Subject(s)
CRISPR-Cas Systems , Genetic Testing , Animals , Cell Line , Humans , RNA, Guide, Kinetoplastida/genetics , RNA, Guide, Kinetoplastida/metabolism
15.
Adv Genet (Hoboken) ; 2(4): 2100038, 2021 Dec.
Article in English | MEDLINE | ID: mdl-36619853

ABSTRACT

Protein design plays an important role in recent medical advances from antibody therapy to vaccine design. Typically, exhaustive mutational screens or directed evolution experiments are used for the identification of the best design or for improvements to the wild-type variant. Even with a high-throughput screening on pooled libraries and Next-Generation Sequencing to boost the scale of read-outs, surveying all the variants with combinatorial mutations for their empirical fitness scores is still of magnitudes beyond the capacity of existing experimental settings. To tackle this challenge, in-silico approaches using machine learning to predict the fitness of novel variants based on a subset of empirical measurements are now employed. These machine learning models turn out to be useful in many cases, with the premise that the experimentally determined fitness scores and the amino-acid descriptors of the models are informative. The machine learning models can guide the search for the highest fitness variants, resolve complex epistatic relationships, and highlight bio-physical rules for protein folding. Using machine learning-guided approaches, researchers can build more focused libraries, thus relieving themselves from labor-intensive screens and fast-tracking the optimization process. Here, we describe the current advances in massive-scale variant screens, and how machine learning and mutagenesis strategies can be integrated to accelerate protein engineering. More specifically, we examine strategies to make screens more economical, informative, and effective in discovery of useful variants.

16.
Methods Mol Biol ; 2199: 3-12, 2021.
Article in English | MEDLINE | ID: mdl-33125641

ABSTRACT

Exploring how combinatorial mutations can be combined to optimize protein functions is important to guide protein engineering. Given the vast combinatorial space of changing multiple amino acids, identifying the top-performing variants from a large number of mutants might not be possible without a high-throughput gene assembly and screening strategy. Here we describe the CombiSEAL platform, a strategy that allows for modularization of any protein sequence into multiple segments for mutagenesis and barcoding, and seamless single-pot ligations of different segments to generate a library of combination mutants linked with concatenated barcodes at one end. By reading the barcodes using next-generation sequencing, activities of each protein variant during the protein selection process can be easily tracked in a high-throughput manner. CombiSEAL not only allows the identification of better protein variants but also enables the systematic analyses to distinguish the beneficial, deleterious, and neutral effects of combining different mutations on protein functions.


Subject(s)
High-Throughput Nucleotide Sequencing , Mutagenesis , Protein Engineering , Recombinant Proteins/genetics
17.
Cell Rep ; 32(6): 108020, 2020 08 11.
Article in English | MEDLINE | ID: mdl-32783942

ABSTRACT

We present a CRISPR-based multi-gene knockout screening system and toolkits for extensible assembly of barcoded high-order combinatorial guide RNA libraries en masse. We apply this system for systematically identifying not only pairwise but also three-way synergistic therapeutic target combinations and successfully validate double- and triple-combination regimens for suppression of cancer cell growth and protection against Parkinson's disease-associated toxicity. This system overcomes the practical challenges of experimenting on a large number of high-order genetic and drug combinations and can be applied to uncover the rare synergistic interactions between druggable targets.


Subject(s)
CRISPR-Cas Systems , Drug Combinations , Drug Delivery Systems/methods , High-Throughput Screening Assays/methods , Animals , Antineoplastic Agents/pharmacology , Drosophila melanogaster , Gene Knockout Techniques , HEK293 Cells , Humans , Mice , Neoplasms/drug therapy , Parkinson Disease/drug therapy , RNA, Guide, Kinetoplastida
18.
Nat Methods ; 16(8): 789, 2019 Aug.
Article in English | MEDLINE | ID: mdl-31337886

ABSTRACT

An amendment to this paper has been published and can be accessed via a link at the top of the paper.

19.
Nat Methods ; 16(8): 722-730, 2019 08.
Article in English | MEDLINE | ID: mdl-31308554

ABSTRACT

The combined effect of multiple mutations on protein function is hard to predict; thus, the ability to functionally assess a vast number of protein sequence variants would be practically useful for protein engineering. Here we present a high-throughput platform that enables scalable assembly and parallel characterization of barcoded protein variants with combinatorial modifications. We demonstrate this platform, which we name CombiSEAL, by systematically characterizing a library of 948 combination mutants of the widely used Streptococcus pyogenes Cas9 (SpCas9) nuclease to optimize its genome-editing activity in human cells. The ease with which the editing activities of the pool of SpCas9 variants can be assessed at multiple on- and off-target sites accelerates the identification of optimized variants and facilitates the study of mutational epistasis. We successfully identify Opti-SpCas9, which possesses enhanced editing specificity without sacrificing potency and broad targeting range. This platform is broadly applicable for engineering proteins through combinatorial modifications en masse.


Subject(s)
CRISPR-Associated Protein 9/genetics , CRISPR-Cas Systems/genetics , Gene Editing , Mutagenesis , Mutation , RNA, Guide, Kinetoplastida/genetics , Software , Humans , Protein Engineering , Streptococcus pyogenes/enzymology , Substrate Specificity
20.
FASEB J ; 33(1): 484-493, 2019 01.
Article in English | MEDLINE | ID: mdl-30004796

ABSTRACT

Like conventional transplants, immunosuppression is required to facilitate survival and function of human embryonic stem cell (hESC) derivatives after implantation into xenogeneic recipients. We have previously reported that T cells alone are sufficient to reject allogeneic murine ESC derivatives; and strategies that inhibit T-cell activation, including coreceptor and costimulation blockade, prevent hESC derivatives from being rejected. This study aimed to investigate, in addition to T cells, whether macrophages contribute to transplant rejection of hESC xenografts with nonobese diabetic (NOD)/SCID mice that lack functional T and B cells but have macrophages. We show that acute rejection against hESC-derived endothelial cells (hESC-ECs) was mediated, to some degree, by infiltrating macrophages that phagocytosed them. Transgenic expression of murine CD47 on cell surface of hESC-ECs mitigates macrophage-mediated phagocytosis and improves their survival after transplantation. Our results highlight that innate immune cells, such as macrophages, can reject hESC derivatives, raising concern against the use of NOD/SCID as transplant recipients for testing in vivo function of hESC-derived tissues. Augmenting CD47 signaling promotes survival and function of hESC derivatives after xenogeneic transplantation.-Leung, C. S., Li, J., Xu, F., Wong, A. S. L., Lui, K. O. Ectopic expression of recipient CD47 inhibits mouse macrophage-mediated immune rejection against human stem cell transplants.


Subject(s)
CD47 Antigen/metabolism , Ectopic Gene Expression , Embryonic Stem Cells/cytology , Graft Rejection/prevention & control , Immune Tolerance/immunology , Macrophages/immunology , Stem Cell Transplantation/methods , Animals , CD47 Antigen/genetics , Cells, Cultured , Graft Rejection/immunology , Graft Rejection/metabolism , Graft Survival , Humans , Macrophages/metabolism , Mice , Mice, Inbred C57BL , Mice, Inbred ICR , Mice, Inbred NOD , Mice, SCID , Phagocytosis , Transplantation, Heterologous
SELECTION OF CITATIONS
SEARCH DETAIL
...