Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 10 de 10
Filter
1.
Nat Commun ; 15(1): 5141, 2024 Jun 20.
Article in English | MEDLINE | ID: mdl-38902262

ABSTRACT

A major challenge in protein design is to augment existing functional proteins with multiple property enhancements. Altering several properties likely necessitates numerous primary sequence changes, and novel methods are needed to accurately predict combinations of mutations that maintain or enhance function. Models of sequence co-variation (e.g., EVcouplings), which leverage extensive information about various protein properties and activities from homologous protein sequences, have proven effective for many applications including structure determination and mutation effect prediction. We apply EVcouplings to computationally design variants of the model protein TEM-1 ß-lactamase. Nearly all the 14 experimentally characterized designs were functional, including one with 84 mutations from the nearest natural homolog. The designs also had large increases in thermostability, increased activity on multiple substrates, and nearly identical structure to the wild type enzyme. This study highlights the efficacy of evolutionary models in guiding large sequence alterations to generate functional diversity for protein design applications.


Subject(s)
Evolution, Molecular , Mutation , Protein Engineering , beta-Lactamases , beta-Lactamases/genetics , beta-Lactamases/metabolism , beta-Lactamases/chemistry , Protein Engineering/methods , Models, Molecular , Amino Acid Sequence , Enzyme Stability , Protein Conformation
2.
Sci Rep ; 14(1): 14449, 2024 06 24.
Article in English | MEDLINE | ID: mdl-38914665

ABSTRACT

As genomic databases expand and artificial intelligence tools advance, there is a growing demand for efficient characterization of large numbers of proteins. To this end, here we describe a generalizable pipeline for high-throughput protein purification using small-scale expression in E. coli and an affordable liquid-handling robot. This low-cost platform enables the purification of 96 proteins in parallel with minimal waste and is scalable for processing hundreds of proteins weekly per user. We demonstrate the performance of this method with the expression and purification of the leading poly(ethylene terephthalate) hydrolases reported in the literature. Replicate experiments demonstrated reproducibility and enzyme purity and yields (up to 400 µg) sufficient for comprehensive analyses of both thermostability and activity, generating a standardized benchmark dataset for comparing these plastic-degrading enzymes. The cost-effectiveness and ease of implementation of this platform render it broadly applicable to diverse protein characterization challenges in the biological sciences.


Subject(s)
Escherichia coli , Robotics , Robotics/methods , Escherichia coli/genetics , Protein Engineering/methods , High-Throughput Screening Assays/methods , High-Throughput Screening Assays/economics , Hydrolases/metabolism , Hydrolases/chemistry , Hydrolases/genetics , Polyethylene Terephthalates/chemistry , Reproducibility of Results
3.
bioRxiv ; 2023 May 09.
Article in English | MEDLINE | ID: mdl-37214973

ABSTRACT

Designing optimized proteins is important for a range of practical applications. Protein design is a rapidly developing field that would benefit from approaches that enable many changes in the amino acid primary sequence, rather than a small number of mutations, while maintaining structure and enhancing function. Homologous protein sequences contain extensive information about various protein properties and activities that have emerged over billions of years of evolution. Evolutionary models of sequence co-variation, derived from a set of homologous sequences, have proven effective in a range of applications including structure determination and mutation effect prediction. In this work we apply one of these models (EVcouplings) to computationally design highly divergent variants of the model protein TEM-1 ß-lactamase, and characterize these designs experimentally using multiple biochemical and biophysical assays. Nearly all designed variants were functional, including one with 84 mutations from the nearest natural homolog. Surprisingly, all functional designs had large increases in thermostability and most had a broadening of available substrates. These property enhancements occurred while maintaining a nearly identical structure to the wild type enzyme. Collectively, this work demonstrates that evolutionary models of sequence co-variation (1) are able to capture complex epistatic interactions that successfully guide large sequence departures from natural contexts, and (2) can be applied to generate functional diversity useful for many applications in protein design.

4.
PLoS Comput Biol ; 16(7): e1007909, 2020 07.
Article in English | MEDLINE | ID: mdl-32667922

ABSTRACT

Cancer cells have genetic alterations that often directly affect intracellular protein signaling processes allowing them to bypass control mechanisms for cell death, growth and division. Cancer drugs targeting these alterations often work initially, but resistance is common. Combinations of targeted drugs may overcome or prevent resistance, but their selection requires context-specific knowledge of signaling pathways including complex interactions such as feedback loops and crosstalk. To infer quantitative pathway models, we collected a rich dataset on a melanoma cell line: Following perturbation with 54 drug combinations, we measured 124 (phospho-)protein levels and phenotypic response (cell growth, apoptosis) in a time series from 10 minutes to 67 hours. From these data, we trained time-resolved mathematical models that capture molecular interactions and the coupling of molecular levels to cellular phenotype, which in turn reveal the main direct or indirect molecular responses to each drug. Systematic model simulations identified novel combinations of drugs predicted to reduce the survival of melanoma cells, with partial experimental verification. This particular application of perturbation biology demonstrates the potential impact of combining time-resolved data with modeling for the discovery of new combinations of cancer drugs.


Subject(s)
Antineoplastic Agents/pharmacology , Melanoma , Phosphoproteins , Cell Line, Tumor , Cell Survival/drug effects , Drug Therapy, Combination , Humans , Models, Biological , Phosphoproteins/analysis , Phosphoproteins/metabolism , Signal Transduction/drug effects , Systems Biology
5.
Cell Syst ; 10(1): 15-24.e5, 2020 01 22.
Article in English | MEDLINE | ID: mdl-31838147

ABSTRACT

Natural evolution encodes rich information about the structure and function of biomolecules in the genetic record. Previously, statistical analysis of co-variation patterns in natural protein families has enabled the accurate computation of 3D structures. Here, we explored generating similar information by experimental evolution, starting from a single gene and performing multiple cycles of in vitro mutagenesis and functional selection in Escherichia coli. We evolved two antibiotic resistance proteins, ß-lactamase PSE1 and acetyltransferase AAC6, and obtained hundreds of thousands of diverse functional sequences. Using evolutionary coupling analysis, we inferred residue interaction constraints that were in agreement with contacts in known 3D structures, confirming genetic encoding of structural constraints in the selected sequences. Computational protein folding with interaction constraints then yielded 3D structures with the same fold as natural relatives. This work lays the foundation for a new experimental method (3Dseq) for protein structure determination, combining evolution experiments with inference of residue interactions from sequence information. A record of this paper's Transparent Peer Review process is included in the Supplemental Information.


Subject(s)
Evolution, Molecular , Proteins/chemistry , Humans , Protein Conformation
6.
Nat Genet ; 51(7): 1170-1176, 2019 07.
Article in English | MEDLINE | ID: mdl-31209393

ABSTRACT

We describe an experimental method of three-dimensional (3D) structure determination that exploits the increasing ease of high-throughput mutational scans. Inspired by the success of using natural, evolutionary sequence covariation to compute protein and RNA folds, we explored whether 'laboratory', synthetic sequence variation might also yield 3D structures. We analyzed five large-scale mutational scans and discovered that the pairs of residues with the largest positive epistasis in the experiments are sufficient to determine the 3D fold. We show that the strongest epistatic pairings from genetic screens of three proteins, a ribozyme and a protein interaction reveal 3D contacts within and between macromolecules. Using these experimental epistatic pairs, we compute ab initio folds for a GB1 domain (within 1.8 Å of the crystal structure) and a WW domain (2.1 Å). We propose strategies that reduce the number of mutants needed for contact prediction, suggesting that genomics-based techniques can efficiently predict 3D structure.


Subject(s)
Adaptor Proteins, Signal Transducing/chemistry , Bacterial Proteins/chemistry , Epistasis, Genetic , Mutation , Poly(A)-Binding Proteins/chemistry , Protein Conformation , RNA, Catalytic/chemistry , Saccharomyces cerevisiae Proteins/chemistry , Transcription Factors/chemistry , Adaptor Proteins, Signal Transducing/genetics , Bacterial Proteins/genetics , Humans , Poly(A)-Binding Proteins/genetics , Protein Domains , Protein Folding , RNA, Catalytic/genetics , Saccharomyces cerevisiae Proteins/genetics , Transcription Factors/genetics , YAP-Signaling Proteins
7.
Cell Syst ; 1(3): 197-209, 2015 Sep 23.
Article in English | MEDLINE | ID: mdl-27135912

ABSTRACT

In cancer genomics, recurrence of mutations in independent tumor samples is a strong indicator of functional impact. However, rare functional mutations can escape detection by recurrence analysis owing to lack of statistical power. We enhance statistical power by extending the notion of recurrence of mutations from single genes to gene families that share homologous protein domains. Domain mutation analysis also sharpens the functional interpretation of the impact of mutations, as domains more succinctly embody function than entire genes. By mapping mutations in 22 different tumor types to equivalent positions in multiple sequence alignments of domains, we confirm well-known functional mutation hotspots, identify uncharacterized rare variants in one gene that are equivalent to well-characterized mutations in another gene, detect previously unknown mutation hotspots, and provide hypotheses about molecular mechanisms and downstream effects of domain mutations. With the rapid expansion of cancer genomics projects, protein domain hotspot analysis will likely provide many more leads linking mutations in proteins to the cancer phenotype.

8.
PLoS Comput Biol ; 9(12): e1003290, 2013.
Article in English | MEDLINE | ID: mdl-24367245

ABSTRACT

We present a powerful experimental-computational technology for inferring network models that predict the response of cells to perturbations, and that may be useful in the design of combinatorial therapy against cancer. The experiments are systematic series of perturbations of cancer cell lines by targeted drugs, singly or in combination. The response to perturbation is quantified in terms of relative changes in the measured levels of proteins, phospho-proteins and cellular phenotypes such as viability. Computational network models are derived de novo, i.e., without prior knowledge of signaling pathways, and are based on simple non-linear differential equations. The prohibitively large solution space of all possible network models is explored efficiently using a probabilistic algorithm, Belief Propagation (BP), which is three orders of magnitude faster than standard Monte Carlo methods. Explicit executable models are derived for a set of perturbation experiments in SKMEL-133 melanoma cell lines, which are resistant to the therapeutically important inhibitor of RAF kinase. The resulting network models reproduce and extend known pathway biology. They empower potential discoveries of new molecular interactions and predict efficacious novel drug perturbations, such as the inhibition of PLK1, which is verified experimentally. This technology is suitable for application to larger systems in diverse areas of molecular biology.


Subject(s)
Models, Biological , Signal Transduction , Systems Biology , Cell Line, Tumor , Humans , Monte Carlo Method , Probability
9.
Nat Methods ; 10(8): 768-73, 2013 Aug.
Article in English | MEDLINE | ID: mdl-23817070

ABSTRACT

We report a technique to selectively and continuously label the proteomes of individual cell types in coculture, named cell type-specific labeling using amino acid precursors (CTAP). Through transgenic expression of exogenous amino acid biosynthesis enzymes, vertebrate cells overcome their dependence on supplemented essential amino acids and can be selectively labeled through metabolic incorporation of amino acids produced from heavy isotope-labeled precursors. When testing CTAP in several human and mouse cell lines, we could differentially label the proteomes of distinct cell populations in coculture and determine the relative expression of proteins by quantitative mass spectrometry. In addition, using CTAP we identified the cell of origin of extracellular proteins secreted from cells in coculture. We believe that this method, which allows linking of proteins to their cell source, will be useful in studies of cell-cell communication and potentially for discovery of biomarkers.


Subject(s)
Lysine/metabolism , Proteome/biosynthesis , Proteomics/methods , Animals , Base Sequence , Cell Line , Coculture Techniques/methods , Humans , Isotope Labeling/methods , Mice , Molecular Sequence Data , Oligonucleotide Array Sequence Analysis , Organisms, Genetically Modified , Proteome/genetics , RNA, Messenger/chemistry , RNA, Messenger/genetics , Sequence Analysis, DNA , Tandem Mass Spectrometry
10.
Breast Cancer Res ; 8(2): R23, 2006.
Article in English | MEDLINE | ID: mdl-16626501

ABSTRACT

INTRODUCTION: Predicting the clinical course of breast cancer is often difficult because it is a diverse disease comprised of many biological subtypes. Gene expression profiling by microarray analysis has identified breast cancer signatures that are important for prognosis and treatment. In the current article, we use microarray analysis and a real-time quantitative reverse-transcription (qRT)-PCR assay to risk-stratify breast cancers based on biological 'intrinsic' subtypes and proliferation. METHODS: Gene sets were selected from microarray data to assess proliferation and to classify breast cancers into four different molecular subtypes, designated Luminal, Normal-like, HER2+/ER-, and Basal-like. One-hundred and twenty-three breast samples (117 invasive carcinomas, one fibroadenoma and five normal tissues) and three breast cancer cell lines were prospectively analyzed using a microarray (Agilent) and a qRT-PCR assay comprised of 53 genes. Biological subtypes were assigned from the microarray and qRT-PCR data by hierarchical clustering. A proliferation signature was used as a single meta-gene (log2 average of 14 genes) to predict outcome within the context of estrogen receptor status and biological 'intrinsic' subtype. RESULTS: We found that the qRT-PCR assay could determine the intrinsic subtype (93% concordance with microarray-based assignments) and that the intrinsic subtypes were predictive of outcome. The proliferation meta-gene provided additional prognostic information for patients with the Luminal subtype (P = 0.0012), and for patients with estrogen receptor-positive tumors (P = 3.4 x 10-6). High proliferation in the Luminal subtype conferred a 19-fold relative risk of relapse (confidence interval = 95%) compared with Luminal tumors with low proliferation. CONCLUSION: A real-time qRT-PCR assay can recapitulate microarray classifications of breast cancer and can risk-stratify patients using the intrinsic subtype and proliferation. The proliferation meta-gene offers an objective and quantitative measurement for grade and adds significant prognostic information to the biological subtypes.


Subject(s)
Breast Neoplasms/classification , Breast Neoplasms/genetics , Reverse Transcriptase Polymerase Chain Reaction/methods , Breast Neoplasms/epidemiology , Breast Neoplasms/mortality , Breast Neoplasms/pathology , Cell Line, Tumor , Cohort Studies , Female , Humans , Neoplasm Invasiveness/genetics , Oligonucleotide Array Sequence Analysis , Risk Assessment , Survival Analysis
SELECTION OF CITATIONS
SEARCH DETAIL
...