Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 33
Filter
1.
Cell ; 158(6): 1402-1414, 2014 Sep 11.
Article in English | MEDLINE | ID: mdl-25215495

ABSTRACT

In complex biological systems, small molecules often mediate microbe-microbe and microbe-host interactions. Using a systematic approach, we identified 3,118 small-molecule biosynthetic gene clusters (BGCs) in genomes of human-associated bacteria and studied their representation in 752 metagenomic samples from the NIH Human Microbiome Project. Remarkably, we discovered that BGCs for a class of antibiotics in clinical trials, thiopeptides, are widely distributed in genomes and metagenomes of the human microbiota. We purified and solved the structure of a thiopeptide antibiotic, lactocillin, from a prominent member of the vaginal microbiota. We demonstrate that lactocillin has potent antibacterial activity against a range of Gram-positive vaginal pathogens, and we show that lactocillin and other thiopeptide BGCs are expressed in vivo by analyzing human metatranscriptomic sequencing data. Our findings illustrate the widespread distribution of small-molecule-encoding BGCs in the human microbiome, and they demonstrate the bacterial production of drug-like molecules in humans. PAPERCLIP:


Subject(s)
Bacteria/chemistry , Bacteria/genetics , Metagenomics/methods , Microbiota , Amino Acid Sequence , Bacteria/classification , Bacteria/metabolism , Biosynthetic Pathways , Gastrointestinal Tract/microbiology , Humans , Molecular Sequence Data , Mouth/microbiology , Multigene Family , Peptide Biosynthesis, Nucleic Acid-Independent , Polyketides/analysis
2.
Cell ; 158(5): 1123-1135, 2014 08 28.
Article in English | MEDLINE | ID: mdl-25171412

ABSTRACT

Eukaryotic translation initiation requires the recruitment of the large, multiprotein eIF3 complex to the 40S ribosomal subunit. We present X-ray structures of all major components of the minimal, six-subunit Saccharomyces cerevisiae eIF3 core. These structures, together with electron microscopy reconstructions, cross-linking coupled to mass spectrometry, and integrative structure modeling, allowed us to position and orient all eIF3 components on the 40S⋅eIF1 complex, revealing an extended, modular arrangement of eIF3 subunits. Yeast eIF3 engages 40S in a clamp-like manner, fully encircling 40S to position key initiation factors on opposite ends of the mRNA channel, providing a platform for the recruitment, assembly, and regulation of the translation initiation machinery. The structures of eIF3 components reported here also have implications for understanding the architecture of the mammalian 43S preinitiation complex and the complex of eIF3, 40S, and the hepatitis C internal ribosomal entry site RNA.


Subject(s)
Eukaryotic Initiation Factor-1/chemistry , Eukaryotic Initiation Factor-3/chemistry , Peptide Chain Initiation, Translational , Ribosome Subunits, Small, Eukaryotic/chemistry , Saccharomyces cerevisiae Proteins/chemistry , Saccharomyces cerevisiae/chemistry , Amino Acid Sequence , Animals , Crystallography, X-Ray , Dimerization , Eukaryotic Initiation Factor-1/metabolism , Eukaryotic Initiation Factor-3/metabolism , Hepacivirus/chemistry , Humans , Mammals/metabolism , Microscopy, Electron , Models, Molecular , Molecular Sequence Data , Ribonucleoproteins/chemistry , Ribosome Subunits, Small, Eukaryotic/metabolism , Saccharomyces cerevisiae/metabolism , Saccharomyces cerevisiae Proteins/metabolism , Sequence Alignment
3.
Cell ; 158(2): 412-421, 2014 Jul 17.
Article in English | MEDLINE | ID: mdl-25036635

ABSTRACT

Although biosynthetic gene clusters (BGCs) have been discovered for hundreds of bacterial metabolites, our knowledge of their diversity remains limited. Here, we used a novel algorithm to systematically identify BGCs in the extensive extant microbial sequencing data. Network analysis of the predicted BGCs revealed large gene cluster families, the vast majority uncharacterized. We experimentally characterized the most prominent family, consisting of two subfamilies of hundreds of BGCs distributed throughout the Proteobacteria; their products are aryl polyenes, lipids with an aryl head group conjugated to a polyene tail. We identified a distant relationship to a third subfamily of aryl polyene BGCs, and together the three subfamilies represent the largest known family of biosynthetic gene clusters, with more than 1,000 members. Although these clusters are widely divergent in sequence, their small molecule products are remarkably conserved, indicating for the first time the important roles these compounds play in Gram-negative cell biology.


Subject(s)
Algorithms , Bacteria/genetics , Bacteria/metabolism , Bacteria/chemistry , Bacteria/classification , Mutation , Oxidative Stress , Phylogeny , Secondary Metabolism
4.
Mod Pathol ; 37(2): 100377, 2024 Feb.
Article in English | MEDLINE | ID: mdl-37926422

ABSTRACT

Conventional histopathology involves expensive and labor-intensive processes that often consume tissue samples, rendering them unavailable for other analyses. We present a novel end-to-end workflow for pathology powered by hyperspectral microscopy and deep learning. First, we developed a custom hyperspectral microscope to nondestructively image the autofluorescence of unstained tissue sections. We then trained a deep learning model to use autofluorescence to generate virtual histologic stains, which avoids the cost and variability of chemical staining procedures and conserves tissue samples. We showed that the virtual images reproduce the histologic features present in the real-stained images using a randomized nonalcoholic steatohepatitis (NASH) scoring comparison study, where both real and virtual stains are scored by pathologists (D.T., A.D.B., R.K.P.). The test showed moderate-to-good concordance between pathologists' scoring on corresponding real and virtual stains. Finally, we developed deep learning-based models for automated NASH Clinical Research Network score prediction. We showed that the end-to-end automated pathology platform is comparable with an independent panel of pathologists for NASH Clinical Research Network scoring when evaluated against the expert pathologist consensus scores. This study provides proof of concept for this virtual staining strategy, which could improve cost, efficiency, and reliability in pathology and enable novel approaches to spatial biology research.


Subject(s)
Deep Learning , Non-alcoholic Fatty Liver Disease , Humans , Microscopy , Reproducibility of Results , Pathologists
5.
Mod Pathol ; : 100573, 2024 Jul 26.
Article in English | MEDLINE | ID: mdl-39069201

ABSTRACT

The tissue diagnosis of adenocarcinoma and intraductal carcinoma of the prostate (IDC-P) includes Gleason grading of tumor morphology on the hematoxylin and eosin (H&E) stain, and immunohistochemistry (IHC) markers on the PIN-4 stain (CK5/6, P63, AMACR). In this work, we create an automated system for producing both virtual H&E and PIN-4 IHC stains from unstained prostate tissue using a high-throughput hyperspectral fluorescence microscope and artificial intelligence & machine learning. We demonstrate that the virtual stainer models can produce high-quality images suitable for diagnosis by genitourinary pathologists. Specifically, we validate our system through extensive human review and computational analysis, using a previously-validated Gleason scoring model, and an expert panel, on a large dataset of test slides. This study extends our previous work on virtual staining from autofluorescence, demonstrates the clinical utility of this technology for prostate cancer, and exemplifies a rigorous standard of qualitative and quantitative evaluation for digital pathology.

7.
PLoS Biol ; 18(12): e3001026, 2020 12.
Article in English | MEDLINE | ID: mdl-33351797

ABSTRACT

Microbial natural products constitute a wide variety of chemical compounds, many which can have antibiotic, antiviral, or anticancer properties that make them interesting for clinical purposes. Natural product classes include polyketides (PKs), nonribosomal peptides (NRPs), and ribosomally synthesized and post-translationally modified peptides (RiPPs). While variants of biosynthetic gene clusters (BGCs) for known classes of natural products are easy to identify in genome sequences, BGCs for new compound classes escape attention. In particular, evidence is accumulating that for RiPPs, subclasses known thus far may only represent the tip of an iceberg. Here, we present decRiPPter (Data-driven Exploratory Class-independent RiPP TrackER), a RiPP genome mining algorithm aimed at the discovery of novel RiPP classes. DecRiPPter combines a Support Vector Machine (SVM) that identifies candidate RiPP precursors with pan-genomic analyses to identify which of these are encoded within operon-like structures that are part of the accessory genome of a genus. Subsequently, it prioritizes such regions based on the presence of new enzymology and based on patterns of gene cluster and precursor peptide conservation across species. We then applied decRiPPter to mine 1,295 Streptomyces genomes, which led to the identification of 42 new candidate RiPP families that could not be found by existing programs. One of these was studied further and elucidated as a representative of a novel subfamily of lanthipeptides, which we designate class V. The 2D structure of the new RiPP, which we name pristinin A3 (1), was solved using nuclear magnetic resonance (NMR), tandem mass spectrometry (MS/MS) data, and chemical labeling. Two previously unidentified modifying enzymes are proposed to create the hallmark lanthionine bridges. Taken together, our work highlights how novel natural product families can be discovered by methods going beyond sequence similarity searches to integrate multiple pathway discovery criteria.


Subject(s)
Bacteriocins/genetics , Genomics/methods , Protein Processing, Post-Translational/genetics , Algorithms , Bacteriocins/metabolism , Biological Products/analysis , Biological Products/metabolism , Computational Biology/methods , Genome/genetics , Machine Learning , Multigene Family/genetics , Peptides/genetics , Protein Processing, Post-Translational/physiology , Ribosomes/metabolism
8.
Mol Cell ; 59(5): 794-806, 2015 Sep 03.
Article in English | MEDLINE | ID: mdl-26340423

ABSTRACT

TFIIH is essential for both RNA polymerase II transcription and DNA repair, and mutations in TFIIH can result in human disease. Here, we determine the molecular architecture of human and yeast TFIIH by an integrative approach using chemical crosslinking/mass spectrometry (CXMS) data, biochemical analyses, and previously published electron microscopy maps. We identified four new conserved "topological regions" that function as hubs for TFIIH assembly and more than 35 conserved topological features within TFIIH, illuminating a network of interactions involved in TFIIH assembly and regulation of its activities. We show that one of these conserved regions, the p62/Tfb1 Anchor region, directly interacts with the DNA helicase subunit XPD/Rad3 in native TFIIH and is required for the integrity and function of TFIIH. We also reveal the structural basis for defects in patients with xeroderma pigmentosum and trichothiodystrophy, with mutations found at the interface between the p62 Anchor region and the XPD subunit.


Subject(s)
Saccharomyces cerevisiae Proteins/chemistry , Saccharomyces cerevisiae Proteins/metabolism , Transcription Factor TFIIH/chemistry , Transcription Factor TFIIH/metabolism , Cross-Linking Reagents , DNA Helicases/chemistry , DNA Helicases/genetics , DNA Helicases/metabolism , DNA Repair , Humans , Mass Spectrometry , Models, Molecular , Mutation , Protein Interaction Domains and Motifs , Protein Subunits , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae/metabolism , Saccharomyces cerevisiae Proteins/genetics , Transcription Factor TFIIH/genetics , Transcription Factors, TFII/chemistry , Transcription Factors, TFII/genetics , Transcription Factors, TFII/metabolism , Transcription, Genetic , Xeroderma Pigmentosum/genetics , Xeroderma Pigmentosum/metabolism , Xeroderma Pigmentosum Group D Protein/chemistry , Xeroderma Pigmentosum Group D Protein/genetics , Xeroderma Pigmentosum Group D Protein/metabolism
9.
Nat Methods ; 16(6): 519-525, 2019 06.
Article in English | MEDLINE | ID: mdl-31133761

ABSTRACT

Peptide fragmentation spectra are routinely predicted in the interpretation of mass-spectrometry-based proteomics data. However, the generation of fragment ions has not been understood well enough for scientists to estimate fragment ion intensities accurately. Here, we demonstrate that machine learning can predict peptide fragmentation patterns in mass spectrometers with accuracy within the uncertainty of measurement. Moreover, analysis of our models reveals that peptide fragmentation depends on long-range interactions within a peptide sequence. We illustrate the utility of our models by applying them to the analysis of both data-dependent and data-independent acquisition datasets. In the former case, we observe a q-value-dependent increase in the total number of peptide identifications. In the latter case, we confirm that the use of predicted tandem mass spectrometry spectra is nearly equivalent to the use of spectra from experimental libraries.


Subject(s)
Biomarkers/blood , Data Analysis , Peptide Fragments/analysis , Peptide Library , Proteome/analysis , Software , Tandem Mass Spectrometry/methods , Algorithms , Amino Acid Sequence , Databases, Protein , HeLa Cells , Humans , Peptide Fragments/metabolism , Proteome/metabolism
10.
Nat Rev Genet ; 14(12): 865-79, 2013 Dec.
Article in English | MEDLINE | ID: mdl-24197012

ABSTRACT

Proteins are not monolithic entities; rather, they can contain multiple domains that mediate distinct interactions, and their functionality can be regulated through post-translational modifications at multiple distinct sites. Traditionally, network biology has ignored such properties of proteins and has instead examined either the physical interactions of whole proteins or the consequences of removing entire genes. In this Review, we discuss experimental and computational methods to increase the resolution of protein-protein, genetic and drug-gene interaction studies to the domain and residue levels. Such work will be crucial for using interaction networks to connect sequence and structural information, and to understand the biological consequences of disease-associated mutations, which will hopefully lead to more effective therapeutic strategies.


Subject(s)
Gene Expression Regulation/drug effects , Gene Regulatory Networks , Protein Interaction Maps , Animals , Chromosome Mapping , Computational Biology , Gene-Environment Interaction , Humans , Models, Genetic , Protein Interaction Domains and Motifs , Sequence Analysis, DNA
11.
Mol Cell Proteomics ; 16(5): 840-854, 2017 05.
Article in English | MEDLINE | ID: mdl-28292943

ABSTRACT

The 26S proteasome is the macromolecular machine responsible for ATP/ubiquitin dependent degradation. As aberration in proteasomal degradation has been implicated in many human diseases, structural analysis of the human 26S proteasome complex is essential to advance our understanding of its action and regulation mechanisms. In recent years, cross-linking mass spectrometry (XL-MS) has emerged as a powerful tool for elucidating structural topologies of large protein assemblies, with its unique capability of studying protein complexes in cells. To facilitate the identification of cross-linked peptides, we have previously developed a robust amine reactive sulfoxide-containing MS-cleavable cross-linker, disuccinimidyl sulfoxide (DSSO). To better understand the structure and regulation of the human 26S proteasome, we have established new DSSO-based in vivo and in vitro XL-MS workflows by coupling with HB-tag based affinity purification to comprehensively examine protein-protein interactions within the 26S proteasome. In total, we have identified 447 unique lysine-to-lysine linkages delineating 67 interprotein and 26 intraprotein interactions, representing the largest cross-link dataset for proteasome complexes. In combination with EM maps and computational modeling, the architecture of the 26S proteasome was determined to infer its structural dynamics. In particular, three proteasome subunits Rpn1, Rpn6, and Rpt6 displayed multiple conformations that have not been previously reported. Additionally, cross-links between proteasome subunits and 15 proteasome interacting proteins including 9 known and 6 novel ones have been determined to demonstrate their physical interactions at the amino acid level. Our results have provided new insights on the dynamics of the 26S human proteasome and the methodologies presented here can be applied to study other protein complexes.


Subject(s)
Proteasome Endopeptidase Complex/chemistry , Proteasome Endopeptidase Complex/metabolism , Cell Line , Humans , Models, Molecular , Protein Binding , Protein Interaction Mapping , Protein Interaction Maps , Reproducibility of Results , Tandem Mass Spectrometry
12.
Genes Dev ; 25(20): 2158-72, 2011 Oct 15.
Article in English | MEDLINE | ID: mdl-22012619

ABSTRACT

Various cyclin-dependent kinase (Cdk) complexes have been implicated in the regulation of transcription. In this study, we identified a 70-kDa Cyclin K (CycK) that binds Cdk12 and Cdk13 to form two different complexes (CycK/Cdk12 or CycK/Cdk13) in human cells. The CycK/Cdk12 complex regulates phosphorylation of Ser2 in the C-terminal domain of RNA polymerase II and expression of a small subset of human genes, as revealed in expression microarrays. Depletion of CycK/Cdk12 results in decreased expression of predominantly long genes with high numbers of exons. The most prominent group of down-regulated genes are the DNA damage response genes, including the critical regulators of genomic stability: BRCA1 (breast and ovarian cancer type 1 susceptibility protein 1), ATR (ataxia telangiectasia and Rad3-related), FANCI, and FANCD2. We show that CycK/Cdk12, rather than CycK/Cdk13, is necessary for their expression. Nuclear run-on assays and chromatin immunoprecipitations with RNA polymerase II on the BRCA1 and FANCI genes suggest a transcriptional defect in the absence of CycK/Cdk12. Consistent with these findings, cells without CycK/Cdk12 induce spontaneous DNA damage and are sensitive to a variety of DNA damage agents. We conclude that through regulation of expression of DNA damage response genes, CycK/Cdk12 protects cells from genomic instability. The essential role of CycK for organisms in vivo is further supported by the result that genetic inactivation of CycK in mice causes early embryonic lethality.


Subject(s)
Cyclin-Dependent Kinases/metabolism , Cyclins/metabolism , DNA Damage/genetics , Gene Expression Regulation, Developmental , Genomic Instability , Animals , CDC2 Protein Kinase/metabolism , Cyclin-Dependent Kinase 9/metabolism , Cyclin-Dependent Kinases/genetics , Cyclins/genetics , HEK293 Cells , HeLa Cells , Humans , Mice , Mice, Knockout , Phosphorylation
13.
J Biol Chem ; 292(39): 16310-16320, 2017 09 29.
Article in English | MEDLINE | ID: mdl-28821611

ABSTRACT

Oxidative stress has been implicated in multiple human neurological and other disorders. Proteasomes are multi-subunit proteases critical for the removal of oxidatively damaged proteins. To understand stress-associated human pathologies, it is important to uncover the molecular events underlying the regulation of proteasomes upon oxidative stress. To this end, we investigated H2O2 stress-induced molecular changes of the human 26S proteasome and determined that stress-induced 26S proteasome disassembly is conserved from yeast to human. Moreover, we developed and employed a new proteomic approach, XAP (in vivo cross-linking-assisted affinity purification), coupled with stable isotope labeling with amino acids in cell culture (SILAC)-based quantitative MS, to capture and quantify several weakly bound proteasome-interacting proteins and examine their roles in stress-mediated proteasomal remodeling. Our results indicate that the adapter protein Ecm29 is the main proteasome-interacting protein responsible for stress-triggered remodeling of the 26S proteasome in human cells. Importantly, using a disuccinimidyl sulfoxide-based cross-linking MS platform, we mapped the interactions of Ecm29 within itself and with proteasome subunits and determined the architecture of the Ecm29-proteasome complex with integrative structure modeling. These results enabled us to propose a structural model in which Ecm29 intrudes on the interaction between the 20S core particle and the 19S regulatory particle in the 26S proteasome, disrupting the proteasome structure in response to oxidative stress.


Subject(s)
Models, Molecular , Oxidative Stress , Proteasome Endopeptidase Complex/metabolism , ATPases Associated with Diverse Cellular Activities , Adaptor Proteins, Signal Transducing/chemistry , Adaptor Proteins, Signal Transducing/genetics , Adaptor Proteins, Signal Transducing/metabolism , Affinity Labels , Cross-Linking Reagents/pharmacology , HEK293 Cells , Humans , Isotope Labeling , LIM Domain Proteins/chemistry , LIM Domain Proteins/genetics , LIM Domain Proteins/metabolism , Proteasome Endopeptidase Complex/chemistry , Proteasome Endopeptidase Complex/genetics , Protein Conformation , Protein Interaction Domains and Motifs , Protein Interaction Mapping , Protein Multimerization , Proteolysis , RNA Interference , Recombinant Fusion Proteins/chemistry , Recombinant Fusion Proteins/metabolism , Tandem Mass Spectrometry , Transcription Factors/chemistry , Transcription Factors/genetics , Transcription Factors/metabolism , Ubiquitins/chemistry , Ubiquitins/genetics , Ubiquitins/metabolism
14.
Biophys J ; 113(11): 2344-2353, 2017 Dec 05.
Article in English | MEDLINE | ID: mdl-29211988

ABSTRACT

Modeling of macromolecular structures involves structural sampling guided by a scoring function, resulting in an ensemble of good-scoring models. By necessity, the sampling is often stochastic, and must be exhaustive at a precision sufficient for accurate modeling and assessment of model uncertainty. Therefore, the very first step in analyzing the ensemble is an estimation of the highest precision at which the sampling is exhaustive. Here, we present an objective and automated method for this task. As a proxy for sampling exhaustiveness, we evaluate whether two independently and stochastically generated sets of models are sufficiently similar. The protocol includes testing 1) convergence of the model score, 2) whether model scores for the two samples were drawn from the same parent distribution, 3) whether each structural cluster includes models from each sample proportionally to its size, and 4) whether there is sufficient structural similarity between the two model samples in each cluster. The evaluation also provides the sampling precision, defined as the smallest clustering threshold that satisfies the third, most stringent test. We validate the protocol with the aid of enumerated good-scoring models for five illustrative cases of binary protein complexes. Passing the proposed four tests is necessary, but not sufficient for thorough sampling. The protocol is general in nature and can be applied to the stochastic sampling of any set of models, not just structural models. In addition, the tests can be used to stop stochastic sampling as soon as exhaustiveness at desired precision is reached, thereby improving sampling efficiency; they may also help in selecting a model representation that is sufficiently detailed to be informative, yet also sufficiently coarse for sampling to be exhaustive.


Subject(s)
Macromolecular Substances/chemistry , Models, Molecular , Cluster Analysis , Stochastic Processes
15.
Nature ; 481(7381): 371-5, 2011 Dec 21.
Article in English | MEDLINE | ID: mdl-22190037

ABSTRACT

Restriction factors, such as the retroviral complementary DNA deaminase APOBEC3G, are cellular proteins that dominantly block virus replication. The AIDS virus, human immunodeficiency virus type 1 (HIV-1), produces the accessory factor Vif, which counteracts the host's antiviral defence by hijacking a ubiquitin ligase complex, containing CUL5, ELOC, ELOB and a RING-box protein, and targeting APOBEC3G for degradation. Here we reveal, using an affinity tag/purification mass spectrometry approach, that Vif additionally recruits the transcription cofactor CBF-ß to this ubiquitin ligase complex. CBF-ß, which normally functions in concert with RUNX DNA binding proteins, allows the reconstitution of a recombinant six-protein assembly that elicits specific polyubiquitination activity with APOBEC3G, but not the related deaminase APOBEC3A. Using RNA knockdown and genetic complementation studies, we also demonstrate that CBF-ß is required for Vif-mediated degradation of APOBEC3G and therefore for preserving HIV-1 infectivity. Finally, simian immunodeficiency virus (SIV) Vif also binds to and requires CBF-ß to degrade rhesus macaque APOBEC3G, indicating functional conservation. Methods of disrupting the CBF-ß-Vif interaction might enable HIV-1 restriction and provide a supplement to current antiviral therapies that primarily target viral proteins.


Subject(s)
Core Binding Factor beta Subunit/metabolism , Cytidine Deaminase/metabolism , Gene Products, vif/metabolism , HIV Infections/metabolism , HIV Infections/virology , HIV-1/physiology , vif Gene Products, Human Immunodeficiency Virus/metabolism , APOBEC-3G Deaminase , Affinity Labels , Animals , Cullin Proteins/metabolism , Gene Knockdown Techniques , Genetic Complementation Test , HEK293 Cells , Host-Pathogen Interactions , Humans , Jurkat Cells , Macaca mulatta/metabolism , Macaca mulatta/virology , Mass Spectrometry , Models, Biological , Protein Binding , Proteolysis , Simian Immunodeficiency Virus/metabolism , Ubiquitin-Protein Ligases/chemistry , Ubiquitin-Protein Ligases/metabolism , Ubiquitination , Virus Replication
16.
Nature ; 481(7381): 365-70, 2011 Dec 21.
Article in English | MEDLINE | ID: mdl-22190034

ABSTRACT

Human immunodeficiency virus (HIV) has a small genome and therefore relies heavily on the host cellular machinery to replicate. Identifying which host proteins and complexes come into physical contact with the viral proteins is crucial for a comprehensive understanding of how HIV rewires the host's cellular machinery during the course of infection. Here we report the use of affinity tagging and purification mass spectrometry to determine systematically the physical interactions of all 18 HIV-1 proteins and polyproteins with host proteins in two different human cell lines (HEK293 and Jurkat). Using a quantitative scoring system that we call MiST, we identified with high confidence 497 HIV-human protein-protein interactions involving 435 individual human proteins, with ∼40% of the interactions being identified in both cell types. We found that the host proteins hijacked by HIV, especially those found interacting in both cell types, are highly conserved across primates. We uncovered a number of host complexes targeted by viral proteins, including the finding that HIV protease cleaves eIF3d, a subunit of eukaryotic translation initiation factor 3. This host protein is one of eleven identified in this analysis that act to inhibit HIV replication. This data set facilitates a more comprehensive and detailed understanding of how the host machinery is manipulated during the course of HIV infection.


Subject(s)
HIV-1/chemistry , HIV-1/metabolism , Host-Pathogen Interactions , Human Immunodeficiency Virus Proteins/metabolism , Protein Interaction Mapping/methods , Protein Interaction Maps/physiology , Affinity Labels , Amino Acid Sequence , Conserved Sequence , Eukaryotic Initiation Factor-3/chemistry , Eukaryotic Initiation Factor-3/metabolism , HEK293 Cells , HIV Infections/metabolism , HIV Infections/virology , HIV Protease/metabolism , HIV-1/physiology , Human Immunodeficiency Virus Proteins/analysis , Human Immunodeficiency Virus Proteins/chemistry , Human Immunodeficiency Virus Proteins/isolation & purification , Humans , Immunoprecipitation , Jurkat Cells , Mass Spectrometry , Protein Binding , Reproducibility of Results , Virus Replication
17.
Nat Chem Biol ; 10(12): 1066-72, 2014 Dec.
Article in English | MEDLINE | ID: mdl-25344815

ABSTRACT

Chemical probes that form a covalent bond with a protein target often show enhanced selectivity, potency and utility for biological studies. Despite these advantages, protein-reactive compounds are usually avoided in high-throughput screening campaigns. Here we describe a general method (DOCKovalent) for screening large virtual libraries of electrophilic small molecules. We apply this method prospectively to discover reversible covalent fragments that target distinct protein nucleophiles, including the catalytic serine of AmpC ß-lactamase and noncatalytic cysteines in RSK2, MSK1 and JAK3 kinases. We identify submicromolar to low-nanomolar hits with high ligand efficiency, cellular activity and selectivity, including what are to our knowledge the first reported reversible covalent inhibitors of JAK3. Crystal structures of inhibitor complexes with AmpC and RSK2 confirm the docking predictions and guide further optimization. As covalent virtual screening may have broad utility for the rapid discovery of chemical probes, we have made the method freely available through an automated web server (http://covalent.docking.org/).


Subject(s)
Molecular Docking Simulation , Molecular Probes/chemistry , Protein Kinase Inhibitors/chemistry , Small Molecule Libraries/chemistry , beta-Lactamase Inhibitors/chemistry , Animals , Bacterial Proteins/antagonists & inhibitors , Bacterial Proteins/chemistry , Bacterial Proteins/genetics , COS Cells , Cysteine/chemistry , Cysteine/metabolism , Drug Discovery , Gram-Negative Bacteria/drug effects , Gram-Negative Bacteria/enzymology , Gram-Negative Bacteria/growth & development , Humans , Hydrophobic and Hydrophilic Interactions , Janus Kinase 3/antagonists & inhibitors , Janus Kinase 3/chemistry , Janus Kinase 3/genetics , Ligands , Molecular Probes/pharmacology , Protein Binding , Protein Kinase Inhibitors/pharmacology , Recombinant Proteins/chemistry , Recombinant Proteins/genetics , Ribosomal Protein S6 Kinases, 90-kDa/antagonists & inhibitors , Ribosomal Protein S6 Kinases, 90-kDa/chemistry , Ribosomal Protein S6 Kinases, 90-kDa/genetics , Serine/chemistry , Serine/metabolism , Small Molecule Libraries/pharmacology , beta-Lactamase Inhibitors/pharmacology , beta-Lactamases/chemistry , beta-Lactamases/genetics
18.
Mol Cell Proteomics ; 13(11): 2855-70, 2014 Nov.
Article in English | MEDLINE | ID: mdl-25073740

ABSTRACT

The TORC1 signaling pathway plays a major role in the control of cell growth and response to stress. Here we demonstrate that the SEA complex physically interacts with TORC1 and is an important regulator of its activity. During nitrogen starvation, deletions of SEA complex components lead to Tor1 kinase delocalization, defects in autophagy, and vacuolar fragmentation. TORC1 inactivation, via nitrogen deprivation or rapamycin treatment, changes cellular levels of SEA complex members. We used affinity purification and chemical cross-linking to generate the data for an integrative structure modeling approach, which produced a well-defined molecular architecture of the SEA complex and showed that the SEA complex comprises two regions that are structurally and functionally distinct. The SEA complex emerges as a platform that can coordinate both structural and enzymatic activities necessary for the effective functioning of the TORC1 pathway.


Subject(s)
Autophagy/physiology , Nuclear Pore Complex Proteins/metabolism , Saccharomyces cerevisiae Proteins/metabolism , Transcription Factors/metabolism , Gene Expression Regulation, Fungal , Mitochondria/metabolism , Nitrogen/metabolism , Nuclear Pore Complex Proteins/genetics , Nuclear Pore Complex Proteins/ultrastructure , Protein Structure, Tertiary , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae/metabolism , Signal Transduction
19.
PLoS Comput Biol ; 10(12): e1004016, 2014 Dec.
Article in English | MEDLINE | ID: mdl-25474254

ABSTRACT

Bacterial secondary metabolites are widely used as antibiotics, anticancer drugs, insecticides and food additives. Attempts to engineer their biosynthetic gene clusters (BGCs) to produce unnatural metabolites with improved properties are often frustrated by the unpredictability and complexity of the enzymes that synthesize these molecules, suggesting that genetic changes within BGCs are limited by specific constraints. Here, by performing a systematic computational analysis of BGC evolution, we derive evidence for three findings that shed light on the ways in which, despite these constraints, nature successfully invents new molecules: 1) BGCs for complex molecules often evolve through the successive merger of smaller sub-clusters, which function as independent evolutionary entities. 2) An important subset of polyketide synthases and nonribosomal peptide synthetases evolve by concerted evolution, which generates sets of sequence-homogenized domains that may hold promise for engineering efforts since they exhibit a high degree of functional interoperability, 3) Individual BGC families evolve in distinct ways, suggesting that design strategies should take into account family-specific functional constraints. These findings suggest novel strategies for using synthetic biology to rationally engineer biosynthetic pathways.


Subject(s)
Bioengineering/methods , Computational Biology/methods , Evolution, Molecular , Models, Genetic , Multigene Family/genetics
20.
Mol Cell Proteomics ; 11(4): M111.015313, 2012 Apr.
Article in English | MEDLINE | ID: mdl-22174317

ABSTRACT

The HIV-1 Rev protein plays a key role in the late phase of virus replication. It binds to the Rev Response Element found in underspliced HIV mRNAs, and drives their nuclear export by the CRM1 receptor pathway. Moreover, mounting evidence suggests that Rev has additional functions in viral replication. Here we employed proteomics and statistical analysis to identify candidate host cell factors that interact with Rev. For this we studied Rev complexes assembled in vitro with nuclear or cytosolic extracts under conditions emulating various intracellular environments of Rev. We ranked the protein-protein interactions by combining several statistical features derived from pairwise comparison of conditions in which the abundance of the binding partners changed. As a validation set, we selected the eight DEAD/H box proteins of the RNA helicase family from the top-ranking 5% of the proteins. These proteins all associate with ectopically expressed Rev in immunoprecipitates of cultured cells. From gene knockdown approaches, our work in combination with previous studies indicates that six of the eight DEAD/H proteins are linked to HIV production in our cell model. In a more detailed analysis of infected cells where either DDX3X, DDX5, DDX17, or DDX21 was silenced, we observed distinctive phenotypes for multiple replication features, variously involving virus particle release, the levels of unspliced and spliced HIV mRNAs, and the nuclear and cytoplasmic concentrations of these transcripts. Altogether the work indicates that our top-scoring data set is enriched in Rev-interacting proteins relevant to HIV replication. Our more detailed analysis of several Rev-interacting DEAD proteins suggests a complex set of functions for the helicases in regulation of HIV mRNAs. The strategy used here for identifying Rev interaction partners should prove effective for analyzing other viral and cellular proteins.


Subject(s)
DEAD-box RNA Helicases/metabolism , HIV-1/pathogenicity , Host-Pathogen Interactions , Virus Replication/physiology , rev Gene Products, Human Immunodeficiency Virus/metabolism , DEAD-box RNA Helicases/genetics , Escherichia coli/genetics , HIV Infections/metabolism , HIV-1/metabolism , HeLa Cells , Humans , Proteomics , RNA, Small Interfering/genetics
SELECTION OF CITATIONS
SEARCH DETAIL