Pesquisa | Portal Regional da BVS

1.

Ollikainen, Noah; Sen, Ranjan.

J Exp Med ; 221(2)2024 Feb 05.

Artigo em Inglês | MEDLINE | ID: mdl-38284995

RESUMO

In this issue of JEM, Allyn et al. (https://doi.org/10.1084/jem.20230985) provide mechanistic insights into the nuclear organization of the Tcrb locus that permits long-range genomic rearrangements.

Assuntos

Rearranjo Gênico da Cadeia alfa dos Receptores de Antígenos dos Linfócitos T , Receptores de Antígenos de Linfócitos T alfa-beta , Receptores de Antígenos de Linfócitos T alfa-beta/genética

2.

Simultaneous mapping of 3D structure and nascent RNAs argues against nuclear compartments that preclude transcription.

Goronzy, Isabel N; Quinodoz, Sofia A; Jachowicz, Joanna W; Ollikainen, Noah; Bhat, Prashant; Guttman, Mitchell.

Cell Rep ; 41(9): 111730, 2022 11 29.

Artigo em Inglês | MEDLINE | ID: mdl-36450242

RESUMO

Mammalian genomes are organized into three-dimensional DNA structures called A/B compartments that are associated with transcriptional activity/inactivity. However, whether these structures are simply correlated with gene expression or are permissive/impermissible to transcription has remained largely unknown because we lack methods to measure DNA organization and transcription simultaneously. Recently, we developed RNA & DNA (RD)-SPRITE, which enables genome-wide measurements of the spatial organization of RNA and DNA. Here we show that RD-SPRITE measures genomic structure surrounding nascent pre-mRNAs and maps their spatial contacts. We find that transcription occurs within B compartments-with multiple active genes simultaneously colocalizing within the same B compartment-and at genes proximal to nucleoli. These results suggest that localization near or within nuclear structures thought to be inactive does not preclude transcription and that active transcription can occur throughout the nucleus. In general, we anticipate RD-SPRITE will be a powerful tool for exploring relationships between genome structure and transcription.

Assuntos

Núcleo Celular , RNA , Animais , RNA/genética , Núcleo Celular/genética , Nucléolo Celular , Precursores de RNA , Genômica , Mamíferos

3.

SPRITE: a genome-wide method for mapping higher-order 3D interactions in the nucleus using combinatorial split-and-pool barcoding.

Quinodoz, Sofia A; Bhat, Prashant; Chovanec, Peter; Jachowicz, Joanna W; Ollikainen, Noah; Detmar, Elizabeth; Soehalim, Elizabeth; Guttman, Mitchell.

Nat Protoc ; 17(1): 36-75, 2022 01.

Artigo em Inglês | MEDLINE | ID: mdl-35013617

RESUMO

A fundamental question in gene regulation is how cell-type-specific gene expression is influenced by the packaging of DNA within the nucleus of each cell. We recently developed Split-Pool Recognition of Interactions by Tag Extension (SPRITE), which enables mapping of higher-order interactions within the nucleus. SPRITE works by cross-linking interacting DNA, RNA and protein molecules and then mapping DNA-DNA spatial arrangements through an iterative split-and-pool barcoding method. All DNA molecules within a cross-linked complex are barcoded by repeatedly splitting complexes across a 96-well plate, ligating molecules with a unique tag sequence, and pooling all complexes into a single well before repeating the tagging. Because all molecules in a cross-linked complex are covalently attached, they will sort together throughout each round of split-and-pool and will obtain the same series of SPRITE tags, which we refer to as a barcode. The DNA fragments and their associated barcodes are sequenced, and all reads sharing identical barcodes are matched to reconstruct interactions. SPRITE accurately maps pairwise DNA interactions within the nucleus and measures higher-order spatial contacts occurring among up to thousands of simultaneously interacting molecules. Here, we provide a detailed protocol for the experimental steps of SPRITE, including a video ( https://youtu.be/6SdWkBxQGlg ). Furthermore, we provide an automated computational pipeline available on GitHub that allows experimenters to seamlessly generate SPRITE interaction matrices starting with raw fastq files. The protocol takes ~5 d from cell cross-linking to high-throughput sequencing for the experimental steps and 1 d for data processing.

Assuntos

Núcleo Celular , Código de Barras de DNA Taxonômico/métodos , DNA , Genômica/métodos , Software , Animais , Linhagem Celular , Núcleo Celular/genética , Núcleo Celular/fisiologia , DNA/genética , DNA/metabolismo , Feminino , Técnicas Genéticas , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Camundongos

4.

Single-cell measurement of higher-order 3D genome organization with scSPRITE.

Arrastia, Mary V; Jachowicz, Joanna W; Ollikainen, Noah; Curtis, Matthew S; Lai, Charlotte; Quinodoz, Sofia A; Selck, David A; Ismagilov, Rustem F; Guttman, Mitchell.

Nat Biotechnol ; 40(1): 64-73, 2022 01.

Artigo em Inglês | MEDLINE | ID: mdl-34426703

RESUMO

Although three-dimensional (3D) genome organization is central to many aspects of nuclear function, it has been difficult to measure at the single-cell level. To address this, we developed 'single-cell split-pool recognition of interactions by tag extension' (scSPRITE). scSPRITE uses split-and-pool barcoding to tag DNA fragments in the same nucleus and their 3D spatial arrangement. Because scSPRITE measures multiway DNA contacts, it generates higher-resolution maps within an individual cell than can be achieved by proximity ligation. We applied scSPRITE to thousands of mouse embryonic stem cells and detected known genome structures, including chromosome territories, active and inactive compartments, and topologically associating domains (TADs) as well as long-range inter-chromosomal structures organized around various nuclear bodies. We observe that these structures exhibit different levels of heterogeneity across the population, with TADs representing dynamic units of genome organization across cells. We expect that scSPRITE will be a critical tool for studying genome structure within heterogeneous populations.

Assuntos

Núcleo Celular , Genoma , Animais , Núcleo Celular/genética , Cromatina , DNA/genética , Genoma/genética , Camundongos , Células-Tronco Embrionárias Murinas

5.

RNA promotes the formation of spatial compartments in the nucleus.

Quinodoz, Sofia A; Jachowicz, Joanna W; Bhat, Prashant; Ollikainen, Noah; Banerjee, Abhik K; Goronzy, Isabel N; Blanco, Mario R; Chovanec, Peter; Chow, Amy; Markaki, Yolanda; Thai, Jasmine; Plath, Kathrin; Guttman, Mitchell.

Cell ; 184(23): 5775-5790.e30, 2021 11 11.

Artigo em Inglês | MEDLINE | ID: mdl-34739832

RESUMO

RNA, DNA, and protein molecules are highly organized within three-dimensional (3D) structures in the nucleus. Although RNA has been proposed to play a role in nuclear organization, exploring this has been challenging because existing methods cannot measure higher-order RNA and DNA contacts within 3D structures. To address this, we developed RNA & DNA SPRITE (RD-SPRITE) to comprehensively map the spatial organization of RNA and DNA. These maps reveal higher-order RNA-chromatin structures associated with three major classes of nuclear function: RNA processing, heterochromatin assembly, and gene regulation. These data demonstrate that hundreds of ncRNAs form high-concentration territories throughout the nucleus, that specific RNAs are required to recruit various regulators into these territories, and that these RNAs can shape long-range DNA contacts, heterochromatin assembly, and gene expression. These results demonstrate a mechanism where RNAs form high-concentration territories, bind to diffusible regulators, and guide them into compartments to regulate essential nuclear functions.

Assuntos

Núcleo Celular/metabolismo , RNA/metabolismo , Animais , Núcleo Celular/efeitos dos fármacos , Homólogo 5 da Proteína Cromobox/metabolismo , Cromossomos/metabolismo , DNA/metabolismo , DNA Satélite/metabolismo , Proteínas de Ligação a DNA/metabolismo , Dactinomicina/farmacologia , Feminino , Genoma , Células HEK293 , Heterocromatina/metabolismo , Humanos , Camundongos , Modelos Biológicos , Família Multigênica , RNA Polimerase II/metabolismo , Processamento Pós-Transcricional do RNA/efeitos dos fármacos , Processamento Pós-Transcricional do RNA/genética , Splicing de RNA/genética , RNA Longo não Codificante/genética , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , RNA Ribossômico/genética , Proteínas de Ligação a RNA/metabolismo , Transcrição Gênica/efeitos dos fármacos

6.

Systems-level effects of allosteric perturbations to a model molecular switch.

Perica, Tina; Mathy, Christopher J P; Xu, Jiewei; Jang, Gwendolyn Μ; Zhang, Yang; Kaake, Robyn; Ollikainen, Noah; Braberg, Hannes; Swaney, Danielle L; Lambright, David G; Kelly, Mark J S; Krogan, Nevan J; Kortemme, Tanja.

Nature ; 599(7883): 152-157, 2021 11.

Artigo em Inglês | MEDLINE | ID: mdl-34646016

RESUMO

Molecular switch proteins whose cycling between states is controlled by opposing regulators1,2 are central to biological signal transduction. As switch proteins function within highly connected interaction networks3, the fundamental question arises of how functional specificity is achieved when different processes share common regulators. Here we show that functional specificity of the small GTPase switch protein Gsp1 in Saccharomyces cerevisiae (the homologue of the human protein RAN)4 is linked to differential sensitivity of biological processes to different kinetics of the Gsp1 (RAN) switch cycle. We make 55 targeted point mutations to individual protein interaction interfaces of Gsp1 (RAN) and show through quantitative genetic5 and physical interaction mapping that Gsp1 (RAN) interface perturbations have widespread cellular consequences. Contrary to expectation, the cellular effects of the interface mutations group by their biophysical effects on kinetic parameters of the GTPase switch cycle and not by the targeted interfaces. Instead, we show that interface mutations allosterically tune the GTPase cycle kinetics. These results suggest a model in which protein partner binding, or post-translational modifications at distal sites, could act as allosteric regulators of GTPase switching. Similar mechanisms may underlie regulation by other GTPases, and other biological switches. Furthermore, our integrative platform to determine the quantitative consequences of molecular perturbations may help to explain the effects of disease mutations that target central molecular switches.

Assuntos

Regulação Alostérica/genética , Proteínas Monoméricas de Ligação ao GTP/genética , Proteínas Monoméricas de Ligação ao GTP/metabolismo , Proteínas Nucleares/genética , Proteínas Nucleares/metabolismo , Mutação Puntual , Proteínas de Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/metabolismo , Saccharomyces cerevisiae , Sítios de Ligação/genética , Domínio Catalítico/genética , Proteínas Ativadoras de GTPase/metabolismo , Fatores de Troca do Nucleotídeo Guanina/metabolismo , Guanosina Trifosfato/metabolismo , Cinética , Ligação Proteica/genética , Saccharomyces cerevisiae/enzimologia , Saccharomyces cerevisiae/genética

7.

Integrated spatial genomics reveals global architecture of single nuclei.

Takei, Yodai; Yun, Jina; Zheng, Shiwei; Ollikainen, Noah; Pierson, Nico; White, Jonathan; Shah, Sheel; Thomassie, Julian; Suo, Shengbao; Eng, Chee-Huat Linus; Guttman, Mitchell; Yuan, Guo-Cheng; Cai, Long.

Nature ; 590(7845): 344-350, 2021 02.

Artigo em Inglês | MEDLINE | ID: mdl-33505024

RESUMO

Identifying the relationships between chromosome structures, nuclear bodies, chromatin states and gene expression is an overarching goal of nuclear-organization studies1-4. Because individual cells appear to be highly variable at all these levels5, it is essential to map different modalities in the same cells. Here we report the imaging of 3,660 chromosomal loci in single mouse embryonic stem (ES) cells using DNA seqFISH+, along with 17 chromatin marks and subnuclear structures by sequential immunofluorescence and the expression profile of 70 RNAs. Many loci were invariably associated with immunofluorescence marks in single mouse ES cells. These loci form 'fixed points' in the nuclear organizations of single cells and often appear on the surfaces of nuclear bodies and zones defined by combinatorial chromatin marks. Furthermore, highly expressed genes appear to be pre-positioned to active nuclear zones, independent of bursting dynamics in single cells. Our analysis also uncovered several distinct mouse ES cell subpopulations with characteristic combinatorial chromatin states. Using clonal analysis, we show that the global levels of some chromatin marks, such as H3 trimethylation at lysine 27 (H3K27me3) and macroH2A1 (mH2A1), are heritable over at least 3-4 generations, whereas other marks fluctuate on a faster time scale. This seqFISH+-based spatial multimodal approach can be used to explore nuclear organization and cell states in diverse biological systems.

Assuntos

Compartimento Celular/genética , Núcleo Celular/genética , Genômica/métodos , Células-Tronco Embrionárias Murinas/citologia , Análise de Célula Única/métodos , Transcriptoma/genética , Animais , Linhagem Celular , Cromatina/genética , Cromatina/metabolismo , Cromossomos de Mamíferos/genética , Células Clonais/citologia , Imunofluorescência , Marcadores Genéticos , Histonas/metabolismo , Lisina/metabolismo , Masculino , Camundongos , Fatores de Tempo

8.

SARS-CoV-2 Disrupts Splicing, Translation, and Protein Trafficking to Suppress Host Defenses.

Banerjee, Abhik K; Blanco, Mario R; Bruce, Emily A; Honson, Drew D; Chen, Linlin M; Chow, Amy; Bhat, Prashant; Ollikainen, Noah; Quinodoz, Sofia A; Loney, Colin; Thai, Jasmine; Miller, Zachary D; Lin, Aaron E; Schmidt, Madaline M; Stewart, Douglas G; Goldfarb, Daniel; De Lorenzo, Giuditta; Rihn, Suzannah J; Voorhees, Rebecca M; Botten, Jason W; Majumdar, Devdoot; Guttman, Mitchell.

Cell ; 183(5): 1325-1339.e21, 2020 11 25.

Artigo em Inglês | MEDLINE | ID: mdl-33080218

RESUMO

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is a recently identified coronavirus that causes the respiratory disease known as coronavirus disease 2019 (COVID-19). Despite the urgent need, we still do not fully understand the molecular basis of SARS-CoV-2 pathogenesis. Here, we comprehensively define the interactions between SARS-CoV-2 proteins and human RNAs. NSP16 binds to the mRNA recognition domains of the U1 and U2 splicing RNAs and acts to suppress global mRNA splicing upon SARS-CoV-2 infection. NSP1 binds to 18S ribosomal RNA in the mRNA entry channel of the ribosome and leads to global inhibition of mRNA translation upon infection. Finally, NSP8 and NSP9 bind to the 7SL RNA in the signal recognition particle and interfere with protein trafficking to the cell membrane upon infection. Disruption of each of these essential cellular functions acts to suppress the interferon response to viral infection. Our results uncover a multipronged strategy utilized by SARS-CoV-2 to antagonize essential cellular processes to suppress host defenses.

Assuntos

COVID-19/metabolismo , Interações Hospedeiro-Patógeno , Biossíntese de Proteínas , Splicing de RNA , SARS-CoV-2/metabolismo , Proteínas não Estruturais Virais/metabolismo , Células A549 , Animais , COVID-19/virologia , Chlorocebus aethiops , Células HEK293 , Humanos , Interferons/metabolismo , Transporte Proteico , RNA Mensageiro/metabolismo , RNA Ribossômico 18S/metabolismo , RNA Citoplasmático Pequeno/química , RNA Citoplasmático Pequeno/metabolismo , Partícula de Reconhecimento de Sinal/química , Partícula de Reconhecimento de Sinal/metabolismo , Células Vero , Proteínas não Estruturais Virais/química

9.

Computational design of a modular protein sense-response system.

Glasgow, Anum A; Huang, Yao-Ming; Mandell, Daniel J; Thompson, Michael; Ritterson, Ryan; Loshbaugh, Amanda L; Pellegrino, Jenna; Krivacic, Cody; Pache, Roland A; Barlow, Kyle A; Ollikainen, Noah; Jeon, Deborah; Kelly, Mark J S; Fraser, James S; Kortemme, Tanja.

Science ; 366(6468): 1024-1028, 2019 11 22.

Artigo em Inglês | MEDLINE | ID: mdl-31754004

RESUMO

Sensing and responding to signals is a fundamental ability of living systems, but despite substantial progress in the computational design of new protein structures, there is no general approach for engineering arbitrary new protein sensors. Here, we describe a generalizable computational strategy for designing sensor-actuator proteins by building binding sites de novo into heterodimeric protein-protein interfaces and coupling ligand sensing to modular actuation through split reporters. Using this approach, we designed protein sensors that respond to farnesyl pyrophosphate, a metabolic intermediate in the production of valuable compounds. The sensors are functional in vitro and in cells, and the crystal structure of the engineered binding site closely matches the design model. Our computational design strategy opens broad avenues to link biological outputs to new signals.

Assuntos

Fosfatos de Poli-Isoprenil/metabolismo , Engenharia de Proteínas , Multimerização Proteica , Proteínas/química , Sesquiterpenos/metabolismo , Repetição de Anquirina , Sítios de Ligação , Técnicas Biossensoriais , Biologia Computacional , Simulação por Computador , Cristalografia por Raios X , Ligantes , Proteínas Ligantes de Maltose/química , Proteínas Ligantes de Maltose/metabolismo , Modelos Moleculares , Proteínas/genética , Proteínas/metabolismo

10.

Higher-Order Inter-chromosomal Hubs Shape 3D Genome Organization in the Nucleus.

Quinodoz, Sofia A; Ollikainen, Noah; Tabak, Barbara; Palla, Ali; Schmidt, Jan Marten; Detmar, Elizabeth; Lai, Mason M; Shishkin, Alexander A; Bhat, Prashant; Takei, Yodai; Trinh, Vickie; Aznauryan, Erik; Russell, Pamela; Cheng, Christine; Jovanovic, Marko; Chow, Amy; Cai, Long; McDonel, Patrick; Garber, Manuel; Guttman, Mitchell.

Cell ; 174(3): 744-757.e24, 2018 07 26.

Artigo em Inglês | MEDLINE | ID: mdl-29887377

RESUMO

Eukaryotic genomes are packaged into a 3-dimensional structure in the nucleus. Current methods for studying genome-wide structure are based on proximity ligation. However, this approach can fail to detect known structures, such as interactions with nuclear bodies, because these DNA regions can be too far apart to directly ligate. Accordingly, our overall understanding of genome organization remains incomplete. Here, we develop split-pool recognition of interactions by tag extension (SPRITE), a method that enables genome-wide detection of higher-order interactions within the nucleus. Using SPRITE, we recapitulate known structures identified by proximity ligation and identify additional interactions occurring across larger distances, including two hubs of inter-chromosomal interactions that are arranged around the nucleolus and nuclear speckles. We show that a substantial fraction of the genome exhibits preferential organization relative to these nuclear bodies. Our results generate a global model whereby nuclear bodies act as inter-chromosomal hubs that shape the overall packaging of DNA in the nucleus.

Assuntos

Núcleo Celular/ultraestrutura , Mapeamento Cromossômico/métodos , Cromossomos/fisiologia , Nucléolo Celular , Núcleo Celular/fisiologia , Cromossomos/genética , DNA/fisiologia , Eucariotos , Genoma/genética , Genoma/fisiologia , Humanos , Relação Estrutura-Atividade

11.

Flexible Backbone Methods for Predicting and Designing Peptide Specificity.

Ollikainen, Noah.

Methods Mol Biol ; 1561: 173-187, 2017.

Artigo em Inglês | MEDLINE | ID: mdl-28236238

RESUMO

Protein-protein interactions play critical roles in essentially every cellular process. These interactions are often mediated by protein interaction domains that enable proteins to recognize their interaction partners, often by binding to short peptide motifs. For example, PDZ domains, which are among the most common protein interaction domains in the human proteome, recognize specific linear peptide sequences that are often at the C-terminus of other proteins. Determining the set of peptide sequences that a protein interaction domain binds, or it's "peptide specificity," is crucial for understanding its cellular function, and predicting how mutations impact peptide specificity is important for elucidating the mechanisms underlying human diseases. Moreover, engineering novel cellular functions for synthetic biology applications, such as the biosynthesis of biofuels or drugs, requires the design of protein interaction specificity to avoid crosstalk with native metabolic and signaling pathways. The ability to accurately predict and design protein-peptide interaction specificity is therefore critical for understanding and engineering biological function. One approach that has recently been employed toward accomplishing this goal is computational protein design. This chapter provides an overview of recent methodological advances in computational protein design and highlights examples of how these advances can enable increased accuracy in predicting and designing peptide specificity.

Assuntos

Biologia Computacional/métodos , Desenho de Fármacos , Fragmentos de Peptídeos/química , Fragmentos de Peptídeos/metabolismo , Mapeamento de Interação de Proteínas/métodos , Proteínas/metabolismo , Humanos , Modelos Moleculares , Domínios PDZ , Ligação Proteica , Proteínas/química , Especificidade por Substrato

12.

Long non-coding RNAs: spatial amplifiers that control nuclear structure and gene expression.

Engreitz, Jesse M; Ollikainen, Noah; Guttman, Mitchell.

Nat Rev Mol Cell Biol ; 17(12): 756-770, 2016 12.

Artigo em Inglês | MEDLINE | ID: mdl-27780979

RESUMO

Over the past decade, it has become clear that mammalian genomes encode thousands of long non-coding RNAs (lncRNAs), many of which are now implicated in diverse biological processes. Recent work studying the molecular mechanisms of several key examples - including Xist, which orchestrates X chromosome inactivation - has provided new insights into how lncRNAs can control cellular functions by acting in the nucleus. Here we discuss emerging mechanistic insights into how lncRNAs can regulate gene expression by coordinating regulatory proteins, localizing to target loci and shaping three-dimensional (3D) nuclear organization. We explore these principles to highlight biological challenges in gene regulation, in which lncRNAs are well-suited to perform roles that cannot be carried out by DNA elements or protein regulators alone, such as acting as spatial amplifiers of regulatory signals in the nucleus.

Assuntos

Núcleo Celular/ultraestrutura , Regulação da Expressão Gênica , RNA Longo não Codificante/fisiologia , Animais , Núcleo Celular/genética , Expressão Gênica , Humanos , Transporte de RNA

13.

Xist recruits the X chromosome to the nuclear lamina to enable chromosome-wide silencing.

Chen, Chun-Kan; Blanco, Mario; Jackson, Constanza; Aznauryan, Erik; Ollikainen, Noah; Surka, Christine; Chow, Amy; Cerase, Andrea; McDonel, Patrick; Guttman, Mitchell.

Science ; 354(6311): 468-472, 2016 10 28.

Artigo em Inglês | MEDLINE | ID: mdl-27492478

RESUMO

The Xist long noncoding RNA orchestrates X chromosome inactivation, a process that entails chromosome-wide silencing and remodeling of the three-dimensional (3D) structure of the X chromosome. Yet, it remains unclear whether these changes in nuclear structure are mediated by Xist and whether they are required for silencing. Here, we show that Xist directly interacts with the Lamin B receptor, an integral component of the nuclear lamina, and that this interaction is required for Xist-mediated silencing by recruiting the inactive X to the nuclear lamina and by doing so enables Xist to spread to actively transcribed genes across the X. Our results demonstrate that lamina recruitment changes the 3D structure of DNA, enabling Xist and its silencing proteins to spread across the X to silence transcription.

Assuntos

Inativação Gênica , Lâmina Nuclear/metabolismo , RNA Longo não Codificante/metabolismo , Receptores Citoplasmáticos e Nucleares/metabolismo , Inativação do Cromossomo X/genética , Cromossomo X/metabolismo , Animais , Linhagem Celular , Feminino , Camundongos , RNA Longo não Codificante/genética , Transcrição Gênica , Ativação Transcricional , Receptor de Lamina B

14.

A Web Resource for Standardized Benchmark Datasets, Metrics, and Rosetta Protocols for Macromolecular Modeling and Design.

Ó Conchúir, Shane; Barlow, Kyle A; Pache, Roland A; Ollikainen, Noah; Kundert, Kale; O'Meara, Matthew J; Smith, Colin A; Kortemme, Tanja.

PLoS One ; 10(9): e0130433, 2015.

Artigo em Inglês | MEDLINE | ID: mdl-26335248

RESUMO

The development and validation of computational macromolecular modeling and design methods depend on suitable benchmark datasets and informative metrics for comparing protocols. In addition, if a method is intended to be adopted broadly in diverse biological applications, there needs to be information on appropriate parameters for each protocol, as well as metrics describing the expected accuracy compared to experimental data. In certain disciplines, there exist established benchmarks and public resources where experts in a particular methodology are encouraged to supply their most efficient implementation of each particular benchmark. We aim to provide such a resource for protocols in macromolecular modeling and design. We present a freely accessible web resource (https://kortemmelab.ucsf.edu/benchmarks) to guide the development of protocols for protein modeling and design. The site provides benchmark datasets and metrics to compare the performance of a variety of modeling protocols using different computational sampling methods and energy functions, providing a "best practice" set of parameters for each method. Each benchmark has an associated downloadable benchmark capture archive containing the input files, analysis scripts, and tutorials for running the benchmark. The captures may be run with any suitable modeling method; we supply command lines for running the benchmarks using the Rosetta software suite. We have compiled initial benchmarks for the resource spanning three key areas: prediction of energetic effects of mutations, protein design, and protein structure prediction, each with associated state-of-the-art modeling protocols. With the help of the wider macromolecular modeling community, we hope to expand the variety of benchmarks included on the website and continue to evaluate new iterations of current methods as they become available.

Assuntos

Benchmarking , Conjuntos de Dados como Assunto , Internet , Modelos Moleculares , Proteínas/química , Aminoácidos/química , Evolução Química , Mutação , Proteínas/genética , Termodinâmica

15.

Coupling Protein Side-Chain and Backbone Flexibility Improves the Re-design of Protein-Ligand Specificity.

Ollikainen, Noah; de Jong, René M; Kortemme, Tanja.

PLoS Comput Biol ; 11(9): e1004335, 2015.

Artigo em Inglês | MEDLINE | ID: mdl-26397464

RESUMO

Interactions between small molecules and proteins play critical roles in regulating and facilitating diverse biological functions, yet our ability to accurately re-engineer the specificity of these interactions using computational approaches has been limited. One main difficulty, in addition to inaccuracies in energy functions, is the exquisite sensitivity of protein-ligand interactions to subtle conformational changes, coupled with the computational problem of sampling the large conformational search space of degrees of freedom of ligands, amino acid side chains, and the protein backbone. Here, we describe two benchmarks for evaluating the accuracy of computational approaches for re-engineering protein-ligand interactions: (i) prediction of enzyme specificity altering mutations and (ii) prediction of sequence tolerance in ligand binding sites. After finding that current state-of-the-art "fixed backbone" design methods perform poorly on these tests, we develop a new "coupled moves" design method in the program Rosetta that couples changes to protein sequence with alterations in both protein side-chain and protein backbone conformations, and allows for changes in ligand rigid-body and torsion degrees of freedom. We show significantly increased accuracy in both predicting ligand specificity altering mutations and binding site sequences. These methodological improvements should be useful for many applications of protein-ligand design. The approach also provides insights into the role of subtle conformational adjustments that enable functional changes not only in engineering applications but also in natural protein evolution.

Assuntos

Sítios de Ligação , Biologia Computacional/métodos , Conformação Proteica , Proteínas/química , Proteínas/metabolismo , Sequência de Aminoácidos , Ligantes , Modelos Moleculares , Dados de Sequência Molecular , Maleabilidade , Especificidade por Substrato

16.

Quantification of the transferability of a designed protein specificity switch reveals extensive epistasis in molecular recognition.

Melero, Cristina; Ollikainen, Noah; Harwood, Ian; Karpiak, Joel; Kortemme, Tanja.

Proc Natl Acad Sci U S A ; 111(43): 15426-31, 2014 Oct 28.

Artigo em Inglês | MEDLINE | ID: mdl-25313039

RESUMO

Reengineering protein-protein recognition is an important route to dissecting and controlling complex interaction networks. Experimental approaches have used the strategy of "second-site suppressors," where a functional interaction is inferred between two proteins if a mutation in one protein can be compensated by a mutation in the second. Mimicking this strategy, computational design has been applied successfully to change protein recognition specificity by predicting such sets of compensatory mutations in protein-protein interfaces. To extend this approach, it would be advantageous to be able to "transplant" existing engineered and experimentally validated specificity changes to other homologous protein-protein complexes. Here, we test this strategy by designing a pair of mutations that modulates peptide recognition specificity in the Syntrophin PDZ domain, confirming the designed interaction biochemically and structurally, and then transplanting the mutations into the context of five related PDZ domain-peptide complexes. We find a wide range of energetic effects of identical mutations in structurally similar positions, revealing a dramatic context dependence (epistasis) of designed mutations in homologous protein-protein interactions. To better understand the structural basis of this context dependence, we apply a structure-based computational model that recapitulates these energetic effects and we use this model to make and validate forward predictions. Although the context dependence of these mutations is captured by computational predictions, our results both highlight the considerable difficulties in designing protein-protein interactions and provide challenging benchmark cases for the development of improved protein modeling and design methods that accurately account for the context.

Assuntos

Proteínas Associadas à Distrofina/química , Proteínas Associadas à Distrofina/genética , Engenharia de Proteínas , Epistasia Genética , Modelos Moleculares , Mutação/genética , Óxido Nítrico Sintase Tipo I/química , Óxido Nítrico Sintase Tipo I/metabolismo , Domínios PDZ , Termodinâmica

17.

Computational protein design quantifies structural constraints on amino acid covariation.

Ollikainen, Noah; Kortemme, Tanja.

PLoS Comput Biol ; 9(11): e1003313, 2013.

Artigo em Inglês | MEDLINE | ID: mdl-24244128

RESUMO

Amino acid covariation, where the identities of amino acids at different sequence positions are correlated, is a hallmark of naturally occurring proteins. This covariation can arise from multiple factors, including selective pressures for maintaining protein structure, requirements imposed by a specific function, or from phylogenetic sampling bias. Here we employed flexible backbone computational protein design to quantify the extent to which protein structure has constrained amino acid covariation for 40 diverse protein domains. We find significant similarities between the amino acid covariation in alignments of natural protein sequences and sequences optimized for their structures by computational protein design methods. These results indicate that the structural constraints imposed by protein architecture play a dominant role in shaping amino acid covariation and that computational protein design methods can capture these effects. We also find that the similarity between natural and designed covariation is sensitive to the magnitude and mechanism of backbone flexibility used in computational protein design. Our results thus highlight the necessity of including backbone flexibility to correctly model precise details of correlated amino acid changes and give insights into the pressures underlying these correlations.

Assuntos

Aminoácidos/química , Biologia Computacional/métodos , Estrutura Terciária de Proteína , Proteínas/química , Sequência de Aminoácidos , Evolução Molecular , Modelos Moleculares , Dados de Sequência Molecular , Proteínas/metabolismo

18.

Amino-acid site variability among natural and designed proteins.

Jackson, Eleisha L; Ollikainen, Noah; Covert, Arthur W; Kortemme, Tanja; Wilke, Claus O.

PeerJ ; 1: e211, 2013.

Artigo em Inglês | MEDLINE | ID: mdl-24255821

RESUMO

Computational protein design attempts to create protein sequences that fold stably into pre-specified structures. Here we compare alignments of designed proteins to alignments of natural proteins and assess how closely designed sequences recapitulate patterns of sequence variation found in natural protein sequences. We design proteins using RosettaDesign, and we evaluate both fixed-backbone designs and variable-backbone designs with different amounts of backbone flexibility. We find that proteins designed with a fixed backbone tend to underestimate the amount of site variability observed in natural proteins while proteins designed with an intermediate amount of backbone flexibility result in more realistic site variability. Further, the correlation between solvent exposure and site variability in designed proteins is lower than that in natural proteins. This finding suggests that site variability is too uniform across different solvent exposure states (i.e., buried residues are too variable or exposed residues too conserved). When comparing the amino acid frequencies in the designed proteins with those in natural proteins we find that in the designed proteins hydrophobic residues are underrepresented in the core. From these results we conclude that intermediate backbone flexibility during design results in more accurate protein design and that either scoring functions or backbone sampling methods require further improvement to accurately replicate structural constraints on site variability.

19.

Flexible backbone sampling methods to model and design protein alternative conformations.

Ollikainen, Noah; Smith, Colin A; Fraser, James S; Kortemme, Tanja.

Methods Enzymol ; 523: 61-85, 2013.

Artigo em Inglês | MEDLINE | ID: mdl-23422426

RESUMO

Sampling alternative conformations is key to understanding how proteins work and engineering them for new functions. However, accurately characterizing and modeling protein conformational ensembles remain experimentally and computationally challenging. These challenges must be met before protein conformational heterogeneity can be exploited in protein engineering and design. Here, as a stepping stone, we describe methods to detect alternative conformations in proteins and strategies to model these near-native conformational changes based on backrub-type Monte Carlo moves in Rosetta. We illustrate how Rosetta simulations that apply backrub moves improve modeling of point mutant side-chain conformations, native side-chain conformational heterogeneity, functional conformational changes, tolerated sequence space, protein interaction specificity, and amino acid covariation across protein-protein interfaces. We include relevant Rosetta command lines and RosettaScripts to encourage the application of these types of simulations to other systems. Our work highlights that critical scoring and sampling improvements will be necessary to approximate conformational landscapes. Challenges for the future development of these methods include modeling conformational changes that propagate away from designed mutation sites and modulating backbone flexibility to predictively design functionally important conformational heterogeneity.

Assuntos

Biologia Computacional/métodos , Proteínas/química , Método de Monte Carlo , Conformação Proteica

20.

Widespread protein aggregation as an inherent part of aging in C. elegans.

David, Della C; Ollikainen, Noah; Trinidad, Jonathan C; Cary, Michael P; Burlingame, Alma L; Kenyon, Cynthia.

PLoS Biol ; 8(8): e1000450, 2010 Aug 10.

Artigo em Inglês | MEDLINE | ID: mdl-20711477

RESUMO

Aberrant protein aggregation is a hallmark of many age-related diseases, yet little is known about whether proteins aggregate with age in a non-disease setting. Using a systematic proteomics approach, we identified several hundred proteins that become more insoluble with age in the multicellular organism Caenorhabditis elegans. These proteins are predicted to be significantly enriched in beta-sheets, which promote disease protein aggregation. Strikingly, these insoluble proteins are highly over-represented in aggregates found in human neurodegeneration. We examined several of these proteins in vivo and confirmed their propensity to aggregate with age. Different proteins aggregated in different tissues and cellular compartments. Protein insolubility and aggregation were significantly delayed or even halted by reduced insulin/IGF-1-signaling, which also slows aging. We found a significant overlap between proteins that become insoluble and proteins that influence lifespan and/or polyglutamine-repeat aggregation. Moreover, overexpressing one aggregating protein enhanced polyglutamine-repeat pathology. Together our findings indicate that widespread protein insolubility and aggregation is an inherent part of aging and that it may influence both lifespan and neurodegenerative disease.

Assuntos

Envelhecimento/metabolismo , Proteínas de Caenorhabditis elegans/metabolismo , Caenorhabditis elegans/fisiologia , Doença de Huntington/fisiopatologia , Envelhecimento/fisiologia , Animais , Caenorhabditis elegans/metabolismo , Proteínas de Caenorhabditis elegans/genética , Modelos Animais de Doenças , Humanos , Doença de Huntington/metabolismo , Doenças Neurodegenerativas/metabolismo , Doenças Neurodegenerativas/fisiopatologia , Proteínas/genética , Proteínas/metabolismo , Proteômica

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA