Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 14 de 14
Filter
Add more filters










Publication year range
1.
bioRxiv ; 2024 Jun 03.
Article in English | MEDLINE | ID: mdl-38895487

ABSTRACT

Intrinsically disordered regions (IDRs) are critical for a wide variety of cellular functions, many of which involve interactions with partner proteins. Molecular recognition is typically considered through the lens of sequence-specific binding events. However, a growing body of work has shown that IDRs often interact with partners in a manner that does not depend on the precise order of the amino acid order, instead driven by complementary chemical interactions leading to disordered bound-state complexes. Despite this emerging paradigm, we lack tools to describe, quantify, predict, and interpret these types of structurally heterogeneous interactions from the underlying amino acid sequences. Here, we repurpose the chemical physics developed originally for molecular simulations to develop an approach for predicting intermolecular interactions between IDRs and partner proteins. Our approach enables the direct prediction of phase diagrams, the identification of chemically-specific interaction hotspots on IDRs, and a route to develop and test mechanistic hypotheses regarding IDR function in the context of molecular recognition. We use our approach to examine a range of systems and questions to highlight its versatility and applicability.

2.
Nat Commun ; 15(1): 3523, 2024 Apr 25.
Article in English | MEDLINE | ID: mdl-38664421

ABSTRACT

Organismal physiology is widely regulated by the molecular circadian clock, a feedback loop composed of protein complexes whose members are enriched in intrinsically disordered regions. These regions can mediate protein-protein interactions via SLiMs, but the contribution of these disordered regions to clock protein interactions had not been elucidated. To determine the functionality of these disordered regions, we applied a synthetic peptide microarray approach to the disordered clock protein FRQ in Neurospora crassa. We identified residues required for FRQ's interaction with its partner protein FRH, the mutation of which demonstrated FRH is necessary for persistent clock oscillations but not repression of transcriptional activity. Additionally, the microarray demonstrated an enrichment of FRH binding to FRQ peptides with a net positive charge. We found that positively charged residues occurred in significant "blocks" within the amino acid sequence of FRQ and that ablation of one of these blocks affected both core clock timing and physiological clock output. Finally, we found positive charge clusters were a commonly shared molecular feature in repressive circadian clock proteins. Overall, our study suggests a mechanistic purpose for positive charge blocks and yielded insights into repressive arm protein roles in clock function.


Subject(s)
Circadian Clocks , Fungal Proteins , Neurospora crassa , Neurospora crassa/genetics , Neurospora crassa/metabolism , Circadian Clocks/genetics , Circadian Clocks/physiology , Fungal Proteins/metabolism , Fungal Proteins/genetics , Fungal Proteins/chemistry , Protein Binding , Circadian Rhythm/physiology , Circadian Rhythm/genetics , CLOCK Proteins/metabolism , CLOCK Proteins/genetics , CLOCK Proteins/chemistry , Mutation , Amino Acid Sequence , Gene Expression Regulation, Fungal , Intrinsically Disordered Proteins/metabolism , Intrinsically Disordered Proteins/chemistry , Intrinsically Disordered Proteins/genetics , Protein Array Analysis
3.
Nat Methods ; 21(3): 465-476, 2024 Mar.
Article in English | MEDLINE | ID: mdl-38297184

ABSTRACT

Intrinsically disordered regions (IDRs) are ubiquitous across all domains of life and play a range of functional roles. While folded domains are generally well described by a stable three-dimensional structure, IDRs exist in a collection of interconverting states known as an ensemble. This structural heterogeneity means that IDRs are largely absent from the Protein Data Bank, contributing to a lack of computational approaches to predict ensemble conformational properties from sequence. Here we combine rational sequence design, large-scale molecular simulations and deep learning to develop ALBATROSS, a deep-learning model for predicting ensemble dimensions of IDRs, including the radius of gyration, end-to-end distance, polymer-scaling exponent and ensemble asphericity, directly from sequences at a proteome-wide scale. ALBATROSS is lightweight, easy to use and accessible as both a locally installable software package and a point-and-click-style interface via Google Colab notebooks. We first demonstrate the applicability of our predictors by examining the generalizability of sequence-ensemble relationships in IDRs. Then, we leverage the high-throughput nature of ALBATROSS to characterize the sequence-specific biophysical behavior of IDRs within and between proteomes.


Subject(s)
Intrinsically Disordered Proteins , Intrinsically Disordered Proteins/chemistry , Protein Conformation , Polymers
4.
Trends Biochem Sci ; 48(12): 1019-1034, 2023 12.
Article in English | MEDLINE | ID: mdl-37657994

ABSTRACT

Intrinsically disordered proteins and protein regions (IDRs) are abundant in eukaryotic proteomes and play a wide variety of essential roles. Instead of folding into a stable structure, IDRs exist in an ensemble of interconverting conformations whose structure is biased by sequence-dependent interactions. The absence of a stable 3D structure, combined with high solvent accessibility, means that IDR conformational biases are inherently sensitive to changes in their environment. Here, we argue that IDRs are ideally poised to act as sensors and actuators of cellular physicochemistry. We review the physical principles that underlie IDR sensitivity, the molecular mechanisms that translate this sensitivity to function, and recent studies where environmental sensing by IDRs may play a key role in their downstream function.


Subject(s)
Intrinsically Disordered Proteins , Protein Conformation , Intrinsically Disordered Proteins/metabolism , Protein Domains
5.
Bioinformatics ; 39(8)2023 08 01.
Article in English | MEDLINE | ID: mdl-37540173

ABSTRACT

MOTIVATION: The emergence of high-throughput experiments and high-resolution computational predictions has led to an explosion in the quality and volume of protein sequence annotations at proteomic scales. Unfortunately, sanity checking, integrating, and analyzing complex sequence annotations remains logistically challenging and introduces a major barrier to entry for even superficial integrative bioinformatics. RESULTS: To address this technical burden, we have developed SHEPHARD, a Python framework that trivializes large-scale integrative protein bioinformatics. SHEPHARD combines an object-oriented hierarchical data structure with database-like features, enabling programmatic annotation, integration, and analysis of complex datatypes. Importantly SHEPHARD is easy to use and enables a Pythonic interrogation of largescale protein datasets with millions of unique annotations. We use SHEPHARD to examine three orthogonal proteome-wide questions relating protein sequence to molecular function, illustrating its ability to uncover novel biology. AVAILABILITY AND IMPLEMENTATION: We provided SHEPHARD as both a stand-alone software package (https://github.com/holehouse-lab/shephard), and as a Google Colab notebook with a collection of precomputed proteome-wide annotations (https://github.com/holehouse-lab/shephard-colab).


Subject(s)
Proteome , Proteomics , Software , Computational Biology , Molecular Sequence Annotation
6.
J Phys Chem B ; 127(21): 4746-4760, 2023 06 01.
Article in English | MEDLINE | ID: mdl-37200094

ABSTRACT

Denatured, unfolded, and intrinsically disordered proteins (collectively referred to here as unfolded proteins) can be described using analytical polymer models. These models capture various polymeric properties and can be fit to simulation results or experimental data. However, the model parameters commonly require users' decisions, making them useful for data interpretation but less clearly applicable as stand-alone reference models. Here we use all-atom simulations of polypeptides in conjunction with polymer scaling theory to parameterize an analytical model of unfolded polypeptides that behave as ideal chains (ν = 0.50). The model, which we call the analytical Flory random coil (AFRC), requires only the amino acid sequence as input and provides direct access to probability distributions of global and local conformational order parameters. The model defines a specific reference state to which experimental and computational results can be compared and normalized. As a proof-of-concept, we use the AFRC to identify sequence-specific intramolecular interactions in simulations of disordered proteins. We also use the AFRC to contextualize a curated set of 145 different radii of gyration obtained from previously published small-angle X-ray scattering experiments of disordered proteins. The AFRC is implemented as a stand-alone software package and is also available via a Google Colab notebook. In summary, the AFRC provides a simple-to-use reference polymer model that can guide intuition and aid in interpreting experimental or simulation results.


Subject(s)
Intrinsically Disordered Proteins , Intrinsically Disordered Proteins/chemistry , Peptides/chemistry , Molecular Conformation , Amino Acid Sequence , Polymers , Protein Conformation , Protein Folding
7.
bioRxiv ; 2023 Mar 09.
Article in English | MEDLINE | ID: mdl-36945394

ABSTRACT

Positively charged repeat peptides are emerging as key players in neurodegenerative diseases. These peptides can perturb diverse cellular pathways but a unifying framework for how such promiscuous toxicity arises has remained elusive. We used mass-spectrometry-based proteomics to define the protein targets of these neurotoxic peptides and found that they all share similar sequence features that drive their aberrant condensation with these positively charged peptides. We trained a machine learning algorithm to detect such sequence features and unexpectedly discovered that this mode of toxicity is not limited to human repeat expansion disorders but has evolved countless times across the tree of life in the form of cationic antimicrobial and venom peptides. We demonstrate that an excess in positive charge is necessary and sufficient for this killer activity, which we name 'polycation poisoning'. These findings reveal an ancient and conserved mechanism and inform ways to leverage its design rules for new generations of bioactive peptides.

8.
bioRxiv ; 2023 Mar 13.
Article in English | MEDLINE | ID: mdl-36993592

ABSTRACT

Denatured, unfolded, and intrinsically disordered proteins (collectively referred to here as unfolded proteins) can be described using analytical polymer models. These models capture various polymeric properties and can be fit to simulation results or experimental data. However, the model parameters commonly require users' decisions, making them useful for data interpretation but less clearly applicable as stand-alone reference models. Here we use all-atom simulations of polypeptides in conjunction with polymer scaling theory to parameterize an analytical model of unfolded polypeptides that behave as ideal chains (ν = 0.50). The model, which we call the analytical Flory Random Coil (AFRC), requires only the amino acid sequence as input and provides direct access to probability distributions of global and local conformational order parameters. The model defines a specific reference state to which experimental and computational results can be compared and normalized. As a proof-of-concept, we use the AFRC to identify sequence-specific intramolecular interactions in simulations of disordered proteins. We also use the AFRC to contextualize a curated set of 145 different radii of gyration obtained from previously published small-angle X-ray scattering experiments of disordered proteins. The AFRC is implemented as a stand-alone software package and is also available via a Google colab notebook. In summary, the AFRC provides a simple-to-use reference polymer model that can guide intuition and aid in interpreting experimental or simulation results.

9.
Methods Mol Biol ; 2563: 95-116, 2023.
Article in English | MEDLINE | ID: mdl-36227469

ABSTRACT

Cellular organization is determined by a combination of membrane-bound and membrane-less biomolecular assemblies that range from clusters of tens of molecules to micrometer-sized cellular bodies. Over the last decade, membrane-less assemblies have come to be referred to as biomolecular condensates, reflecting their ability to condense specific molecules with respect to the remainder of the cell. In many cases, the physics of phase transitions provides a conceptual framework and a mathematical toolkit to describe the assembly, maintenance, and dissolution of biomolecular condensates. Among the various quantitative and qualitative models applied to understand intracellular phase transitions, the stickers-and-spacers framework offers an intuitive yet rigorous means to map biomolecular sequences and structure to the driving forces needed for higher-order assembly. This chapter introduces the fundamental concepts behind the stickers-and-spacers model, considers its application to different biological systems, and discusses limitations and misconceptions around the model.


Subject(s)
Biomolecular Condensates , Phase Transition
10.
EMBO J ; 41(8): e111062, 2022 04 19.
Article in English | MEDLINE | ID: mdl-35307855

ABSTRACT

Post-translational modifications of intrinsically disordered regions (IDRs) enable changes in sequence chemistry, which in turn can tune conformational behavior and molecular interactions. In this issue of The EMBO Journal, Gruijs da Silva et al disentangle the effect of hyperphosphorylation on the C-terminal domain of TDP-43, a key IDR implicated in Amyotrophic Lateral Sclerosis (ALS).


Subject(s)
Amyotrophic Lateral Sclerosis , DNA-Binding Proteins , Amyotrophic Lateral Sclerosis/genetics , DNA-Binding Proteins/chemistry , Humans , Solubility
11.
Biochemistry ; 60(47): 3566-3581, 2021 11 30.
Article in English | MEDLINE | ID: mdl-34784177

ABSTRACT

In immature oocytes, Balbiani bodies are conserved membraneless condensates implicated in oocyte polarization, the organization of mitochondria, and long-term organelle and RNA storage. In Xenopus laevis, Balbiani body assembly is mediated by the protein Velo1. Velo1 contains an N-terminal prion-like domain (PLD) that is essential for Balbiani body formation. PLDs have emerged as a class of intrinsically disordered regions that can undergo various different types of intracellular phase transitions and are often associated with dynamic, liquid-like condensates. Intriguingly, the Velo1 PLD forms solid-like assemblies. Here we sought to understand why Velo1 phase behavior appears to be biophysically distinct from that of other PLD-containing proteins. Through bioinformatic analysis and coarse-grained simulations, we predict that the clustering of aromatic residues and the amino acid composition of residues between aromatics can influence condensate material properties, organization, and the driving forces for assembly. To test our predictions, we redesigned the Velo1 PLD to test the impact of targeted sequence changes in vivo. We found that the Velo1 design with evenly spaced aromatic residues shows rapid internal dynamics, as probed by fluorescent recovery after photobleaching, even when recruited into Balbiani bodies. Our results suggest that Velo1 might have been selected in evolution for distinctly clustered aromatic residues to maintain the structure of Balbiani bodies in long-lived oocytes. In general, our work identifies several tunable parameters that can be used to augment the condensate material state, offering a road map for the design of synthetic condensates.


Subject(s)
Biomolecular Condensates/metabolism , T-Box Domain Proteins/metabolism , Xenopus Proteins/metabolism , Amino Acids, Aromatic/chemistry , Amino Acids, Aromatic/genetics , Amino Acids, Aromatic/metabolism , Animals , Cell Polarity , Cells, Cultured , Female , Intravital Microscopy , Oocytes/cytology , Oocytes/metabolism , Phase Transition , Primary Cell Culture , Protein Domains/genetics , Protein Engineering , T-Box Domain Proteins/chemistry , T-Box Domain Proteins/genetics , Xenopus Proteins/chemistry , Xenopus Proteins/genetics , Xenopus laevis
12.
J Phys Chem Lett ; 11(23): 10131-10136, 2020 Dec 03.
Article in English | MEDLINE | ID: mdl-33191750

ABSTRACT

Intrinsically disordered protein-regions (IDRs) make up roughly 30% of the human proteome and are central to a wide range of biological processes. Given a lack of persistent tertiary structure, all residues in IDRs are, to some extent, solvent exposed. This extensive surface area, coupled with the absence of strong intramolecular contacts, makes IDRs inherently sensitive to their chemical environment. We report a combined experimental, computational, and analytical framework for high-throughput characterization of IDR sensitivity. Our framework reveals that IDRs can expand or compact in response to changes in their solution environment. Importantly, the direction and magnitude of conformational change depend on both protein sequence and cosolute identity. For example, some solutes such as short polyethylene glycol chains exert an expanding effect on some IDRs and a compacting effect on others. Despite this complex behavior, we can rationally interpret IDR responsiveness to solution composition changes using relatively simple polymer models. Our results imply that solution-responsive IDRs are ubiquitous and can provide an additional layer of regulation to biological systems.


Subject(s)
Intrinsically Disordered Proteins/chemistry , Fluorescence Resonance Energy Transfer , High-Throughput Screening Assays , Humans , Protein Conformation , Solutions , Surface Properties
13.
Methods Mol Biol ; 2141: 103-126, 2020.
Article in English | MEDLINE | ID: mdl-32696354

ABSTRACT

Intrinsically disordered proteins and protein regions are ubiquitous across eukaryotic proteomes where they play a range of functional roles. Unlike folded proteins, IDRs lack a well-defined native state but exist in heterogeneous ensembles of conformations. In the absence of a defined native state, structure-guided mutations to test specific mechanistic hypotheses are generally not possible. Despite this, the use of mutations to alter sequence properties has become a relatively common approach for teasing out the relationship between sequence, ensemble, and function. A key step in designing informative mutants is the ability to identify specific sequence features that may reveal an interpretable response if perturbed. Here, we provide guidance on using the CIDER and localCIDER tools for amino acid sequence analysis, with a focus on building intuition with respect to the most commonly described features.


Subject(s)
Algorithms , Intrinsically Disordered Proteins/chemistry , Sequence Analysis, Protein , Amino Acid Sequence , Humans , Proteome/metabolism , Software
14.
Cell Rep ; 24(4): 861-872.e6, 2018 07 24.
Article in English | MEDLINE | ID: mdl-30044983

ABSTRACT

Non-retroviral integrated RNA viral sequences (NIRVs) potentially encoding ∼280 amino acid homologs to filovirus VP35 proteins are present across the Myotis genus of bats. These are estimated to have been maintained for ∼18 million years, indicating their co-option. To address the reasons for co-option, 16 Myotis VP35s were characterized in comparison to VP35s from the extant filoviruses Ebola virus and Marburg virus, in which VP35s play critical roles in immune evasion and RNA synthesis. The Myotis VP35s demonstrated a conserved suppression of innate immune signaling, albeit with reduced potency, in either human or Myotis cells. Their attenuation reflects a lack of dsRNA binding that in the filoviral VP35s correlates with potent suppression of interferon responses. Despite divergent function, evolution has preserved in Myotis the structure of the filoviral VP35s, indicating that this structure is critical for co-opted function, possibly as a regulator of innate immune signaling.


Subject(s)
Chiroptera/genetics , Ebolavirus/immunology , Filoviridae/immunology , Viral Regulatory and Accessory Proteins/chemistry , Viral Regulatory and Accessory Proteins/immunology , Animals , Chick Embryo , Ebolavirus/genetics , Filoviridae/genetics , Genome , HEK293 Cells , Humans , Interferon-beta/biosynthesis , Interferon-beta/genetics , Interferon-beta/immunology , Marburgvirus/genetics , Marburgvirus/immunology , Models, Molecular , Protein Binding , RNA, Double-Stranded/metabolism , Signal Transduction , Structure-Activity Relationship , Viral Regulatory and Accessory Proteins/antagonists & inhibitors , Viral Regulatory and Accessory Proteins/genetics
SELECTION OF CITATIONS
SEARCH DETAIL
...