Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 47
Filter
1.
PLoS Comput Biol ; 20(5): e1011372, 2024 May.
Article in English | MEDLINE | ID: mdl-38748749

ABSTRACT

Low-complexity domains (LCDs) in proteins are typically enriched in one or two predominant amino acids. As a result, LCDs often exhibit unusual structural/biophysical tendencies and can occupy functional niches. However, for each organism, protein sequences must be compatible with intracellular biomolecules and physicochemical environment, both of which vary from organism to organism. This raises the possibility that LCDs may occupy sequence spaces in select organisms that are otherwise prohibited in most organisms. Here, we report a comprehensive survey and functional analysis of LCDs in all known reference proteomes (>21k organisms), with added focus on rare and unusual types of LCDs. LCDs were classified according to both the primary amino acid and secondary amino acid in each LCD sequence, facilitating detailed comparisons of LCD class frequencies across organisms. Examination of LCD classes at different depths (i.e., domain of life, organism, protein, and per-residue levels) reveals unique facets of LCD frequencies and functions. To our surprise, all 400 LCD classes occur in nature, although some are exceptionally rare. A number of rare classes can be defined for each domain of life, with many LCD classes appearing to be eukaryote-specific. Certain LCD classes were consistently associated with identical functions across many organisms, particularly in eukaryotes. Our analysis methods enable simultaneous, direct comparison of all LCD classes between individual organisms, resulting in a proteome-scale view of differences in LCD frequencies and functions. Together, these results highlight the remarkable diversity and functional specificity of LCDs across all known life forms.


Subject(s)
Computational Biology , Proteome , Proteome/chemistry , Proteome/metabolism , Animals , Computational Biology/methods , Humans , Protein Domains , Amino Acid Sequence , Proteins/chemistry , Proteins/metabolism , Amino Acids/chemistry , Databases, Protein , Proteomics/methods
2.
RNA ; 28(10): 1298-1314, 2022 10.
Article in English | MEDLINE | ID: mdl-35863866

ABSTRACT

Serine/arginine-rich (SR) proteins comprise a family of proteins that is predominantly found in eukaryotes and plays a prominent role in RNA splicing. A characteristic feature of SR proteins is the presence of an S/R-rich low-complexity domain (RS domain), often in conjunction with spatially distinct RNA recognition motifs (RRMs). To date, 52 human proteins have been classified as SR or SR-related proteins. Here, using an unbiased series of composition criteria together with enrichment for known RNA binding activity, we identified >100 putative SR-related proteins in the human proteome. This method recovers known SR and SR-related proteins with high sensitivity (∼94%), yet identifies a number of additional proteins with many of the hallmark features of true SR-related proteins. Newly identified SR-related proteins display slightly different amino acid compositions yet similar levels of post-translational modification, suggesting that these new SR-related candidates are regulated in vivo and functionally important. Furthermore, candidate SR-related proteins with known RNA-binding activity (but not currently recognized as SR-related proteins) are nevertheless strongly associated with a variety of functions related to mRNA splicing and nuclear speckles. Finally, we applied our SR search method to all available reference proteomes, and provide maps of RS domains and Pfam annotations for all putative SR-related proteins as a resource. Together, these results expand the set of SR-related proteins in humans, and identify the most common functions associated with SR-related proteins across all domains of life.


Subject(s)
Proteome , RNA-Binding Proteins , Animals , Arginine/metabolism , Humans , Nuclear Proteins/genetics , Proteome/genetics , RNA/metabolism , RNA Precursors/genetics , RNA Splicing , RNA, Messenger/genetics , RNA-Binding Proteins/metabolism , Serine/genetics , Serine-Arginine Splicing Factors/genetics , Serine-Arginine Splicing Factors/metabolism
3.
J Biol Chem ; 298(3): 101677, 2022 03.
Article in English | MEDLINE | ID: mdl-35131265

ABSTRACT

In response to the recent SARS-CoV-2 pandemic, a number of labs across the world have reallocated their time and resources to better our understanding of the virus. For some viruses, including SARS-CoV-2, viral proteins can undergo phase separation: a biophysical process often related to the partitioning of protein and RNA into membraneless organelles in vivo. In this review, we discuss emerging observations of phase separation by the SARS-CoV-2 nucleocapsid (N) protein-an essential viral protein required for viral replication-and the possible in vivo functions that have been proposed for N-protein phase separation, including viral replication, viral genomic RNA packaging, and modulation of host-cell response to infection. Additionally, since a relatively large number of studies examining SARS-CoV-2 N-protein phase separation have been published in a short span of time, we take advantage of this situation to compare results from similar experiments across studies. Our evaluation highlights potential strengths and pitfalls of drawing conclusions from a single set of experiments, as well as the value of publishing overlapping scientific observations performed simultaneously by multiple labs.


Subject(s)
COVID-19 , Nucleocapsid Proteins , SARS-CoV-2 , COVID-19/virology , Consensus , Humans , Nucleocapsid/genetics , Nucleocapsid/metabolism , Nucleocapsid Proteins/isolation & purification , Nucleocapsid Proteins/metabolism , RNA, Viral/metabolism , SARS-CoV-2/chemistry , SARS-CoV-2/metabolism , Viral Proteins/metabolism
4.
Bioinformatics ; 38(24): 5446-5448, 2022 12 13.
Article in English | MEDLINE | ID: mdl-36282522

ABSTRACT

SUMMARY: Low-complexity domains (LCDs) in proteins are regions enriched in a small subset of amino acids. LCDs exist in all domains of life, often have unusual biophysical behavior, and function in both normal and pathological processes. We recently developed an algorithm to identify LCDs based predominantly on amino acid composition thresholds. Here, we have integrated this algorithm with a webserver and augmented it with additional analysis options. Specifically, users can (i) search for LCDs in whole proteomes by setting minimum composition thresholds for individual or grouped amino acids, (ii) submit a known LCD sequence to search for similar LCDs, (iii) search for and plot LCDs within a single protein, (iv) statistically test for enrichment of LCDs within a user-provided protein set and (v) specifically identify proteins with multiple types of LCDs. AVAILABILITY AND IMPLEMENTATION: The LCD-Composer server can be accessed at http://lcd-composer.bmb.colostate.edu. The corresponding command-line scripts can be accessed at https://github.com/RossLabCSU/LCD-Composer/tree/master/WebserverScripts.


Subject(s)
Amino Acids , Proteome , Algorithms
5.
Proc Natl Acad Sci U S A ; 117(11): 5826-5835, 2020 03 17.
Article in English | MEDLINE | ID: mdl-32127480

ABSTRACT

Mutations in a number of stress granule-associated proteins have been linked to various neurodegenerative diseases. Several of these mutations are found in aggregation-prone prion-like domains (PrLDs) within these proteins. In this work, we examine the sequence features governing PrLD localization to stress granules upon stress. We demonstrate that many yeast PrLDs are sufficient for stress-induced assembly into microscopically visible foci that colocalize with stress granule markers. Additionally, compositional biases exist among PrLDs that assemble upon stress, and these biases are consistent across different stressors. Using these biases, we have developed a composition-based prediction method that accurately predicts PrLD assembly into foci upon heat shock. We show that compositional changes alter PrLD assembly behavior in a predictable manner, while scrambling primary sequence has little effect on PrLD assembly and recruitment to stress granules. Furthermore, we were able to design synthetic PrLDs that were efficiently recruited to stress granules, and found that aromatic amino acids, which have previously been linked to PrLD phase separation, were dispensable for this recruitment. These results highlight the flexible sequence requirements for stress granule recruitment and suggest that PrLD localization to stress granules is driven primarily by amino acid composition, rather than primary sequence.


Subject(s)
Cytoplasmic Granules/metabolism , Prion Proteins/chemistry , Protein Domains , Stress, Physiological/physiology , Base Composition , Heat-Shock Proteins/metabolism , Mutation , Neurodegenerative Diseases/metabolism , Prion Proteins/genetics , Prion Proteins/metabolism , Prions/metabolism , Saccharomyces cerevisiae/metabolism , Sequence Analysis, Protein , Sodium Azide/pharmacology , Stress, Physiological/genetics
6.
FASEB J ; 34(8): 9832-9842, 2020 08.
Article in English | MEDLINE | ID: mdl-32562316

ABSTRACT

To date, the recently discovered SARS-CoV-2 virus has afflicted >6.9 million people worldwide and disrupted the global economy. Development of effective vaccines or treatments for SARS-CoV-2 infection will be aided by a molecular-level understanding of SARS-CoV-2 proteins and their interactions with host cell proteins. The SARS-CoV-2 nucleocapsid (N) protein is highly homologous to the N protein of SARS-CoV, which is essential for viral RNA replication and packaging into new virions. Emerging models indicate that nucleocapsid proteins of other viruses can form biomolecular condensates to spatiotemporally regulate N protein localization and function. Our bioinformatic analyses, in combination with pre-existing experimental evidence, suggest that the SARS-CoV-2 N protein is capable of forming or regulating biomolecular condensates in vivo by interaction with RNA and key host cell proteins. We discuss multiple models, whereby the N protein of SARS-CoV-2 may harness this activity to regulate viral life cycle and host cell response to viral infection.


Subject(s)
Coronavirus Nucleocapsid Proteins/chemistry , SARS-CoV-2/chemistry , Binding Sites , Computational Biology , Cytoplasmic Granules/chemistry , Humans , Phosphoproteins/chemistry , Protein Binding , Protein Domains , Protein Kinases/chemistry , SARS-CoV-2/physiology , Virus Assembly , Virus Replication
7.
PLoS Comput Biol ; 16(1): e1007487, 2020 01.
Article in English | MEDLINE | ID: mdl-31986130

ABSTRACT

A variety of studies have suggested that low-complexity domains (LCDs) tend to be intrinsically disordered and are relatively rare within structured proteins in the Protein Data Bank (PDB). Although LCDs are often treated as a single class, we previously found that LCDs enriched in different amino acids can exhibit substantial differences in protein metabolism and function. Therefore, we wondered whether the structural conformations of LCDs are likewise dependent on which specific amino acids are enriched within each LCD. Here, we directly examined relationships between enrichment of individual amino acids and secondary structure tendencies across the entire PDB proteome. Secondary structure tendencies varied as a function of the identity of the amino acid enriched and its degree of enrichment. Furthermore, divergence in secondary structure profiles often occurred for LCDs enriched in physicochemically similar amino acids (e.g. valine vs. leucine), indicating that LCDs composed of related amino acids can have distinct secondary structure tendencies. Comparison of LCD secondary structure tendencies with numerous pre-existing secondary structure propensity scales resulted in relatively poor correlations for certain types of LCDs, indicating that these scales may not capture secondary structure tendencies as sequence complexity decreases. Collectively, these observations provide a highly resolved view of structural tendencies among LCDs parsed by the nature and magnitude of single amino acid enrichment.


Subject(s)
Protein Domains/physiology , Protein Structure, Secondary/physiology , Proteins/chemistry , Proteome/chemistry , Proteomics/methods , Algorithms , Amino Acid Sequence/physiology , Amino Acids/chemistry , Amino Acids/metabolism , Databases, Protein
8.
PLoS Genet ; 14(7): e1007517, 2018 07.
Article in English | MEDLINE | ID: mdl-30005071

ABSTRACT

Enhanced protein aggregation and/or impaired clearance of aggregates can lead to neurodegenerative disorders such as Alzheimer's Disease, Huntington's Disease, and prion diseases. Therefore, many protein quality control factors specialize in recognizing and degrading aggregation-prone proteins. Prions, which generally result from self-propagating protein aggregates, must therefore evade or outcompete these quality control systems in order to form and propagate in a cellular context. We developed a genetic screen in yeast that allowed us to explore the sequence features that promote degradation versus aggregation of a model glutamine/asparagine (Q/N)-rich prion domain from the yeast prion protein, Sup35, and two model glycine (G)-rich prion-like domains from the human proteins hnRNPA1 and hnRNPA2. Unexpectedly, we found that aggregation propensity and degradation propensity could be uncoupled in multiple ways. First, only a subset of classically aggregation-promoting amino acids elicited a strong degradation response in the G-rich prion-like domains. Specifically, large aliphatic residues enhanced degradation of the prion-like domains, whereas aromatic residues promoted prion aggregation without enhancing degradation. Second, the degradation-promoting effect of aliphatic residues was suppressed in the context of the Q/N-rich prion domain, and instead led to a dose-dependent increase in the frequency of spontaneous prion formation. Degradation suppression correlated with Q/N content of the surrounding prion domain, potentially indicating an underappreciated activity for these residues in yeast prion domains. Collectively, these results provide key insights into how certain aggregation-prone proteins may evade protein quality control degradation systems.


Subject(s)
Neurodegenerative Diseases/genetics , Prions/genetics , Protein Aggregation, Pathological/genetics , Protein Domains/genetics , Proteolysis , Amino Acid Sequence/genetics , Asparagine/genetics , Asparagine/metabolism , Glutamine/genetics , Glutamine/metabolism , Heterogeneous-Nuclear Ribonucleoprotein Group A-B/genetics , Heterogeneous-Nuclear Ribonucleoprotein Group A-B/metabolism , Humans , Neurodegenerative Diseases/pathology , Peptide Termination Factors/genetics , Peptide Termination Factors/metabolism , Prions/metabolism , Protein Aggregation, Pathological/pathology , Saccharomyces cerevisiae Proteins/genetics , Saccharomyces cerevisiae Proteins/metabolism
9.
Int J Mol Sci ; 22(3)2021 Jan 27.
Article in English | MEDLINE | ID: mdl-33513942

ABSTRACT

Stress granules are ribonucleoprotein assemblies that form in response to cellular stress. Many of the RNA-binding proteins found in stress granule proteomes contain prion-like domains (PrLDs), which are low-complexity sequences that compositionally resemble yeast prion domains. Mutations in some of these PrLDs have been implicated in neurodegenerative diseases, including amyotrophic lateral sclerosis and frontotemporal dementia, and are associated with persistent stress granule accumulation. While both stress granules and prions are macromolecular assemblies, they differ in both their physical properties and complexity. Prion aggregates are highly stable homopolymeric solids, while stress granules are complex dynamic biomolecular condensates driven by multivalent homotypic and heterotypic interactions. Here, we use stress granules and yeast prions as a paradigm to examine how distinct sequence and compositional features of PrLDs contribute to different types of PrLD-containing assemblies.


Subject(s)
Cytoplasmic Granules/genetics , Organelles/genetics , Prion Proteins/genetics , Prions/genetics , Amyotrophic Lateral Sclerosis/genetics , Amyotrophic Lateral Sclerosis/pathology , Cytoplasmic Granules/ultrastructure , Frontotemporal Dementia/genetics , Frontotemporal Dementia/pathology , Humans , Neurodegenerative Diseases/genetics , Neurodegenerative Diseases/pathology , Organelles/ultrastructure , Prion Proteins/ultrastructure , Prions/ultrastructure , Protein Domains/genetics , Proteome/genetics , RNA-Binding Proteins , Saccharomyces cerevisiae/genetics
10.
Int J Mol Sci ; 22(16)2021 Aug 19.
Article in English | MEDLINE | ID: mdl-34445649

ABSTRACT

Protein aggregation is associated with a growing list of human diseases. A substantial fraction of proteins in eukaryotic proteomes constitutes a proteostasis network-a collection of proteins that work together to maintain properly folded proteins. One of the overarching functions of the proteostasis network is the prevention or reversal of protein aggregation. How proteins aggregate in spite of the anti-aggregation activity of the proteostasis machinery is incompletely understood. Exposed hydrophobic patches can trigger degradation by the ubiquitin-proteasome system, a key branch of the proteostasis network. However, in a recent study, we found that model glycine (G)-rich or glutamine/asparagine (Q/N)-rich prion-like domains differ in their susceptibility to detection and degradation by this system. Here, we expand upon this work by examining whether the features controlling the degradation of our model prion-like domains generalize broadly to G-rich and Q/N-rich domains. Experimentally, native yeast G-rich domains in isolation are sensitive to the degradation-promoting effects of hydrophobic residues, whereas native Q/N-rich domains completely resist these effects and tend to aggregate instead. Bioinformatic analyses indicate that native G-rich domains from yeast and humans tend to avoid degradation-promoting features, suggesting that the proteostasis network may act as a form of selection at the molecular level that constrains the sequence space accessible to G-rich domains. However, the sensitivity or resistance of G-rich and Q/N-rich domains, respectively, was not always preserved in their native protein contexts, highlighting that proteins can evolve other sequence features to overcome the intrinsic sensitivity of some LCDs to degradation.


Subject(s)
Protein Aggregates/physiology , Proteome/metabolism , Proteostasis , Saccharomyces cerevisiae Proteins/chemistry , Saccharomyces cerevisiae Proteins/metabolism , Saccharomyces cerevisiae/metabolism , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae/growth & development , Saccharomyces cerevisiae Proteins/genetics
11.
BMC Genomics ; 21(1): 23, 2020 Jan 08.
Article in English | MEDLINE | ID: mdl-31914925

ABSTRACT

BACKGROUND: Impaired proteostatic regulation of proteins with prion-like domains (PrLDs) is associated with a variety of human diseases including neurodegenerative disorders, myopathies, and certain forms of cancer. For many of these disorders, current models suggest a prion-like molecular mechanism of disease, whereby proteins aggregate and spread to neighboring cells in an infectious manner. The development of prion prediction algorithms has facilitated the large-scale identification of PrLDs among "reference" proteomes for various organisms. However, the degree to which intraspecies protein sequence diversity influences predicted prion propensity has not been systematically examined. RESULTS: Here, we explore protein sequence variation introduced at genetic, post-transcriptional, and post-translational levels, and its influence on predicted aggregation propensity for human PrLDs. We find that sequence variation is relatively common among PrLDs and in some cases can result in relatively large differences in predicted prion propensity. Sequence variation introduced at the post-transcriptional level (via alternative splicing) also commonly affects predicted aggregation propensity, often by direct inclusion or exclusion of a PrLD. Finally, analysis of a database of sequence variants associated with human disease reveals a number of mutations within PrLDs that are predicted to increase prion propensity. CONCLUSIONS: Our analyses expand the list of candidate human PrLDs, quantitatively estimate the effects of sequence variation on the aggregation propensity of PrLDs, and suggest the involvement of prion-like mechanisms in additional human diseases.


Subject(s)
Proteome/metabolism , Algorithms , Alternative Splicing/genetics , Alternative Splicing/physiology , Amino Acid Sequence , Humans , Mutation , Neurodegenerative Diseases/metabolism , Prion Proteins/metabolism , Prions , Protein Aggregates/genetics , Protein Aggregates/physiology , Protein Domains/genetics , Protein Domains/physiology , Proteome/genetics
12.
Curr Genet ; 66(3): 463-468, 2020 Jun.
Article in English | MEDLINE | ID: mdl-31745569

ABSTRACT

Serine-arginine (SR) protein kinases regulate diverse cellular activities, including various steps in RNA maturation and transport. The yeast Saccharomyces cerevisiae expresses a single SR kinase, Sky1. Sky1 has a bipartite kinase domain, separated by an aggregation-prone prion-like domain (PrLD). The assembly of PrLDs is involved in the formation of various membraneless organelles, including stress granules; stress granules are reversible ribonucleoprotein assemblies that form in response to a variety of stresses. Here, we review a recent study suggesting that Sky1's PrLD promotes Sky1 recruitment to stress granules, and that Sky1 regulates stress granule dissolution by phosphorylating the RNA-shuttling protein Npl3.


Subject(s)
Cytoplasmic Granules/metabolism , Organelles/metabolism , Prions/metabolism , Protein Serine-Threonine Kinases/metabolism , Saccharomyces cerevisiae Proteins/metabolism , Saccharomyces cerevisiae/metabolism , Phosphorylation , Saccharomyces cerevisiae/growth & development
13.
Curr Genet ; 65(2): 387-392, 2019 Apr.
Article in English | MEDLINE | ID: mdl-30310993

ABSTRACT

Protein aggregation in vivo is generally combated by extensive proteostatic defenses. Many proteostasis factors specifically recognize aggregation-prone features and re-fold or degrade the targeted protein. However, protein aggregation is not uncommon, suggesting that some proteins employ evasive strategies to aggregate in spite of the proteostasis machinery. Therefore, in addition to understanding the inherent aggregation propensity of protein sequences, it is important to understand how these sequences affect proteostatic recognition and regulation in vivo. In a recent study, we used a genetic mutagenesis and screening approach to explore the aggregation or degradation promoting effects of the canonical amino acids in the context of G-rich and Q/N-rich prion-like domains (PrLDs). Our results indicate that aggregation propensity scales are strongly influenced by the interplay between specific PrLD features and proteostatic recognition. Here, we briefly review these results and expand upon their potential implications. In addition, a preliminary exploration of the yeast proteome suggests that these proteostatic regulation heuristics may influence the compositional features of native G-rich and Q/N-rich domains in yeast. These results improve our understanding of the features affecting the aggregation and proteostatic regulation of prion-like domains in a cellular context, and suggest that the sequence space for native prion-like domains may be shaped by proteostatic constraints.


Subject(s)
Prion Proteins/genetics , Prion Proteins/metabolism , Prions/genetics , Prions/metabolism , Protein Aggregates , Protein Interaction Domains and Motifs , Animals , Genetic Variation , Humans , Hydrophobic and Hydrophilic Interactions , Prion Proteins/chemistry , Prions/chemistry , Protein Aggregation, Pathological , Protein Binding , Proteolysis
14.
PLoS Comput Biol ; 14(9): e1006256, 2018 09.
Article in English | MEDLINE | ID: mdl-30248088

ABSTRACT

Proteins with low-complexity domains continue to emerge as key players in both normal and pathological cellular processes. Although low-complexity domains are often grouped into a single class, individual low-complexity domains can differ substantially with respect to amino acid composition. These differences may strongly influence the physical properties, cellular regulation, and molecular functions of low-complexity domains. Therefore, we developed a bioinformatic approach to explore relationships between amino acid composition, protein metabolism, and protein function. We find that local compositional enrichment within protein sequences is associated with differences in translation efficiency, abundance, half-life, protein-protein interaction promiscuity, subcellular localization, and molecular functions of proteins on a proteome-wide scale. However, local enrichment of related amino acids is sometimes associated with opposite effects on protein regulation and function, highlighting the importance of distinguishing between different types of low-complexity domains. Furthermore, many of these effects are discernible at amino acid compositions below those required for classification as low-complexity or statistically-biased by traditional methods and in the absence of homopolymeric amino acid repeats, indicating that thresholds employed by classical methods may not reflect biologically relevant criteria. Application of our analyses to composition-driven processes, such as the formation of membraneless organelles, reveals distinct composition profiles even for closely related organelles. Collectively, these results provide a unique perspective and detailed insights into relationships between amino acid composition, protein metabolism, and protein functions.


Subject(s)
Amino Acids/chemistry , Computational Biology/methods , Proteins/chemistry , Proteome/genetics , Proteomics , Algorithms , Amino Acid Motifs , Animals , Caenorhabditis elegans , Cell Membrane/chemistry , Evolution, Molecular , Open Reading Frames , Polymers , Protein Domains , Protein Interaction Mapping , Saccharomyces cerevisiae
15.
Nature ; 495(7442): 467-73, 2013 Mar 28.
Article in English | MEDLINE | ID: mdl-23455423

ABSTRACT

Algorithms designed to identify canonical yeast prions predict that around 250 human proteins, including several RNA-binding proteins associated with neurodegenerative disease, harbour a distinctive prion-like domain (PrLD) enriched in uncharged polar amino acids and glycine. PrLDs in RNA-binding proteins are essential for the assembly of ribonucleoprotein granules. However, the interplay between human PrLD function and disease is not understood. Here we define pathogenic mutations in PrLDs of heterogeneous nuclear ribonucleoproteins (hnRNPs) A2B1 and A1 in families with inherited degeneration affecting muscle, brain, motor neuron and bone, and in one case of familial amyotrophic lateral sclerosis. Wild-type hnRNPA2 (the most abundant isoform of hnRNPA2B1) and hnRNPA1 show an intrinsic tendency to assemble into self-seeding fibrils, which is exacerbated by the disease mutations. Indeed, the pathogenic mutations strengthen a 'steric zipper' motif in the PrLD, which accelerates the formation of self-seeding fibrils that cross-seed polymerization of wild-type hnRNP. Notably, the disease mutations promote excess incorporation of hnRNPA2 and hnRNPA1 into stress granules and drive the formation of cytoplasmic inclusions in animal models that recapitulate the human pathology. Thus, dysregulated polymerization caused by a potent mutant steric zipper motif in a PrLD can initiate degenerative disease. Related proteins with PrLDs should therefore be considered candidates for initiating and perhaps propagating proteinopathies of muscle, brain, motor neuron and bone.


Subject(s)
Amyotrophic Lateral Sclerosis/genetics , Amyotrophic Lateral Sclerosis/pathology , Frontotemporal Dementia/genetics , Heterogeneous-Nuclear Ribonucleoprotein Group A-B/chemistry , Heterogeneous-Nuclear Ribonucleoprotein Group A-B/metabolism , Muscular Dystrophies, Limb-Girdle/genetics , Mutant Proteins/genetics , Mutation/genetics , Myositis, Inclusion Body/genetics , Osteitis Deformans/genetics , Prions/chemistry , Amino Acid Sequence , Amyotrophic Lateral Sclerosis/metabolism , Animals , Drosophila melanogaster/cytology , Drosophila melanogaster/genetics , Drosophila melanogaster/metabolism , Female , Frontotemporal Dementia/metabolism , Frontotemporal Dementia/pathology , HeLa Cells , Heterogeneous-Nuclear Ribonucleoprotein Group A-B/genetics , Humans , Inclusion Bodies/genetics , Inclusion Bodies/metabolism , Inclusion Bodies/pathology , Male , Mice , Molecular Sequence Data , Muscular Dystrophies, Limb-Girdle/metabolism , Muscular Dystrophies, Limb-Girdle/pathology , Mutant Proteins/chemistry , Mutant Proteins/metabolism , Myositis, Inclusion Body/metabolism , Myositis, Inclusion Body/pathology , Osteitis Deformans/metabolism , Osteitis Deformans/pathology , Peptide Termination Factors/chemistry , Peptide Termination Factors/genetics , Peptide Termination Factors/metabolism , Prions/genetics , Prions/metabolism , Protein Structure, Tertiary/genetics , RNA/metabolism , Saccharomyces cerevisiae Proteins/chemistry , Saccharomyces cerevisiae Proteins/genetics , Saccharomyces cerevisiae Proteins/metabolism
16.
Hum Mol Genet ; 25(5): 936-50, 2016 Mar 01.
Article in English | MEDLINE | ID: mdl-26744327

ABSTRACT

Adult-onset inherited myopathies with similar pathological features, including hereditary inclusion body myopathy (hIBM) and limb-girdle muscular dystrophy (LGMD), are a genetically heterogeneous group of muscle diseases. It is unclear whether these inherited myopathies initiated by mutations in distinct classes of genes are etiologically related. Here, we exploit a genetic model system to establish a mechanistic link between diseases caused by mutations in two distinct genes, hnRNPA2B1 and DNAJB6. Hrb98DE and mrj are the Drosophila melanogaster homologs of human hnRNPA2B1 and DNAJB6, respectively. We introduced disease-homologous mutations to Hrb98DE, thus capturing mutation-dependent phenotypes in a genetically tractable model system. Ectopic expression of the disease-associated mutant form of hnRNPA2B1 or Hrb98DE in fly muscle resulted in progressive, age-dependent cytoplasmic inclusion pathology, as observed in humans with hnRNPA2B1-related myopathy. Cytoplasmic inclusions consisted of hnRNPA2B1 or Hrb98DE protein in association with the stress granule marker ROX8 and additional endogenous RNA-binding proteins (RBPs), suggesting that these pathological inclusions are related to stress granules. Notably, TDP-43 was also recruited to these cytoplasmic inclusions. Remarkably, overexpression of MRJ rescued this phenotype and suppressed the formation of cytoplasmic inclusions, whereas reduction of endogenous MRJ by a classical loss of function allele enhanced it. Moreover, wild-type, but not disease-associated, mutant forms of MRJ interacted with RBPs after heat shock and prevented their accumulation in aggregates. These results indicate both genetic and physical interactions between disease-linked RBPs and DNAJB6/mrj, suggesting etiologic overlap between the pathogenesis of hIBM and LGMD initiated by mutations in hnRNPA2B1 and DNAJB6.


Subject(s)
Contracture/congenital , Drosophila melanogaster/genetics , HSP40 Heat-Shock Proteins/genetics , Heterogeneous-Nuclear Ribonucleoprotein Group A-B/genetics , Molecular Chaperones/genetics , Muscular Dystrophies, Limb-Girdle/genetics , Myositis, Inclusion Body/congenital , Nerve Tissue Proteins/genetics , Ophthalmoplegia/genetics , Adult , Age of Onset , Amino Acid Sequence , Animals , Contracture/genetics , Contracture/metabolism , Contracture/pathology , DNA-Binding Proteins/genetics , DNA-Binding Proteins/metabolism , Disease Models, Animal , Drosophila Proteins/genetics , Drosophila Proteins/metabolism , Drosophila melanogaster/metabolism , Gene Expression Regulation , HSP40 Heat-Shock Proteins/metabolism , Heterogeneous-Nuclear Ribonucleoprotein Group A-B/metabolism , Heterogeneous-Nuclear Ribonucleoproteins/genetics , Heterogeneous-Nuclear Ribonucleoproteins/metabolism , Humans , Molecular Chaperones/metabolism , Molecular Sequence Data , Muscles/metabolism , Muscles/pathology , Muscular Dystrophies, Limb-Girdle/metabolism , Muscular Dystrophies, Limb-Girdle/pathology , Mutation , Myositis, Inclusion Body/genetics , Myositis, Inclusion Body/metabolism , Myositis, Inclusion Body/pathology , Nerve Tissue Proteins/metabolism , Ophthalmoplegia/metabolism , Ophthalmoplegia/pathology , Phenotype , Protein Binding , RNA-Binding Proteins/genetics , RNA-Binding Proteins/metabolism , Sequence Alignment , Sequence Homology, Amino Acid , Signal Transduction
17.
PLoS Comput Biol ; 13(4): e1005465, 2017 04.
Article in English | MEDLINE | ID: mdl-28394888

ABSTRACT

Many prion-forming proteins contain glutamine/asparagine (Q/N) rich domains, and there are conflicting opinions as to the role of primary sequence in their conversion to the prion form: is this phenomenon driven primarily by amino acid composition, or, as a recent computational analysis suggested, dependent on the presence of short sequence elements with high amyloid-forming potential. The argument for the importance of short sequence elements hinged on the relatively-high accuracy obtained using a method that utilizes a collection of length-six sequence elements with known amyloid-forming potential. We weigh in on this question and demonstrate that when those sequence elements are permuted, even higher accuracy is obtained; we also propose a novel multiple-instance machine learning method that uses sequence composition alone, and achieves better accuracy than all existing prion prediction approaches. While we expect there to be elements of primary sequence that affect the process, our experiments suggest that sequence composition alone is sufficient for predicting protein sequences that are likely to form prions. A web-server for the proposed method is available at http://faculty.pieas.edu.pk/fayyaz/prank.html, and the code for reproducing our experiments is available at http://doi.org/10.5281/zenodo.167136.


Subject(s)
Amino Acid Sequence , Asparagine/chemistry , Computational Biology/methods , Glutamine/chemistry , Machine Learning , Prions/chemistry , Amyloid/chemistry , Humans , Prions/metabolism , Yeasts
18.
Proc Natl Acad Sci U S A ; 112(28): 8584-9, 2015 Jul 14.
Article in English | MEDLINE | ID: mdl-26100899

ABSTRACT

Yeasts contain various protein-based genetic elements, termed prions, that result from the structural conversion of proteins into self-propagating amyloid forms. Most yeast prion proteins contain glutamine/asparagine (Q/N)-rich prion domains that drive prion activity. Here, we explore two mechanisms by which new prion domains could evolve. First, it has been proposed that mutation and natural selection will tend to result in proteins with aggregation propensities just low enough to function under physiological conditions and thus that a small number of mutations are often sufficient to cause aggregation. We hypothesized that if the ability to form prion aggregates was a sufficiently generic feature of Q/N-rich domains, many nonprion Q/N-rich domains might similarly have aggregation propensities on the edge of prion formation. Indeed, we tested four yeast Q/N-rich domains that had no detectable aggregation activity; in each case, a small number of rationally designed mutations were sufficient to cause the proteins to aggregate and, for two of the domains, to create prion activity. Second, oligopeptide repeats are found in multiple prion proteins, and expansion of these repeats increases prion activity. However, it is unclear whether the effects of repeat expansion are unique to these specific sequences or are a generic result of adding additional aggregation-prone segments into a protein domain. We found that within nonprion Q/N-rich domains, repeating aggregation-prone segments in tandem was sufficient to create prion activity. Duplication of DNA elements is a common source of genetic variation and may provide a simple mechanism to rapidly evolve prion activity.


Subject(s)
Gene Duplication , Mutation , Prions/genetics , Amino Acid Sequence , Molecular Sequence Data , Prions/chemistry , Sequence Homology, Amino Acid , Yeasts
19.
Cell Mol Life Sci ; 71(11): 2047-63, 2014 Jun.
Article in English | MEDLINE | ID: mdl-24390581

ABSTRACT

Prions are self-propagating infectious protein isoforms. A growing number of prions have been identified in yeast, each resulting from the conversion of soluble proteins into an insoluble amyloid form. These yeast prions have served as a powerful model system for studying the causes and consequences of prion aggregation. Remarkably, a number of human proteins containing prion-like domains, defined as domains with compositional similarity to yeast prion domains, have recently been linked to various human degenerative diseases, including amyotrophic lateral sclerosis. This suggests that the lessons learned from yeast prions may help in understanding these human diseases. In this review, we examine what has been learned about the amino acid sequence basis for prion aggregation in yeast, and how this information has been used to develop methods to predict aggregation propensity. We then discuss how this information is being applied to understand human disease, and the challenges involved in applying yeast prediction methods to higher organisms.


Subject(s)
Prions/chemistry , Saccharomyces cerevisiae Proteins/chemistry , Saccharomyces cerevisiae/chemistry , Amino Acid Sequence , Amyotrophic Lateral Sclerosis/genetics , Amyotrophic Lateral Sclerosis/metabolism , Amyotrophic Lateral Sclerosis/pathology , Gene Expression , Humans , Molecular Sequence Data , Prion Diseases/genetics , Prion Diseases/metabolism , Prion Diseases/pathology , Prions/genetics , Prions/metabolism , Protein Denaturation , Protein Structure, Secondary , Protein Structure, Tertiary , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae/metabolism , Saccharomyces cerevisiae Proteins/genetics , Saccharomyces cerevisiae Proteins/metabolism , Sequence Homology, Amino Acid
20.
Proc Natl Acad Sci U S A ; 109(17): 6519-24, 2012 Apr 24.
Article in English | MEDLINE | ID: mdl-22474356

ABSTRACT

Prions are important disease agents and epigenetic regulatory elements. Prion formation involves the structural conversion of proteins from a soluble form into an insoluble amyloid form. In many cases, this structural conversion is driven by a glutamine/asparagine (Q/N)-rich prion-forming domain. However, our understanding of the sequence requirements for prion formation and propagation by Q/N-rich domains has been insufficient for accurate prion propensity prediction or prion domain design. By focusing exclusively on amino acid composition, we have developed a prion aggregation prediction algorithm (PAPA), specifically designed to predict prion propensity of Q/N-rich proteins. Here, we show not only that this algorithm is far more effective than traditional amyloid prediction algorithms at predicting prion propensity of Q/N-rich proteins, but remarkably, also that PAPA is capable of rationally designing protein domains that function as prions in vivo.


Subject(s)
Prions/chemistry , Algorithms , Amino Acid Sequence , Molecular Sequence Data , Sequence Homology, Amino Acid , Solubility
SELECTION OF CITATIONS
SEARCH DETAIL