RESUMO
Gene expression in human tissue has primarily been studied on the transcriptional level, largely neglecting translational regulation. Here, we analyze the translatomes of 80 human hearts to identify new translation events and quantify the effect of translational regulation. We show extensive translational control of cardiac gene expression, which is orchestrated in a process-specific manner. Translation downstream of predicted disease-causing protein-truncating variants appears to be frequent, suggesting inefficient translation termination. We identify hundreds of previously undetected microproteins, expressed from lncRNAs and circRNAs, for which we validate the protein products in vivo. The translation of microproteins is not restricted to the heart and prominent in the translatomes of human kidney and liver. We associate these microproteins with diverse cellular processes and compartments and find that many locate to the mitochondria. Importantly, dozens of microproteins are translated from lncRNAs with well-characterized noncoding functions, indicating previously unrecognized biology.
Assuntos
Miocárdio/metabolismo , Biossíntese de Proteínas , Adolescente , Adulto , Idoso , Animais , Códon/genética , Feminino , Regulação da Expressão Gênica , Células HEK293 , Humanos , Lactente , Masculino , Camundongos , Camundongos Endogâmicos C57BL , Pessoa de Meia-Idade , Fases de Leitura Aberta/genética , RNA Circular/genética , RNA Circular/metabolismo , RNA Longo não Codificante/genética , RNA Longo não Codificante/metabolismo , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Ratos , Ribossomos/genética , Ribossomos/metabolismo , Adulto JovemRESUMO
Cardiovascular disease is the leading cause of death worldwide. Advanced insights into disease mechanisms and therapeutic strategies require a deeper understanding of the molecular processes involved in the healthy heart. Knowledge of the full repertoire of cardiac cells and their gene expression profiles is a fundamental first step in this endeavour. Here, using state-of-the-art analyses of large-scale single-cell and single-nucleus transcriptomes, we characterize six anatomical adult heart regions. Our results highlight the cellular heterogeneity of cardiomyocytes, pericytes and fibroblasts, and reveal distinct atrial and ventricular subsets of cells with diverse developmental origins and specialized properties. We define the complexity of the cardiac vasculature and its changes along the arterio-venous axis. In the immune compartment, we identify cardiac-resident macrophages with inflammatory and protective transcriptional signatures. Furthermore, analyses of cell-to-cell interactions highlight different networks of macrophages, fibroblasts and cardiomyocytes between atria and ventricles that are distinct from those of skeletal muscle. Our human cardiac cell atlas improves our understanding of the human heart and provides a valuable reference for future studies.
Assuntos
Miocárdio/citologia , Análise de Célula Única , Transcriptoma , Adipócitos/classificação , Adipócitos/metabolismo , Adulto , Enzima de Conversão de Angiotensina 2/análise , Enzima de Conversão de Angiotensina 2/genética , Enzima de Conversão de Angiotensina 2/metabolismo , Células Epiteliais/classificação , Células Epiteliais/metabolismo , Epitélio , Feminino , Fibroblastos/classificação , Fibroblastos/metabolismo , Perfilação da Expressão Gênica , Estudo de Associação Genômica Ampla , Átrios do Coração/anatomia & histologia , Átrios do Coração/citologia , Átrios do Coração/inervação , Ventrículos do Coração/anatomia & histologia , Ventrículos do Coração/citologia , Ventrículos do Coração/inervação , Homeostase/imunologia , Humanos , Macrófagos/imunologia , Macrófagos/metabolismo , Masculino , Músculo Esquelético/citologia , Músculo Esquelético/metabolismo , Miócitos Cardíacos/classificação , Miócitos Cardíacos/metabolismo , Neurônios/classificação , Neurônios/metabolismo , Pericitos/classificação , Pericitos/metabolismo , Receptores de Coronavírus/análise , Receptores de Coronavírus/genética , Receptores de Coronavírus/metabolismo , SARS-CoV-2/metabolismo , SARS-CoV-2/patogenicidade , Células Estromais/classificação , Células Estromais/metabolismoRESUMO
High-throughput genomic sequencing has focused attention on understanding differences between species and between individuals. When this genetic variation affects protein sequences, the rate of amino acid substitution reflects both Darwinian selection for functionally advantageous mutations and selectively neutral evolution operating within the constraints of structure and function. During neutral evolution, whereby mutations accumulate by random drift, amino acid substitutions are constrained by factors such as the formation of intramolecular and intermolecular interactions and the accessibility to water or lipids surrounding the protein. These constraints arise from the need to conserve a specific architecture and to retain interactions that mediate functions in protein families and superfamilies.
Assuntos
Evolução Molecular , Proteínas/química , Proteínas/metabolismo , Sequência de Aminoácidos , Substituição de Aminoácidos , Ligação de Hidrogênio , Interações Hidrofóbicas e Hidrofílicas , Cinética , Modelos Moleculares , Dados de Sequência Molecular , Ligação Proteica , Dobramento de Proteína , Estrutura Secundária de Proteína , Homologia de Sequência de AminoácidosRESUMO
Microorganisms form surface-attached communities, termed biofilms, which can serve as protection against host immune reactions or antibiotics. Bacillus subtilis biofilms contain TasA as major proteinaceous component in addition to exopolysaccharides. In stark contrast to the initially unfolded biofilm proteins of other bacteria, TasA is a soluble, stably folded monomer, whose structure we have determined by X-ray crystallography. Subsequently, we characterized in vitro different oligomeric forms of TasA by NMR, EM, X-ray diffraction, and analytical ultracentrifugation (AUC) experiments. However, by magic-angle spinning (MAS) NMR on live biofilms, a swift structural change toward only one of these forms, consisting of homogeneous and protease-resistant, ß-sheet-rich fibrils, was observed in vivo. Thereby, we characterize a structural change from a globular state to a fibrillar form in a functional prokaryotic system on the molecular level.
Assuntos
Bacillus subtilis/fisiologia , Proteínas de Bactérias/química , Biofilmes/crescimento & desenvolvimento , Bacillus subtilis/química , Proteínas de Bactérias/metabolismo , Calorimetria , Cristalografia por Raios X , Concentração de Íons de Hidrogênio , Espectroscopia de Ressonância Magnética , Metaloendopeptidases/química , Microscopia Eletrônica , Modelos Moleculares , Peso Molecular , Conformação Proteica , Homologia Estrutural de Proteína , UltracentrifugaçãoRESUMO
G-protein-coupled receptors (GPCRs) can constitute complexes with non-GPCR integral membrane proteins, while such interaction has not been demonstrated at a single molecule level so far. We here investigated the potential interaction between the thyrotropin receptor (TSHR) and the monocarboxylate transporter 8 (MCT8), a member of the major facilitator superfamily (MFS), using fluorescence cross-correlation spectroscopy (FCCS). Both the proteins are expressed endogenously on the basolateral plasma membrane of the thyrocytes and are involved in stimulation of thyroid hormone production and release. Indeed, we demonstrate strong interaction between both the proteins which causes a suppressed activation of Gq/11 by TSH-stimulated TSHR. Thus, we provide not only evidence for a novel interaction between the TSHR and MCT8, but could also prove this interaction on a single molecule level. Moreover, this interaction forces biased signaling at the TSHR. These results are of general interest for both the GPCR and the MFS research fields.
Assuntos
Transportadores de Ácidos Monocarboxílicos/metabolismo , Mapas de Interação de Proteínas , Receptores da Tireotropina/metabolismo , Animais , Células COS , Chlorocebus aethiops , Expressão Gênica , Células HEK293 , Humanos , Transportadores de Ácidos Monocarboxílicos/análise , Transportadores de Ácidos Monocarboxílicos/genética , Multimerização Proteica , Receptores da Tireotropina/análise , Receptores da Tireotropina/genética , Transdução de Sinais , Simportadores , Glândula Tireoide/metabolismo , Glândula Tireoide/patologiaRESUMO
G-protein coupled receptors (GPCRs) are key players in signal transduction and therefore a large proportion of pharmaceutical drugs target these receptors. Structural data of GPCRs are sparse yet important for elucidating the molecular basis of GPCR-related diseases and for performing structure-based drug design. To ameliorate this problem, GPCR-SSFE 2.0 (http://www.ssfa-7tmr.de/ssfe2/), an intuitive web server dedicated to providing three-dimensional Class A GPCR homology models has been developed. The updated web server includes 27 inactive template structures and incorporates various new functionalities. Uniquely, it uses a fingerprint correlation scoring strategy for identifying the optimal templates, which we demonstrate captures structural features that sequence similarity alone is unable to do. Template selection is carried out separately for each helix, allowing both single-template models and fragment-based models to be built. Additionally, GPCR-SSFE 2.0 stores a comprehensive set of pre-calculated and downloadable homology models and also incorporates interactive loop modeling using the tool SL2, allowing knowledge-based input by the user to guide the selection process. For visual analysis, the NGL viewer is embedded into the result pages. Finally, blind-testing using two recently published structures shows that GPCR-SSFE 2.0 performs comparably or better than other state-of-the art GPCR modeling web servers.
Assuntos
Modelos Moleculares , Receptores Acoplados a Proteínas G/química , Software , Animais , Humanos , Internet , Camundongos , Ratos , Alinhamento de Sequência , Análise de Sequência de Proteína , Homologia Estrutural de ProteínaRESUMO
The type I ATP-binding cassette (ABC) importer for positively charged amino acids of the thermophilic bacterium Geobacillus stearothermophilus consists of the extracellular solute binding protein, ArtJ, and a homodimer each of the transmembrane subunit, ArtM, and the nucleotide-binding and -hydrolyzing subunit, ArtP. We have investigated the functional consequences of mutations affecting conserved residues from two peptide regions in ArtM, recently proposed to form a 'gate' by which access of a substrate to the translocation path is controlled (Hollenstein et al., 2007 [14]). Transporter variants were reconstituted into proteoliposomes and assayed for ArtJ/arginine-stimulated ATPase activity. Replacement of residues from region 1 (Arg-63, Pro-66) caused no or only moderate reduction in ATPase activity. In contrast, mutating residues from gate region 2 (Lys-159, Leu-163) resulted in a substantial increase in ATPase activity which, however, as demonstrated for variants ArtM(K159I) and ArtM(K159E), is not coupled to transport. Replacing homologous residues in the closely related histidine transporter of Salmonella enterica serovar Typhimurium (HisJ-QMP2) caused different phenotypes. Mutation to isoleucine of HisQ(K163) or HisM(H172), both homologous to ArtM(K159), abolished ATPase activity. The mutations most likely caused a structural change as revealed by limited proteolysis. In contrast, substantial, albeit reduced, enzymatic activity was observed with variants of HisQ(L167âG) or HisM(L176âG), both homologous to ArtM(L163). Our study provides the first experimental evidence in favor of a crucial role of residues from the proposed gate region in type I ABC importer function.
Assuntos
Transportadores de Cassetes de Ligação de ATP/química , Proteínas de Bactérias/química , Geobacillus stearothermophilus/química , Subunidades Proteicas/química , Proteolipídeos/química , Transportadores de Cassetes de Ligação de ATP/genética , Transportadores de Cassetes de Ligação de ATP/metabolismo , Sequência de Aminoácidos , Substituição de Aminoácidos , Aminoácidos/química , Aminoácidos/genética , Proteínas de Bactérias/genética , Proteínas de Bactérias/metabolismo , Geobacillus stearothermophilus/genética , Geobacillus stearothermophilus/metabolismo , Modelos Moleculares , Dados de Sequência Molecular , Mutação , Subunidades Proteicas/genética , Subunidades Proteicas/metabolismo , Proteolipídeos/metabolismo , Proteínas Recombinantes/química , Proteínas Recombinantes/genética , Proteínas Recombinantes/metabolismo , Salmonella typhimurium/química , Salmonella typhimurium/genética , Homologia de Sequência de Aminoácidos , Relação Estrutura-AtividadeRESUMO
Periodontitis affects billions of people worldwide. To address relationships of periodontal niche cell types and microbes in periodontitis, we generated an integrated single-cell RNA sequencing (scRNAseq) atlas of human periodontium (34-sample, 105918-cell), including sulcular and junctional keratinocytes (SK/JKs). SK/JKs displayed altered differentiation states and were enriched for effector cytokines in periodontitis. Single-cell metagenomics revealed 37 bacterial species with cell-specific tropism. Fluorescence in situ hybridization detected intracellular 16 S and mRNA signals of multiple species and correlated with SK/JK proinflammatory phenotypes in situ. Cell-cell communication analysis predicted keratinocyte-specific innate and adaptive immune interactions. Highly multiplexed immunofluorescence (33-antibody) revealed peri-epithelial immune foci, with innate cells often spatially constrained around JKs. Spatial phenotyping revealed immunosuppressed JK-microniches and SK-localized tertiary lymphoid structures in periodontitis. Here, we demonstrate impacts on and predicted interactomics of SK and JK cells in health and periodontitis, which requires further investigation to support precision periodontal interventions in states of chronic inflammation.
Assuntos
Comunicação Celular , Queratinócitos , Periodontite , Análise de Célula Única , Humanos , Queratinócitos/metabolismo , Queratinócitos/imunologia , Periodontite/microbiologia , Periodontite/metabolismo , Periodontite/imunologia , Periodontite/patologia , Citocinas/metabolismo , Periodonto/microbiologia , Periodonto/metabolismo , Periodonto/patologia , Imunidade Inata , Hibridização in Situ Fluorescente , Masculino , Metagenômica/métodos , Bactérias/metabolismo , Bactérias/genética , Feminino , Adulto , Imunidade AdaptativaRESUMO
The sheer volume of non-synonymous single nucleotide polymorphisms that have been generated in recent years from projects such as the Human Genome Project, the HapMap Project and Genome-Wide Association Studies means that it is not possible to characterize all mutations experimentally on the gene products, i.e. elucidate the effects of mutations on protein structure and function. However, automatic methods that can predict the effects of mutations will allow a reduced set of mutations to be studied. Site Directed Mutator (SDM) is a statistical potential energy function that uses environment-specific amino-acid substitution frequencies within homologous protein families to calculate a stability score, which is analogous to the free energy difference between the wild-type and mutant protein. Here, we present a web server for SDM (http://www-cryst.bioc.cam.ac.uk/~sdm/sdm.php), which has obtained more than 10,000 submissions since being online in April 2008. To run SDM, users must upload a wild-type structure and the position and amino acid type of the mutation. The results returned include information about the local structural environment of the wild-type and mutant residues, a stability score prediction and prediction of disease association. Additionally, the wild-type and mutant structures are displayed in a Jmol applet with the relevant residues highlighted.
Assuntos
Substituição de Aminoácidos , Mutação , Conformação Proteica , Estabilidade Proteica , Software , Doença/genética , Humanos , Internet , Proteínas/genética , TermodinâmicaRESUMO
A vast number of sweet tasting molecules are known, encompassing small compounds, carbohydrates, d-amino acids and large proteins. Carbohydrates play a particularly big role in human diet. The replacement of sugars in food with artificial sweeteners is common and is a general approach to prevent cavities, obesity and associated diseases such as diabetes and hyperlipidemia. Knowledge about the molecular basis of taste may reveal new strategies to overcome diet-induced diseases. In this context, the design of safe, low-calorie sweeteners is particularly important. Here, we provide a comprehensive collection of carbohydrates, artificial sweeteners and other sweet tasting agents like proteins and peptides. Additionally, structural information and properties such as number of calories, therapeutic annotations and a sweetness-index are stored in SuperSweet. Currently, the database consists of more than 8000 sweet molecules. Moreover, the database provides a modeled 3D structure of the sweet taste receptor and binding poses of the small sweet molecules. These binding poses provide hints for the design of new sweeteners. A user-friendly graphical interface allows similarity searching, visualization of docked sweeteners into the receptor etc. A sweetener classification tree and browsing features allow quick requests to be made to the database. The database is freely available at: http://bioinformatics.charite.de/sweet/.
Assuntos
Bases de Dados Factuais , Receptores Acoplados a Proteínas G/química , Edulcorantes/química , Aminoácidos/química , Sítios de Ligação , Carboidratos/química , Proteínas/química , Homologia Estrutural de ProteínaRESUMO
Consideration of biomolecules in terms of their molecular building blocks provides valuable new information regarding their synthesis, degradation and similarity. Here, we present the FragmentStore, a resource for the comparison of fragments found in metabolites, drugs or toxic compounds. Starting from 13,000 metabolites, 16,000 drugs and 2200 toxic compounds we generated 35,000 different building blocks (fragments), which are not only relevant to their biosynthesis and degradation but also provide important information regarding side-effects and toxicity. The FragmentStore provides a variety of search options such as 2D structure, molecular weight, rotatable bonds, etc. Various analysis tools have been implemented including the calculation of amino acid preferences of fragments' binding sites, classification of fragments based on the enzyme classification class of the enzyme(s) they bind to and small molecule library generation via a fragment-assembler tool. Using the FragmentStore, it is now possible to identify the common fragments of different classes of molecules and generate hypotheses about the effects of such intersections. For instance, the co-occurrence of fragments in different drugs may indicate similar targets and possible off-target interactions whereas the co-occurrence of fragments in a drug and a toxic compound/metabolite could be indicative of side-effects. The database is publicly available at: http://bioinformatics.charite.de/fragment_store.
Assuntos
Bases de Dados Factuais , Desenho de Fármacos , Preparações Farmacêuticas/química , Sítios de Ligação , Efeitos Colaterais e Reações Adversas Relacionados a Medicamentos , Redes e Vias MetabólicasRESUMO
Transmembrane helices (TMHs) 5 and 6 are known to be important for signal transduction by G-protein-coupled receptors (GPCRs). Our aim was to characterize the interface between TMH5 and TMH6 of the thyrotropin receptor (TSHR) to gain molecular insights into aspects of signal transduction and regulation. A proline at TMH5 position 5.50 is highly conserved in family A GPCRs and causes a twist in the helix structure. Mutation of the TSHR-specific alanine (Ala-5935·5°) at this position to proline resulted in a 20-fold reduction of cell surface expression. This indicates that TMH5 in the TSHR might have a conformation different from most other family A GPCRs by forming a regular α-helix. Furthermore, linking our own and previous data from directed mutagenesis with structural information led to suggestions of distinct pairs of interacting residues between TMH5 and TMH6 that are responsible for stabilizing either the basal or the active state. Our insights suggest that the inactive state conformation is constrained by a core set of polar interactions among TMHs 2, 3, 6, and 7 and in contrast that the active state conformation is stabilized mainly by non-polar interactions between TMHs 5 and 6. Our findings might be relevant for all family A GPCRs as supported by a statistical analysis of residue properties between the TMHs of a vast number of GPCR sequences.
Assuntos
Membrana Celular/metabolismo , Receptores Acoplados a Proteínas G/química , Receptores Acoplados a Proteínas G/metabolismo , Receptores da Tireotropina/química , Receptores da Tireotropina/metabolismo , Transdução de Sinais , Animais , Sequência Conservada , AMP Cíclico/metabolismo , Células HEK293 , Humanos , Fosfatos de Inositol/metabolismo , Modelos Moleculares , Mutagênese Sítio-Dirigida , Mutação , Estrutura Secundária de Proteína , Receptores da Tireotropina/genéticaRESUMO
The G protein-coupled receptor 84 (GPR84) is found in immune cells and its expression is increased under inflammatory conditions. Activation of GPR84 by medium-chain fatty acids results in pro-inflammatory responses. Here, we screened available vertebrate genome data and found that GPR84 is present in vertebrates for more than 500 million years but absent in birds and a pseudogene in bats. Cloning and functional characterization of several mammalian GPR84 orthologs in combination with evolutionary and model-based structural analyses revealed evidence for positive selection of bear GPR84 orthologs. Naturally occurring human GPR84 variants are most frequent in Asian populations causing a loss of function. Further, we identified cis- and trans-2-decenoic acid, both known to mediate bacterial communication, as evolutionary highly conserved ligands. Our integrated set of approaches contributes to a comprehensive understanding of GPR84 in terms of evolutionary and structural aspects, highlighting GPR84 as a conserved immune cell receptor for bacteria-derived molecules.
RESUMO
Pathogenic variants in genes that cause dilated cardiomyopathy (DCM) and arrhythmogenic cardiomyopathy (ACM) convey high risks for the development of heart failure through unknown mechanisms. Using single-nucleus RNA sequencing, we characterized the transcriptome of 880,000 nuclei from 18 control and 61 failing, nonischemic human hearts with pathogenic variants in DCM and ACM genes or idiopathic disease. We performed genotype-stratified analyses of the ventricular cell lineages and transcriptional states. The resultant DCM and ACM ventricular cell atlas demonstrated distinct right and left ventricular responses, highlighting genotype-associated pathways, intercellular interactions, and differential gene expression at single-cell resolution. Together, these data illuminate both shared and distinct cellular and molecular architectures of human heart failure and suggest candidate therapeutic targets.
Assuntos
Displasia Arritmogênica Ventricular Direita , Cardiomiopatia Dilatada , Insuficiência Cardíaca , Análise de Célula Única , Transcriptoma , Displasia Arritmogênica Ventricular Direita/genética , Atlas como Assunto , Cardiomiopatia Dilatada/genética , Núcleo Celular/genética , Insuficiência Cardíaca/genética , Ventrículos do Coração , Humanos , RNA-SeqRESUMO
BACKGROUND: G protein-coupled receptors (GPCRs) transduce a wide variety of extracellular signals to within the cell and therefore have a key role in regulating cell activity and physiological function. GPCR malfunction is responsible for a wide range of diseases including cancer, diabetes and hyperthyroidism and a large proportion of drugs on the market target these receptors. The three dimensional structure of GPCRs is important for elucidating the molecular mechanisms underlying these diseases and for performing structure-based drug design. Although structural data are restricted to only a handful of GPCRs, homology models can be used as a proxy for those receptors not having crystal structures. However, many researchers working on GPCRs are not experienced homology modellers and are therefore unable to benefit from the information that can be gleaned from such three-dimensional models. Here, we present a comprehensive database called the GPCR-SSFE, which provides initial homology models of the transmembrane helices for a large variety of family A GPCRs. DESCRIPTION: Extending on our previous theoretical work, we have developed an automated pipeline for GPCR homology modelling and applied it to a large set of family A GPCR sequences. Our pipeline is a fragment-based approach that exploits available family A crystal structures. The GPCR-SSFE database stores the template predictions, sequence alignments, identified sequence and structure motifs and homology models for 5025 family A GPCRs. Users are able to browse the GPCR dataset according to their pharmacological classification or search for results using a UniProt entry name. It is also possible for a user to submit a GPCR sequence that is not contained in the database for analysis and homology model building. The models can be viewed using a Jmol applet and are also available for download along with the alignments. CONCLUSIONS: The data provided by GPCR-SSFE are useful for investigating general and detailed sequence-structure-function relationships of GPCRs, performing structure-based drug design and for better understanding the molecular mechanisms underlying disease-associated mutations in GPCRs. The effectiveness of our multiple template and fragment approach is demonstrated by the accuracy of our predicted homology models compared to recently published crystal structures.
Assuntos
Bases de Dados de Proteínas , Receptores Acoplados a Proteínas G/química , Homologia Estrutural de Proteína , Motivos de Aminoácidos , Animais , Humanos , Modelos Moleculares , Estrutura Secundária de ProteínaRESUMO
The thyrotropin receptor [thyroid-stimulating hormone receptor (TSHR)], a G-protein-coupled receptor (GPCR), is endogenously activated by thyrotropin, which binds to the extracellular region of the receptor. We previously identified a low-molecular-weight (LMW) agonist of the TSHR and predicted its allosteric binding pocket within the receptor's transmembrane domain. Because binding of the LMW agonist probably disrupts interactions or leads to formation of new interactions among amino acid residues surrounding the pocket, we tested whether mutation of residues at these positions would lead to constitutive signaling activity. Guided by molecular modeling, we performed site-directed mutagenesis of 24 amino acids in this spatial region, followed by functional characterization of the mutant receptors in terms of expression and signaling, measured as cAMP accumulation. We found that mutations V421I, Y466A, T501A, L587V, M637C, M637W, S641A, Y643F, L645V, and Y667A located in several helices exhibit constitutive activity. Of note is mutation M637W at position 6.48 in transmembrane helix 6, which has a significant effect on the interaction of the receptor with the LMW agonist. In summary, we found that a high proportion of residues in several helices surrounding the allosteric binding site of LMW ligands in the TSHR when mutated lead to constitutively active receptors. Our findings of signaling-sensitive residues in this region of the transmembrane bundle may be of general importance as this domain appears to be evolutionarily retained among GPCRs.
Assuntos
Sítio Alostérico/genética , Aminoácidos/metabolismo , Receptores da Tireotropina/metabolismo , AMP Cíclico/análise , Ligantes , Mutagênese Sítio-Dirigida , Mutação de Sentido Incorreto , Receptores da Tireotropina/química , Receptores da Tireotropina/genética , Transdução de SinaisRESUMO
The collection, description and molecular analysis of naturally occurring (pathogenic) mutations are important for understanding the functional mechanisms and malfunctions of biological units such as proteins. Numerous databases collate a huge amount of functional data or descriptions of mutations, but tools to analyse the molecular effects of genetic variations are as yet poorly provided. The goal of this work was therefore to develop a translational web-application that facilitates the interactive linkage of functional and structural data and which helps improve our understanding of the molecular basis of naturally occurring gain- or loss- of function mutations. Here we focus on the human glycoprotein hormone receptors (GPHRs), for which a huge number of mutations are known to cause diseases. We describe new options for interactive data analyses within three-dimensional structures, which enable the assignment of molecular relationships between structure and function. Strikingly, as the functional data are converted into relational percentage values, the system allows the comparison and classification of data from different GPHR subtypes and different experimental approaches. Our new application has been incorporated into a freely available database and website for the GPHRs (http://www.ssfa-gphr.de), but the principle development would also be applicable to other macromolecules.
Assuntos
Bases de Dados de Proteínas , Internet , Mutação , Receptores de Superfície Celular/genética , Interface Usuário-Computador , Biologia Computacional/métodos , Glicoproteínas/química , Glicoproteínas/genética , Humanos , Armazenamento e Recuperação da Informação/métodos , Modelos Moleculares , Conformação Proteica , Receptores de Superfície Celular/química , Rodopsina/genética , SoftwareRESUMO
BACKGROUND: The hydrogen bond patterns between mainchain atoms in protein structures not only give rise to regular secondary structures but also satisfy mainchain hydrogen bond potential. However, not all mainchain atoms can be satisfied through hydrogen bond interactions that arise in regular secondary structures; in some locations sidechain-to-mainchain hydrogen bonds are required to provide polar group satisfaction. Buried polar residues that are hydrogen-bonded to mainchain amide atoms tend to be highly conserved within protein families, confirming that mainchain architecture is a critical restraint on the evolution of proteins. We have investigated the stabilizing roles of buried polar sidechains on the backbones of protein structures by performing an analysis of solvent inaccessible residues that are entirely conserved within protein families and superfamilies and hydrogen bonded to an equivalent mainchain atom in each family member. RESULTS: We show that polar and sometimes charged sidechains form hydrogen bonds to mainchain atoms in the cores of proteins in a manner that has been conserved in evolution. Although particular motifs have previously been identified where buried polar residues have conserved roles in stabilizing protein structure, for example in helix capping, we demonstrate that such interactions occur in a range of architectures and highlight those polar amino acid types that fulfil these roles. We show that these buried polar residues often span elements of secondary structure and provide stabilizing interactions of the overall protein architecture. CONCLUSIONS: Conservation of buried polar residues and the hydrogen-bond interactions that they form implies an important role for maintaining protein structure, contributing strong restraints on amino acid substitutions during divergent protein evolution. Our analysis sheds light on the important stabilizing roles of these residues in protein architecture and provides further insight into factors influencing the evolution of protein families and superfamilies.
Assuntos
Evolução Molecular , Ligação de Hidrogênio , Estrutura Secundária de Proteína/genética , Motivos de Aminoácidos , Sequência Conservada/genética , Análise de Sequência de ProteínaRESUMO
Although polar amino acids tend to be found on the surface of proteins due to their hydrophilic nature, their important roles within the core of proteins are now becoming better recognized. It has long been understood that a significant number of mainchain functions will not achieve hydrogen bond satisfaction through the formation of secondary structures; in these circumstances, it is generally buried polar residues that provide hydrogen bond satisfaction. Here, we describe an analysis of the hydrogen-bonding of polar amino acids in a set of structurally aligned protein families. This allows us not only to calculate the conservation of each polar residue but also to assess whether conservation is correlated with the hydrogen-bonding potential of polar sidechains. We show that those polar sidechains whose hydrogen-bonding potential is satisfied tend to be more conserved than their unsatisfied or nonhydrogen-bonded counterparts, particularly when buried. Interestingly, these buried and satisfied polar residues are significantly more conserved than buried hydrophobic residues. Forming hydrogen bonds to mainchain amide atoms also influences conservation, with those satisfied buried polar residues that form two hydrogen bonds to mainchain amides being significantly more conserved than those that form only one or none. These results indicate that buried polar residues whose hydrogen-bonding potential is satisfied are likely to have important roles in maintaining protein structure.
Assuntos
Aminoácidos/química , Proteínas/química , Sequência de Aminoácidos , Aminoácidos/metabolismo , Sequência Conservada , Entropia , Ligação de Hidrogênio , Interações Hidrofóbicas e Hidrofílicas , Conformação Proteica , Estabilidade Proteica , Estrutura Secundária de Proteína , Proteínas/metabolismoRESUMO
Mutations in the VHL gene lead to von Hippel-Lindau (VHL) disease, a clinically heterogeneous cancer syndrome. Here, we use software and database tools to understand and predict the phenotypes associated with missense mutations in the VHL gene product, pVHL. The protein product pVHL is known to interact with elongin B, elongin C, and the HIF substrate. By analyzing known and predicted interaction sites and predictions of thermodynamic stability change upon mutation, we generate new hypotheses regarding the molecular etiology of renal cell carcinoma (RCC) and pheochromocytoma (PCC) in VHL disease. We find that the molecular causes of RCC and PCC appear to be decoupled. RCC may arise through two distinct mechanisms: disruption of HIF interactions or binding at the elongin B interface. PCC is triggered by mutations which disrupt interactions at the elongin C binding site. These findings have important implications for VHL disease and for nonfamilial RCC, because most cases of clear cell RCC are linked with VHL inactivation. Additionally, predicting effects of genetic variation will be critical as genetic sequencing accelerates; the analytical strategy presented here may elucidate other systems as further data on genetic variation become available.