RESUMO
Metal ions can be both essential components of cells as well as potential toxins if present in excess. Organisms utilize a variety of protein systems to maintain the concentration of metal ions within the appropriate range for cellular function, and to avoid concentrations where cellular damage can occur. In bacteria, numerous proteins contribute to copper homeostasis, including copper transporters, chelators, and redox enzymes. The genes that encode these proteins are often found in clusters, thus providing modular components that work together to achieve homeostasis. A better understanding of how these components function and cooperate to achieve metal ion resistance is needed, given the extensive use of metal ions, including copper, as broad-spectrum biocides in a variety of clinical and environmental settings. The copG gene is a common component of such copper resistance clusters, but its contribution to copper resistance is not well understood. In this review the available information about the CopG protein encoded by this gene is summarized. Comparison of the recent structure to diverse copper-containing metallochaperones, metalloenzymes, and electron transfer proteins suggests that CopG is a redox enzyme that uses multiple copper ions as active site redox cofactors to act on additional copper ion substrates. Mechanisms for both oxidase and reductase activity are proposed, and the biological advantages that these activities can contribute in conjunction with existing systems are described.
Assuntos
Cobre , Metaloproteínas , Cobre/metabolismo , Oxirredução , Metais/química , Transporte de Elétrons , Metaloproteínas/metabolismoRESUMO
Lyme borreliosis is the most common vector-borne disease in the Northern Hemisphere, caused by spirochetes belonging to the Borrelia burgdorferi sensu lato species complex, which are transmitted by ixodid ticks. B. burgdorferi sensu lato species produce a family of proteins on the linear plasmid 54 (PFam54), some of which confer the functions of cell adhesion and inactivation of complement, the first line of host defense. However, the impact of PFam54 in promoting B. burgdorferi sensu lato pathogenesis remains unclear because of the hurdles to simultaneously knock out all PFam54 proteins in a spirochete. Here, we describe two Borrelia bavariensis strains, PBN and PNi, isolated from patients naturally lacking PFam54 but maintaining the rest of the genome with greater than 95% identity to the reference B. bavariensis strain, PBi. We found that PBN and PNi less efficiently survive in human serum than PBi. Such defects were restored by introducing two B. bavariensis PFam54 recombinant proteins, BGA66 and BGA71, confirming the role of these proteins in providing complement evasion of B. bavariensis. Further, we found that all three strains remain detectable in various murine tissues 21 days post-subcutaneous infection, supporting the nonessential role of B. bavariensis PFam54 in promoting spirochete persistence. This study identified and utilized isolates deficient in PFam54 to associate the defects with the absence of these proteins, building the foundation to further study the role of each PFam54 protein in contributing to B. burgdorferi sensu lato pathogenesis. IMPORTANCE To establish infections, Lyme borreliae utilize various means to overcome the host's immune system. Proteins encoded by the PFam54 gene array play a role in spirochete survival in vitro and in vivo. Moreover, this gene array has been described in all currently available Lyme borreliae genomes. By investigating the first two Borrelia bavariensis isolates naturally lacking the entire PFam54 gene array, we showed that both patient isolates display an increased susceptibility to human serum, which can be rescued in the presence of two PFam54 recombinant proteins. However, both isolates remain infectious to mice after intradermal inoculation, suggesting the nonessential role of PFam54 during the long-term, but may differ slightly in the colonization of specific tissues. Furthermore, these isolates show high genomic similarity to type strain PBi (>95%) and could be used in future studies investigating the role of each PFam54 protein in Lyme borreliosis pathogenesis.
Assuntos
Grupo Borrelia Burgdorferi , Borrelia , Ixodes , Doença de Lyme , Animais , Borrelia/genética , Grupo Borrelia Burgdorferi/genética , Humanos , Camundongos , Plasmídeos , SpirochaetalesRESUMO
BACKGROUND: The identification of protein families is of outstanding practical importance for in silico protein annotation and is at the basis of several bioinformatic resources. Pfam is possibly the most well known protein family database, built in many years of work by domain experts with extensive use of manual curation. This approach is generally very accurate, but it is quite time consuming and it may suffer from a bias generated from the hand-curation itself, which is often guided by the available experimental evidence. RESULTS: We introduce a procedure that aims to identify automatically putative protein families. The procedure is based on Density Peak Clustering and uses as input only local pairwise alignments between protein sequences. In the experiment we present here, we ran the algorithm on about 4000 full-length proteins with at least one domain classified by Pfam as belonging to the Pseudouridine synthase and Archaeosine transglycosylase (PUA) clan. We obtained 71 automatically-generated sequence clusters with at least 100 members. While our clusters were largely consistent with the Pfam classification, showing good overlap with either single or multi-domain Pfam family architectures, we also observed some inconsistencies. The latter were inspected using structural and sequence based evidence, which suggested that the automatic classification captured evolutionary signals reflecting non-trivial features of protein family architectures. Based on this analysis we identified a putative novel pre-PUA domain as well as alternative boundaries for a few PUA or PUA-associated families. As a first indication that our approach was unlikely to be clan-specific, we performed the same analysis on the P53 clan, obtaining comparable results. CONCLUSIONS: The clustering procedure described in this work takes advantage of the information contained in a large set of pairwise alignments and successfully identifies a set of putative families and family architectures in an unsupervised manner. Comparison with the Pfam classification highlights significant overlap and points to interesting differences, suggesting that our new algorithm could have potential in applications related to automatic protein classification. Testing this hypothesis, however, will require further experiments on large and diverse sequence datasets.
Assuntos
Proteínas , Alinhamento de Sequência , Sequência de Aminoácidos , Análise por Conglomerados , Bases de Dados de Proteínas , Humanos , Proteínas/genéticaRESUMO
BACKGROUND: Fibrillar adhesins are long multidomain proteins that form filamentous structures at the cell surface of bacteria. They are an important yet understudied class of proteins composed of adhesive and stalk domains that mediate interactions of bacteria with their environment. This study aims to characterize fibrillar adhesins in a wide range of bacterial phyla and to identify new fibrillar adhesin-like proteins to improve our understanding of host-bacteria interactions. RESULTS: Through careful literature and computational searches, we identified 82 stalk and 27 adhesive domain families in fibrillar adhesins. Based on the presence of these domains in the UniProt Reference Proteomes database, we identified and analysed 3,542 fibrillar adhesin-like proteins across species of the most common bacterial phyla. We further enumerate the adhesive and stalk domain combinations found in nature and demonstrate that fibrillar adhesins have complex and variable domain architectures, which differ across species. By analysing the domain architecture of fibrillar adhesins, we show that in Gram positive bacteria, adhesive domains are mostly positioned at the N-terminus and cell surface anchors at the C-terminus of the protein, while their positions are more variable in Gram negative bacteria. We provide an open repository of fibrillar adhesin-like proteins and domains to enable further studies of this class of bacterial surface proteins. CONCLUSION: This study provides a domain-based characterization of fibrillar adhesins and demonstrates that they are widely found in species across the main bacterial phyla. We have discovered numerous novel fibrillar adhesins and improved our understanding of pathogenic adhesion and invasion mechanisms.
Assuntos
Adesinas Bacterianas , Proteínas de Bactérias , Adesinas Bacterianas/genética , Bactérias/genética , Aderência Bacteriana , Proteínas de Bactérias/genética , Bactérias Gram-Positivas , Proteínas de MembranaRESUMO
Toxoplasmosis is a parasitic disease caused by infection with Toxoplasma gondii that currently has few therapeutic options. The M1 aminopeptidase enzymes have been shown to be attractive targets for anti-parasitic agents and/or vaccine candidates, suggesting potential to re-purpose inhibitors between parasite M1 aminopeptidase targets. The M1 aminopeptidase TgAPN2 has been suggested to be a potential new drug target for toxoplasmosis. Here we investigate the structure and function of TgAPN2, a homologue of the antimalarial drug target PfA-M1, and evaluate the capacity to use inhibitors that target PfA-M1 against TgAPN2. The results show that despite a similar overall fold, the TgAPN2 has a unique substrate specificity and inhibition profile. Sequence and structure differences are investigated and show how comparative structure-activity relationships may provide a route to obtaining potent inhibitors of TgAPN2.
Assuntos
Aminopeptidases/química , Proteínas de Protozoários/química , Toxoplasma/enzimologia , Cristalografia por Raios XRESUMO
The ASR protein family has been discovered thirty years ago in many plant species and is involved in the tolerance of various abiotic stresses such as dehydration, salinity and heat. Despite its importance, nothing is known about the conserved ABA-Water Deficit Stress Domain (ABA-WDS) of the ASR gene family. In this study, we characterized two ABA-WDS domains, isolated from durum wheat (TtABA-WDS) and barley (HvABA-WDS). Bioinformatics analysis shows that they are both consistently predicted to be intrinsically disordered. Hydrodynamic and circular dichroism analysis indicate that both domains are largely disordered but belong to different structural classes, with HvABA-WDS and TtABA-WDS adopting a PreMolten Globule-like (PMG-like) and a Random Coil-like (RC-like) conformation, respectively. In the presence of the secondary structure stabilizer trifluoroethanol (TFE) or of increasing glycerol concentrations, which mimics dehydration, the two domains acquire an α-helical structure. Interestingly, both domains are able to prevent heat- and dehydration-induced inactivation of the enzyme lactate dehydrogenase (LDH). Furthermore, heterologous expression of TtABA-WDS and HvABA-WDS in the yeast Saccharomyces cerevisiae improves its tolerance to salt, heat and cold stresses. Taken together our results converge to show that the ABA-WDS domain is an intrinsically disordered functional domain whose conformational plasticity could be instrumental to support the versatile functions attributed to the ASR family, including its role in abiotic stress tolerance. Finally, and after validation in the plant system, this domain could be used to improve crop tolerance to abiotic stresses.
Assuntos
Ácido Abscísico/metabolismo , Regulação da Expressão Gênica de Plantas , Hordeum , Proteínas de Plantas , Triticum , Desidratação/genética , Desidratação/metabolismo , Hordeum/genética , Hordeum/metabolismo , Proteínas de Plantas/biossíntese , Proteínas de Plantas/genética , Triticum/genética , Triticum/metabolismoRESUMO
The search for novel chemical classes of anti-malarial compounds to cope with the current state of chemoresistance of malaria parasites has led to the identification of Plasmodium falciparum aminopeptidase 1 (PfA-M1) as a new therapeutic target. PfA-M1, known to be involved in the hemoglobin digestion cascade which helps to provide most of the amino acids necessary to the parasite's metabolism, is currently considered as a promising target for anti-malarial chemotherapy. However, its immunogenic properties have not yet been tested in the Gabonese population. In Gabon, the prevalence of malaria remains three times higher in semi-urban areas (60·12%) than in urban areas (17·06%). We show that malaria-specific PfA-M1 antibodies are present in children and increase with the level of infection. Children living in semi-urban areas have higher anti-PfA-M1 antibody titers (0·14 ± 0·02 AU) than those living in urban areas (0·08 ± 0·02 AU, P = 0·03), and their antibody titers increase with age (P < 0·0001). Moreover, anti-PfA-M1 antibody titers decrease in children with hyperparasitemia (0·027 ± 0·055 AU) but they remain high in children with low parasite density (0·21 ± 0·034 AU, P = 0·034). In conclusion, our results suggest that malaria-specific PfA-M1 antibodies may play an important role in the immune response of the host against P. falciparum in Gabonese children. Further studies on the role of PfA-M1 during anemia are needed.
Assuntos
Aminopeptidases/imunologia , Anticorpos Antiprotozoários/imunologia , Antígenos de Protozoários/imunologia , Imunoglobulina G/imunologia , Malária Falciparum/imunologia , Plasmodium falciparum/imunologia , Proteínas de Protozoários/imunologia , Adolescente , Aminopeptidases/sangue , Anticorpos Antiprotozoários/sangue , Antígenos de Protozoários/sangue , Criança , Pré-Escolar , Feminino , Gabão , Humanos , Imunoglobulina G/sangue , Lactente , Recém-Nascido , Malária Falciparum/sangue , Masculino , Plasmodium falciparum/enzimologia , Proteínas de Protozoários/sangueRESUMO
Improved sequencing technologies and the maturation of metagenomic approaches allow the identification of gene variants with potential industrial applications, including cellulases. Cellulase identification from metagenomic environmental surveys is complicated by inconsistent nomenclature and multiple categorization systems. Here, we summarize the current classification and nomenclature systems, with recommendations for improvements to these systems. Addressing the issues described will strengthen the annotation of cellulose-active enzymes from environmental sequence data sets-a rapidly growing resource in environmental and applied microbiology.
Assuntos
Bactérias/isolamento & purificação , Celulase/análise , Microbiologia Ambiental , Metagenoma , Metagenômica/métodosRESUMO
Aminobenzosuberone-based PfA-M1 inhibitors were explored as novel antimalarial agents against two different Plasmodium falciparum strains. The 4-phenyl derivative 7c exhibited the most encouraging growth inhibitory activity with IC50 values of 6.5-11.2 µM. X-ray crystal structures and early assessment of DMPK/ADME-Tox parameters allowed us to initiate structure-based drug design approach and understand the liabilities (such as potential metabolic and aqueous solubility issues) as well as identify the opportunities for improvement of this aminobenzosuberone series. It also suggested that compound 7c should be regarded as an attractive chemical tool to investigate the different biological roles of this multifunctional PfA-M1 protein.
Assuntos
Aminopeptidases/antagonistas & inibidores , Anisóis/farmacologia , Antimaláricos/farmacologia , Cicloeptanos/farmacologia , Inibidores Enzimáticos/farmacologia , Plasmodium falciparum/efeitos dos fármacos , Aminopeptidases/metabolismo , Anisóis/síntese química , Anisóis/química , Antimaláricos/síntese química , Antimaláricos/química , Cicloeptanos/síntese química , Cicloeptanos/química , Relação Dose-Resposta a Droga , Inibidores Enzimáticos/síntese química , Inibidores Enzimáticos/química , Estrutura Molecular , Testes de Sensibilidade Parasitária , Plasmodium falciparum/enzimologia , Relação Estrutura-AtividadeRESUMO
The growing availability of human genetic variation has given rise to novel methods of measuring genetic tolerance that better interpret variants of unknown significance. We recently developed a concept based on protein domain homology in the human genome to improve variant interpretation. For this purpose, we mapped population variation from the Exome Aggregation Consortium (ExAC) and pathogenic mutations from the Human Gene Mutation Database (HGMD) onto Pfam protein domains. The aggregation of these variation data across homologous domains into meta-domains allowed us to generate amino acid resolution of genetic intolerance profiles for human protein domains. Here, we developed MetaDome, a fast and easy-to-use web server that visualizes meta-domain information and gene-wide profiles of genetic tolerance. We updated the underlying data of MetaDome to contain information from 56,319 human transcripts, 71,419 protein domains, 12,164,292 genetic variants from gnomAD, and 34,076 pathogenic mutations from ClinVar. MetaDome allows researchers to easily investigate their variants of interest for the presence or absence of variation at corresponding positions within homologous domains. We illustrate the added value of MetaDome by an example that highlights how it may help in the interpretation of variants of unknown significance. The MetaDome web server is freely accessible at https://stuart.radboudumc.nl/metadome.
Assuntos
Biologia Computacional/métodos , Variação Genética , Proteínas/química , Proteínas/genética , Bases de Dados Genéticas , Predisposição Genética para Doença , Genoma Humano , Humanos , Internet , Domínios Proteicos , Software , Homologia Estrutural de ProteínaRESUMO
BACKGROUND: Proteins may have none, single, double, or multiple domains, while a single domain may appear in multiple proteins. Their distribution patterns may have impacts on bacterial physi-ology and lifestyle.Objective: This study aims to understand how domains are distributed and duplicated in bacterial prote-omes, in order to better understand bacterial physiology and lifestyles. METHODS: In this study, we used 16712 Hidden Markov Models to screen 944 bacterial reference prote-omes versus a threshold E-value<0.001. The number of non-redundant domains and duplication rates of redundant domains for each species were calculated. The unique domains, if any, were also identified for each species. In addition, the properties of no-domain proteins were investigated in terms of physico-chemical properties. RESULTS: The increasing number of non-redundant domains for a bacterial proteome follows the trend of an asymptotic function. The domain duplication rate is positively correlated with proteome size and in-creases more rapidly. The high percentage of single-domain proteins is more associated with small pro-teome size. For each proteome, unique domains were also obtained. Moreover, no-domain proteins show differences with the other three groups for several physicochemical properties analysed in this study. CONCLUSION: The study confirmed that a low domain duplication rate and a high percentage of single-domain proteins are more likely to be associated with bacterial host-dependent or restricted niche-adapted lifestyle. In addition, the unique lifestyle and physiology were revealed based on the analysis of species-specific domains and core domain interactions or co-occurrences.
RESUMO
Modern sequencing technologies provide an unprecedented amount of data of single-nucleotide variations occurring in coding regions and leading to changes in the expressed protein sequences. A significant fraction of these single-residue variations is linked to disease onset and collected in public databases. In recent years, many scientific studies have been focusing on the dissection of salient features of disease-related variations from different perspectives. In this work, we complement previous analyses by updating a dataset of disease-related variations occurring in proteins with 3D structure. Within this dataset, we describe functional and structural features that can be of interest for characterizing disease-related variations, including major chemico-physical properties, the strength of association to disease of variation types, their effect on protein stability, their location on the protein structure, and their distribution in Pfam structural/functional protein models. Our results support previous findings obtained in different data sets and introduce Pfam models as possible fingerprints of patterns of disease related single-nucleotide variations.
Assuntos
Doença/genética , Proteínas Mutantes/química , Proteínas Mutantes/metabolismo , Mutação/genética , Bases de Dados de Proteínas , Humanos , Domínios Proteicos , SolventesRESUMO
Previously, we reported that infection of human macrophages with Mycobacterium tuberculosis (Mtb) results in massive alterations in the pattern of RNA splicing in the host. The finding gained significance since alternate spliced variants of a same gene may have substantially different structure, function, stability, interaction partners, localization, and so forth, owing to inclusion or exclusion of specific exons. To establish a proof-of-concept; on how infection-induced RNA splicing could impact protein functions, here we used RNA-seq data from THP-1 macrophages that were infected with clinical isolate of Mtb. In addition to re-establishing the fact that Mtb infection may cause strain specific alterations in RNA splicing, we also developed a new analysis pipeline resulting in characterization of domain maps of the transcriptome post-infection. For the sake of simplicity, we restricted our analysis to all the kinases in the human genome and considered only pfam classified protein domains and checked their frequency of inclusion or exclusion due to alternate splicing across the conditions and time points. We report massive alterations in the domain architecture of most regulated proteins across the entire kinases highlighting the physiological importance of such an understanding. This study paves way for more detailed analysis of different functional classes of proteins and perturbations to their domain architecture as a consequence of mycobacterial infections. Such analysis would yield unprecedented depth to our understanding of host-pathogen interaction and allow in a more systematic manner targeting of host pathways for controlling the infections. © 2018 The Authors. IUBMB Life published by Wiley Periodicals, Inc. on behalf of International Union of Biochemistry and Molecular Biology, 70(9):845-854, 2018.
Assuntos
Processamento Alternativo , Genoma Humano , Interações Hospedeiro-Patógeno/genética , Macrófagos/metabolismo , Mycobacterium tuberculosis/isolamento & purificação , Transcriptoma , Tuberculose/genética , Células Cultivadas , Perfilação da Expressão Gênica , Humanos , Macrófagos/microbiologia , Domínios Proteicos , Proteínas Quinases/genética , Proteínas Quinases/metabolismo , Tuberculose/microbiologiaRESUMO
BACKGROUND: Discriminating driver mutations from the ones that play no role in cancer is a severe bottleneck in elucidating molecular mechanisms underlying cancer development. Since protein domains are representatives of functional regions within proteins, mutations on them may disturb the protein functionality. Therefore, studying mutations at domain level may point researchers to more accurate assessment of the functional impact of the mutations. RESULTS: This article presents a comprehensive study to map mutations from 29 cancer types to both sequence- and structure-based domains. Statistical analysis was performed to identify candidate domains in which mutations occur with high statistical significance. For each cancer type, the corresponding type-specific domains were distinguished among all candidate domains. Subsequently, cancer type-specific domains facilitated the identification of specific proteins for each cancer type. Besides, performing interactome analysis on specific proteins of each cancer type showed high levels of interconnectivity among them, which implies their functional relationship. To evaluate the role of mitochondrial genes, stem cell-specific genes and DNA repair genes in cancer development, their mutation frequency was determined via further analysis. CONCLUSIONS: This study has provided researchers with a publicly available data repository for studying both CATH and Pfam domain regions on protein-coding genes. Moreover, the associations between different groups of genes/domains and various cancer types have been clarified. The work is available at http://www.cancerouspdomains.ir .
Assuntos
Neoplasias/genética , Proteínas/genética , Reparo do DNA/genética , Bases de Dados Genéticas , Humanos , Internet , Mitocôndrias/genética , Mutação , Neoplasias/metabolismo , Neoplasias/patologia , Células-Tronco Neoplásicas/metabolismo , Mapas de Interação de Proteínas/genética , Proteínas/metabolismo , Interface Usuário-ComputadorRESUMO
BACKGROUND: Many entries in the protein data bank (PDB) are annotated to show their component protein domains according to the Pfam classification, as well as their biological function through the enzyme commission (EC) numbering scheme. However, despite the fact that the biological activity of many proteins often arises from specific domain-domain and domain-ligand interactions, current on-line resources rarely provide a direct mapping from structure to function at the domain level. Since the PDB now contains many tens of thousands of protein chains, and since protein sequence databases can dwarf such numbers by orders of magnitude, there is a pressing need to develop automatic structure-function annotation tools which can operate at the domain level. RESULTS: This article presents ECDomainMiner, a novel content-based filtering approach to automatically infer associations between EC numbers and Pfam domains. ECDomainMiner finds a total of 20,728 non-redundant EC-Pfam associations with a F-measure of 0.95 with respect to a "Gold Standard" test set extracted from InterPro. Compared to the 1515 manually curated EC-Pfam associations in InterPro, ECDomainMiner infers a 13-fold increase in the number of EC-Pfam associations. CONCLUSION: These EC-Pfam associations could be used to annotate some 58,722 protein chains in the PDB which currently lack any EC annotation. The ECDomainMiner database is publicly available at http://ecdm.loria.fr/ .
Assuntos
Biologia Computacional/métodos , Mineração de Dados/métodos , Bases de Dados de Proteínas , Enzimas , Proteínas , Enzimas/química , Enzimas/genética , Enzimas/metabolismo , Proteínas/química , Proteínas/genética , Proteínas/metabolismoRESUMO
Whole exomes of patients with a genetic disorder are nowadays routinely sequenced but interpretation of the identified genetic variants remains a major challenge. The increased availability of population-based human genetic variation has given rise to measures of genetic tolerance that have been used, for example, to predict disease-causing genes in neurodevelopmental disorders. Here, we investigated whether combining variant information from homologous protein domains can improve variant interpretation. For this purpose, we developed a framework that maps population variation and known pathogenic mutations onto 2,750 "meta-domains." These meta-domains consist of 30,853 homologous Pfam protein domain instances that cover 36% of all human protein coding sequences. We find that genetic tolerance is consistent across protein domain homologues, and that patterns of genetic tolerance faithfully mimic patterns of evolutionary conservation. Furthermore, for a significant fraction (68%) of the meta-domains high-frequency population variation re-occurs at the same positions across domain homologues more often than expected. In addition, we observe that the presence of pathogenic missense variants at an aligned homologous domain position is often paired with the absence of population variation and vice versa. The use of these meta-domains can improve the interpretation of genetic variation.
Assuntos
Testes Genéticos , Variação Genética , Genética Populacional , Domínios Proteicos/genética , Adaptação Biológica/genética , Mapeamento Cromossômico , Biologia Computacional/métodos , Sequência Conservada , Evolução Molecular , Exoma , Ontologia Genética , Genética Populacional/métodos , Genômica/métodos , Genótipo , Humanos , Sequenciamento do ExomaRESUMO
Protein evolution often occurs at unequal rates in different sites along an amino acid chain. Site-specific evolutionary rates have been linked to several structural and functional properties of proteins. Previous analyses of this phenomenon have involved relatively small datasets and, in some cases, the interaction among multiple structural factors is not evaluated. Here, we present the results of a large-scale phylogenetic and statistical analysis, testing the effects and interactions of three structural properties on amino acid replacement rates. We used sequence-based computational methods to predict (i) intrinsic disorder propensity, (ii) secondary structure, and (iii) functional domain involvement across millions of amino acid sites in thousands of sequence alignments of metazoan proteins. Our results somewhat corroborate earlier findings that intrinsically disordered sites tend to be more variable than ordered sites, but there is considerable overlap among their rate distributions, and a significant confounding interaction exists between intrinsic disorder and secondary structure. Notably, protein sites that are consistently predicted to be both intrinsically disordered and involved in secondary structures tend to be the most conserved at the amino acid level, suggesting that they are highly constrained and functionally important. In addition, a significant interaction exists between functional domain involvement and secondary structure. These findings suggest that multiple structural drivers of protein evolution should be evaluated simultaneously in order to get a clear picture of their individual effects as well as any confounding interactions among them.
Assuntos
Aminoácidos/genética , Evolução Molecular , Proteínas/genética , Sequência de Aminoácidos , Evolução Biológica , Análise por Conglomerados , Biologia Computacional/métodos , Sequência Conservada , Bases de Dados de Proteínas , Modelos Moleculares , Taxa de Mutação , Filogenia , Estrutura Secundária de Proteína , Alinhamento de Sequência , Relação Estrutura-AtividadeRESUMO
Transport systems comprise roughly 10% of all proteins in a cell, playing critical roles in many processes. Improving and expanding their classification is an important goal that can affect studies ranging from comparative genomics to potential drug target searches. It is not surprising that different classification systems for transport proteins have arisen, be it within a specialized database, focused on this functional class of proteins, or as part of a broader classification system for all proteins. Two such databases are the Transporter Classification Database (TCDB) and the Protein family (Pfam) database. As part of a long-term endeavor to improve consistency between the two classification systems, we have compared transporter annotations in the two databases to understand the rationale for differences and to improve both systems. Differences sometimes reflect the fact that one database has a particular transporter family while the other does not. Differing family definitions and hierarchical organizations were reconciled, resulting in recognition of 69 Pfam 'Domains of Unknown Function', which proved to be transport protein families to be renamed using TCDB annotations. Of over 400 potential new Pfam families identified from TCDB, 10% have already been added to Pfam, and TCDB has created 60 new entries based on Pfam data. This work, for the first time, reveals the benefits of comprehensive database comparisons and explains the differences between Pfam and TCDB.
Assuntos
Bases de Dados de Proteínas , Proteínas/químicaRESUMO
Malaria is a global human parasitic disease mainly caused by the protozoon Plasmodium falciparum. Increased parasite resistance to current drugs determines the relevance of finding new treatments against new targets. A novel target is the M1 alanyl-aminopeptidase from P. falciparum (PfA-M1), which is essential for parasite development in human erythrocytes and is inhibited by the pseudo-peptide bestatin. In this work, we used a combinatorial multicomponent approach to produce a library of peptidomimetics and screened it for the inhibition of recombinant PfA-M1 (rPfA-M1) and the in vitro growth of P. falciparum erythrocytic stages (3D7 and FcB1 strains). Dose-response studies with selected compounds allowed identifying the bestatin-based peptidomimetic KBE009 as a submicromolar rPfA-M1 inhibitor (Ki=0.4µM) and an in vitro antimalarial compound as potent as bestatin (IC50=18µM; without promoting erythrocyte lysis). At therapeutic-relevant concentrations, KBE009 is selective for rPfA-M1 over porcine APN (a model of these enzymes from mammals), and is not cytotoxic against HUVEC cells. Docking simulations indicate that this compound binds PfA-M1 without Zn2+ coordination, establishing mainly hydrophobic interactions and showing a remarkable shape complementarity with the active site of the enzyme. Moreover, KBE009 inhibits the M1-type aminopeptidase activity (Ala-7-amido-4-methylcoumarin substrate) in isolated live parasites with a potency similar to that of the antimalarial activity (IC50=82µM), strongly suggesting that the antimalarial effect is directly related to the inhibition of the endogenous PfA-M1. These results support the value of this multicomponent strategy to identify PfA-M1 inhibitors, and make KBE009 a promising hit for drug development against malaria.
Assuntos
Antimaláricos/química , Antígenos CD13/antagonistas & inibidores , Dipeptídeos/química , Plasmodium falciparum/enzimologia , Proteínas de Protozoários/antagonistas & inibidores , Antimaláricos/síntese química , Antimaláricos/farmacologia , Sítios de Ligação , Antígenos CD13/genética , Antígenos CD13/metabolismo , Domínio Catalítico , Sobrevivência Celular/efeitos dos fármacos , Dipeptídeos/síntese química , Dipeptídeos/farmacologia , Eritrócitos/citologia , Eritrócitos/efeitos dos fármacos , Eritrócitos/parasitologia , Células Endoteliais da Veia Umbilical Humana , Humanos , Leucina/análogos & derivados , Leucina/química , Leucina/farmacologia , Simulação de Acoplamento Molecular , Peptidomiméticos , Plasmodium falciparum/efeitos dos fármacos , Proteínas de Protozoários/metabolismo , Proteínas Recombinantes/biossíntese , Proteínas Recombinantes/química , Proteínas Recombinantes/isolamento & purificação , Relação Estrutura-AtividadeRESUMO
The need for sustainable insect pest control is driving the investigation and discovery of insecticidal proteins outside of the typical 3-domain Cry protein family from Bacillus thuringiensis (Bt). Examples include Cry35 and Cry51 that belong to protein families (Toxin_10, ETX_MTX2) sharing a common ß-pore forming structure and function with known mammalian toxins such as epsilon toxin (ETX). Although ß-pore forming proteins are related to mammalian toxins, there are key differences in sequence and structure that lead to organism specificity that is useful in the weight-of-evidence approach for safety assessment. Despite low overall amino acid sequence identity among ETX_MTX2 proteins, sequence and structural similarities are found in the tail region responsible for the shared oligomerization and pore formation functions (causing the "relatedness"). Conversely, most of the sequence and structural diversity is located in the head region that is likely responsible for differential receptor binding and target species specificity (e.g., insecticidal vs. mammalian). Therefore, inclusion of a domain-based protein characterization approach that includes bioinformatic and functional comparisons of conserved and diverse domains will enhance the overall weight of evidence safety assessment of proteins including recently reported Cry51 protein variants (Cry51Aa1, Cry51Aa2, and Cry51Aa2.834_16).