RESUMEN
Since its public release in 2021, AlphaFold2 (AF2) has made investigating biological questions, by using predicted protein structures of single monomers or full complexes, a common practice. ColabFold-AF2 is an open-source Jupyter Notebook inside Google Colaboratory and a command-line tool that makes it easy to use AF2 while exposing its advanced options. ColabFold-AF2 shortens turnaround times of experiments because of its optimized usage of AF2's models. In this protocol, we guide the reader through ColabFold best practices by using three scenarios: (i) monomer prediction, (ii) complex prediction and (iii) conformation sampling. The first two scenarios cover classic static structure prediction and are demonstrated on the human glycosylphosphatidylinositol transamidase protein. The third scenario demonstrates an alternative use case of the AF2 models by predicting two conformations of the human alanine serine transporter 2. Users can run the protocol without computational expertise via Google Colaboratory or in a command-line environment for advanced users. Using Google Colaboratory, it takes <2 h to run each procedure. The data and code for this protocol are available at https://protocol.colabfold.com .
RESUMEN
Strigolactones (SLs) are plant apocarotenoids with diverse roles and structures. Canonical SLs, widespread and characterized by structural variations in their tricyclic lactone (ABC-ring), are classified into two types based on C-ring configurations. The steric C-ring configuration emerges during the BC-ring closure, downstream of the biosynthetic intermediate, carlactonoic acid (CLA). Most plants produce either type of canonical SLs stereoselectively, e.g., tomato (Solanum lycopersicum) yields orobanchol with an α-oriented C-ring. The mechanisms driving SL structural diversification are partially understood, with limited insight into functional implications. Furthermore, the exact molecular mechanism for the stereoselective BC-ring closure reaction is yet to be known. We identified an enzyme, the stereoselective BC-ring-forming factor (SRF), from the dirigent protein (DIR) family, specifically the DIR-f subfamily, whose biochemical function had not been characterized, making it a key enzyme in stereoselective canonical SL biosynthesis with the α-oriented C-ring. We first confirm the precise catalytic function of the tomato cytochrome P450 SlCYP722C, previously shown to be involved in orobanchol biosynthesis [T. Wakabayashi et al., Sci. Adv. 5, eaax9067 (2019)], to convert CLA to 18-oxocarlactonoic acid. We then show that SRF catalyzes the stereoselective BC-ring closure reaction of 18-oxocarlactonoic acid, forming orobanchol. Our methodology combines experimental and computational techniques, including SRF structure prediction and conducting molecular dynamics simulations, suggesting a catalytic mechanism based on the conrotatory 4π-electrocyclic reaction for the stereoselective BC-ring formation in orobanchol. This study sheds light on the molecular basis of how plants produce SLs with specific stereochemistry in a controlled manner.
Asunto(s)
Lactonas , Lactonas/metabolismo , Lactonas/química , Estereoisomerismo , Solanum lycopersicum , Proteínas de Plantas/metabolismo , Proteínas de Plantas/química , Reguladores del Crecimiento de las Plantas/química , Reguladores del Crecimiento de las Plantas/metabolismoRESUMEN
In the field of medical chemistry and other organic chemistry, introducing a methyl group into a designed position has been difficult to achieve. However, owing to the vigorous developments in the field of enzymology, methyltransferases are considered potential tools for addressing this problem. Within the methyltransferase family, Fur6 catalyzes the methylation of C3 of 1,2,4,5,7-pentahydroxynaphthalene (PHN) using S-adenosyl-l-methionine (SAM) as the methyl donor. Here, we report the catalytic mechanism and substrate specificity of Fur6 based on computational studies. Our molecular dynamics (MD) simulation studies reveal the reactive form of PHN and its interactions with the enzyme. Our hybrid quantum mechanics/molecular mechanics (QM/MM) calculations suggest the reaction pathway of the methyl transfer step in which the energy barrier is 8.6 kcal mol-1. Our free-energy calculations with a polarizable continuum model (PCM) indicate that the final deprotonation step of the methylated intermediate occurs after it is ejected into the water solvent from the active center pocket of Fur6. Additionally, our studies on the protonation states, the highest occupied molecular orbital (HOMOs), and the energy barriers of the methylation reaction for the analogs of PHN demonstrate the mechanism of the specificity to PHN. Our study provides valuable insights into Fur6 chemistry, contributing to a deeper understanding of molecular mechanisms and offering an opportunity to engineer the enzyme to achieve high yields of the desired product(s).
Asunto(s)
Metiltransferasas , Simulación de Dinámica Molecular , Metiltransferasas/metabolismo , Especificidad por Sustrato , Catálisis , Metilación , Teoría CuánticaRESUMEN
Plants employ self-incompatibility (SI) to promote cross-fertilization. In Brassicaceae, this process is regulated by the formation of a complex between the pistil determinant S receptor kinase (SRK) and the pollen determinant S-locus protein 11 (SP11, also known as S-locus cysteine-rich protein, SCR). In our previous study, we used the crystal structures of two eSRK-SP11 complexes in Brassica rapa S8 and S9 haplotypes and nine computationally predicted complex models to demonstrate that only the SRK ectodomain (eSRK) and SP11 pairs derived from the same S haplotype exhibit high binding free energy. However, predicting the eSRK-SP11 complex structures for the other 100 + S haplotypes and genera remains difficult because of SP11 polymorphism in sequence and structure. Although protein structure prediction using AlphaFold2 exhibits considerably high accuracy for most protein monomers and complexes, 46% of the predicted SP11 structures that we tested showed < 75 mean per-residue confidence score (pLDDT). Here, we demonstrate that the use of curated multiple sequence alignment (MSA) for cysteine-rich proteins significantly improved model accuracy for SP11 and eSRK-SP11 complexes. Additionally, we calculated the binding free energies of the predicted eSRK-SP11 complexes using molecular dynamics (MD) simulations and observed that some Arabidopsis haplotypes formed a binding mode that was critically different from that of B. rapa S8 and S9. Thus, our computational results provide insights into the haplotype-specific eSRK-SP11 binding modes in Brassicaceae at the residue level. The predicted models are freely available at Zenodo, https://doi.org/10.5281/zenodo.8047768.
RESUMEN
In this study, we propose Feedback-AVPGAN, a system that aims to computationally generate novel antiviral peptides (AVPs). This system relies on the key premise of the Generative Adversarial Network (GAN) model and the Feedback method. GAN, a generative modeling approach that uses deep learning methods, comprises a generator and a discriminator. The generator is used to generate peptides; the generated proteins are fed to the discriminator to distinguish between the AVPs and non-AVPs. The original GAN design uses actual data to train the discriminator. However, not many AVPs have been experimentally obtained. To solve this problem, we used the Feedback method to allow the discriminator to learn from the existing as well as generated synthetic data. We implemented this method using a classifier module that classifies each peptide sequence generated by the GAN generator as AVP or non-AVP. The classifier uses the transformer network and achieves high classification accuracy. This mechanism enables the efficient generation of peptides with a high probability of exhibiting antiviral activity. Using the Feedback method, we evaluated various algorithms and their performance. Moreover, we modeled the structure of the generated peptides using AlphaFold2 and determined the peptides having similar physicochemical properties and structures to those of known AVPs, although with different sequences.
Asunto(s)
Péptidos Antimicrobianos , Antivirales , Aprendizaje Profundo , Farmacología en Red , Algoritmos , Secuencia de Aminoácidos , Antivirales/química , Retroalimentación , Péptidos Antimicrobianos/química , Química ComputacionalRESUMEN
In this study, we developed a system that predicts the binding sites of proteins for five mononucleotides (AMP, ADP, ATP, GDP, and GTP). The system comprises two machine learning (ML)-based predictors using a convolutional neural network and a gradient boosting machine, two template-based predictors based on sequence and structure alignment, and a predictor that performs ensemble learning of these four predictors. In this study, data augmentation of ligand binding sites with similar ligand structures was performed. For example, in the prediction of ADP-binding sites using ML methods, the binding sites of AMP and ATP, which have similar structures, are considered. In addition, we constructed the structure models using AlphaFold2, a highly accurate protein prediction method. The secondary structure and dihedral angle information obtained using the model structures were used as ML predictor features. Additionally, in the template-based predictor, the structures of the binding sites were used as templates to be explored based on structure alignment to identify the binding site of the target. Consequently, the template-based predictor based on structure alignment showed the best performance among the four individual predictors, and the ensemble predictor achieved the best performance, with an area under the curve of 0.958 for all mononucleotides.
Asunto(s)
Aprendizaje Automático , Proteínas , Adenosina Difosfato/metabolismo , Adenosina Monofosfato/metabolismo , Adenosina Trifosfato , Sitios de Unión , Ligandos , Unión Proteica , Proteínas/químicaRESUMEN
The extracellular pH is a vital regulator of various biological processes in plants. However, how plants perceive extracellular pH remains obscure. Here, we report that plant cell-surface peptide-receptor complexes can function as extracellular pH sensors. We found that pattern-triggered immunity (PTI) dramatically alkalinizes the acidic extracellular pH in root apical meristem (RAM) region, which is essential for root meristem growth factor 1 (RGF1)-mediated RAM growth. The extracellular alkalinization progressively inhibits the acidic-dependent interaction between RGF1 and its receptors (RGFRs) through the pH sensor sulfotyrosine. Conversely, extracellular alkalinization promotes the alkaline-dependent binding of plant elicitor peptides (Peps) to its receptors (PEPRs) through the pH sensor Glu/Asp, thereby promoting immunity. A domain swap between RGFR and PEPR switches the pH dependency of RAM growth. Thus, our results reveal a mechanism of extracellular pH sensing by plant peptide-receptor complexes and provide insights into the extracellular pH-mediated regulation of growth and immunity in the RAM.
Asunto(s)
Proteínas de Arabidopsis , Arabidopsis , Arabidopsis/metabolismo , Proteínas de Arabidopsis/metabolismo , Concentración de Iones de Hidrógeno , Meristema/metabolismo , Péptidos/metabolismo , Células Vegetales , Raíces de Plantas/metabolismo , Plantas/metabolismo , Receptores de Superficie Celular/metabolismo , Transducción de SeñalRESUMEN
ColabFold offers accelerated prediction of protein structures and complexes by combining the fast homology search of MMseqs2 with AlphaFold2 or RoseTTAFold. ColabFold's 40-60-fold faster search and optimized model utilization enables prediction of close to 1,000 structures per day on a server with one graphics processing unit. Coupled with Google Colaboratory, ColabFold becomes a free and accessible platform for protein folding. ColabFold is open-source software available at https://github.com/sokrypton/ColabFold and its novel environmental databases are available at https://colabfold.mmseqs.com .
Asunto(s)
Pliegue de Proteína , Programas Informáticos , Computadores , Bases de Datos Factuales , ProteínasRESUMEN
Magnesium is an important nutrient for plants, but much is still unknown about plant Mg2+ transporters. Combining with the structural prediction of AlphaFold2, we used mutagenesis and 28Mg uptake assay to study the highly conserved "GMN" motif of Arabidopsis thaliana MRS2-1 (AtMRS2-1) transporter. We demonstrated that the glycine and methionine in GMN motif are essential for AtMRS2-1 to transport Mg2+.
Asunto(s)
Proteínas de Arabidopsis , Arabidopsis , Proteínas de Transporte de Catión , Arabidopsis/genética , Arabidopsis/metabolismo , Proteínas de Arabidopsis/metabolismo , Proteínas de Transporte de Catión/genética , Magnesio/metabolismo , MutagénesisRESUMEN
Prenyl pyrophosphate methyltransferases enhance the structural diversity of terpenoids. However, the molecular basis of their catalytic mechanisms is poorly understood. In this study, using multiple strategies, we characterized a geranyl pyrophosphate (GPP) C6-methyltransferase, BezA. Biochemical analysis revealed that BezA requires Mg2+ and solely methylates GPP. The crystal structures of BezA and its complex with S-adenosyl homocysteine were solved at 2.10 and 2.56â Å, respectively. Further analyses using site-directed mutagenesis, molecular docking, molecular dynamics simulations, and quantum mechanics/molecular mechanics calculations revealed the molecular basis of the methylation reaction. Importantly, the function of E170 as a catalytic base to complete the methylation reaction was established. We also succeeded in switching the substrate specificity by introducing a W210A substitution, resulting in an unprecedented farnesyl pyrophosphate C6-methyltransferase.
Asunto(s)
Metiltransferasas/metabolismo , Fosfatos de Poliisoprenilo/metabolismo , Sesquiterpenos/metabolismo , Biocatálisis , Cristalografía por Rayos X , Teoría Funcional de la Densidad , Metiltransferasas/química , Metiltransferasas/genética , Modelos Moleculares , Estructura Molecular , Fosfatos de Poliisoprenilo/química , Sesquiterpenos/química , Streptomyces/enzimología , Especificidad por SustratoRESUMEN
Antifreeze proteins (AFPs) are proteins that protect cellular fluids and body fluids from freezing by inhibiting the nucleation and growth of ice crystals and preventing ice recrystallization, thereby contributing to the maintenance of life in living organisms. They exist in fish, insects, microorganisms, and fungi. However, the number of known AFPs is currently limited, and it is essential to construct a reliable dataset of AFPs and develop a bioinformatics tool to predict AFPs. In this work, we first collected AFPs sequences from UniProtKB considering the reliability of annotations and, based on these datasets, developed a prediction system using random forest. We achieved accuracies of 0.961 and 0.947 for non-redundant sequences with less than 90% and 30% identities and achieved the accuracy of 0.953 for representative sequences for each species. Using the ability of random forest, we identified the sequence features that contributed to the prediction. Some sequence features were common to AFPs from different species. These features include the Cys content, Ala-Ala content, Trp-Gly content, and the amino acids' distribution related to the disorder propensity. The computer program and the dataset developed in this work are available from the GitHub site: https://github.com/ryomiya/Prediction-and-analysis-of-antifreeze-proteins.
RESUMEN
Cosolvent molecular dynamics (CMD) simulations involve an MD simulation of a protein in the presence of explicit water molecules mixed with cosolvent molecules to perform hotspot detection, binding site identification, and binding energy estimation, while other existing methods (e.g., MixMD, SILCS, and MDmix) utilize small molecules that represent functional groups of compounds. However, the cosolvent selections employed in these methods differ and there are only a few cosolvents that are commonly used in these methods. In this study, we proposed a systematic method for constructing a set of cosolvents for drug discovery, termed the EXtended PRObes set construction by REpresentative Retrieval (EXPRORER). First, we extracted typical substructures from FDA-approved drugs, generated 138 cosolvent structures, and for each cosolvent molecule, we conducted CMD simulations to generate a spatial probability distribution map of cosolvent atoms (PMAP). Analyses of PMAP similarity revealed that a cosolvent pair with a PMAP similarity greater than 0.70-0.75 shared similar structural features. We present a method for the construction of a cosolvent subset that satisfies a similarity threshold for all cosolvents, and we tested the constructed sets for four proteins. To our knowledge, this is the first study to include a systematic proposal for cosolvent set construction, and thus, the EXPRORER cosolvents will provide deeper insights into ligand binding sites of various proteins.
Asunto(s)
Simulación de Dinámica Molecular , Proteínas , Sitios de Unión , Ligandos , SolventesRESUMEN
Self-incompatibility (SI) is a breeding system that promotes cross-fertilization. In Brassica, pollen rejection is induced by a haplotype-specific interaction between pistil determinant SRK (S receptor kinase) and pollen determinant SP11 (S-locus Protein 11, also named SCR) from the S-locus. Although the structure of the B. rapa S9-SRK ectodomain (eSRK) and S9-SP11 complex has been determined, it remains unclear how SRK discriminates self- and nonself-SP11. Here, we uncover the detailed mechanism of self/nonself-discrimination in Brassica SI by determining the S8-eSRK-S8-SP11 crystal structure and performing molecular dynamics (MD) simulations. Comprehensive binding analysis of eSRK and SP11 structures reveals that the binding free energies are most stable for cognate eSRK-SP11 combinations. Residue-based contribution analysis suggests that the modes of eSRK-SP11 interactions differ between intra- and inter-subgroup (a group of phylogenetically neighboring haplotypes) combinations. Our data establish a model of self/nonself-discrimination in Brassica SI.
Asunto(s)
Brassica rapa/fisiología , Fitomejoramiento , Proteínas de Plantas/metabolismo , Proteínas Quinasas/metabolismo , Animales , Cristalografía , Flores/metabolismo , Haplotipos , Simulación de Dinámica Molecular , Proteínas de Plantas/genética , Proteínas de Plantas/ultraestructura , Polen/metabolismo , Unión Proteica/fisiología , Dominios Proteicos/fisiología , Proteínas Quinasas/genética , Proteínas Quinasas/ultraestructura , Proteínas Recombinantes/genética , Proteínas Recombinantes/metabolismo , Proteínas Recombinantes/ultraestructura , Células Sf9 , SpodopteraRESUMEN
Activation of corticotropin-releasing factor receptor 2ß (CRFR2ß) results in increased skeletal muscle mass and the prevention of muscle atrophy. Using a luciferase reporter assay, we screened 357 functional food factors that activate CRFR2ß and, subsequently, confirmed that nobiletin (NBT) increases CRFR2ß activity. Additionally, we found that NBT augments the activity of the endogenous peptide ligand urocortin 2 (Ucn2) in a concentration-dependent manner. Computational simulation of CRFR2ß confirmed that transmembrane domains (TMs) 1 and 2 are important for the synergistic activity of NBT and also identified important amino acids in these domains. Finally, we demonstrated that a co-administration of Ucn2 and NBT increases the hypertrophic signal in mouse skeletal muscle. These observations demonstrate that NBT can activate CRFR2ß and amplify the agonistic activity of Ucn2 and that such food-derived molecules have the potential to enhance endogenous G protein-coupled receptor ligand activities and contribute to the maintenance of skeletal muscle mass and function.
Asunto(s)
Flavonas/farmacología , Músculo Esquelético/efectos de los fármacos , Receptores de Hormona Liberadora de Corticotropina/agonistas , Regulación Alostérica/efectos de los fármacos , Animales , Línea Celular , Flavonas/análisis , Alimentos Funcionales/análisis , Células HEK293 , Humanos , Ratones , Ratones Endogámicos C57BL , Simulación de Dinámica Molecular , Músculo Esquelético/metabolismo , Dominios Proteicos/efectos de los fármacos , Receptores de Hormona Liberadora de Corticotropina/química , Receptores de Hormona Liberadora de Corticotropina/metabolismoRESUMEN
Potential inhibitors of a target biomolecule, NAD-dependent deacetylase Sirtuin 1, were identified by a contest-based approach, in which participants were asked to propose a prioritized list of 400 compounds from a designated compound library containing 2.5 million compounds using in silico methods and scoring. Our aim was to identify target enzyme inhibitors and to benchmark computer-aided drug discovery methods under the same experimental conditions. Collecting compound lists derived from various methods is advantageous for aggregating compounds with structurally diversified properties compared with the use of a single method. The inhibitory action on Sirtuin 1 of approximately half of the proposed compounds was experimentally accessed. Ultimately, seven structurally diverse compounds were identified.
RESUMEN
The ferric ion binding protein A of Thermus thermophilus HB8 (TtFbpA) is the periplasmic subunit of an ABC-type iron transporter. Two Fe3+-bound crystal structures at pH 5.5 and pH 7.5 and one apo structure have been reported for TtFbpA. In addition to three Tyr residues, TtFbpA coordinates with Fe3+ using two monodentate HCO3- and one H2O to form a six-coordinated mode at pH 5.5 or one bidentate CO32- to form a five-coordinated mode at pH 7.5. We investigated the biological significance of these Fe3+-bound forms of TtFbpA and the synergistic anions (HCO3- and CO32-). Quantum mechanical calculations in silico indicated that only these coordination modes were plausible out of six possibilities. Comparison of the crystal structures revealed a key motif, RZX1X2L(I/V), that would couple the Fe3+ coordination mode and the TtFbpA protein conformation. Both gel filtration chromatography and isothermal titration calorimetry showed that TtFbpA could bind Fe3+ at pH 7.5 but not at pH 5.5. Isothermal titration calorimetry also revealed that the binding at pH 7.5 was a three-step endothermic reaction that required NaHCO3. These results indicate that the holo structure at pH 5.5 is unstable in solution and may correspond to a transition state of TtFbpA-Fe3+ binding at pH 7.5 because HCO3- is much more abundant than CO32- at both pH values. Reorganisation of the synergistic ions and coupled protein conformational change will occur to form the stable TtFbpA-Fe3+ complex at pH 7.5, but not at pH 5.5. Identification of such a transition state will contribute to molecular design of novel FbpA inhibitors.
Asunto(s)
Transportadoras de Casetes de Unión a ATP/metabolismo , Proteínas Bacterianas/metabolismo , Hierro/metabolismo , Periplasma/metabolismo , Thermus thermophilus/metabolismo , Transportadoras de Casetes de Unión a ATP/química , Proteínas Bacterianas/química , Sitios de Unión , Cristalografía por Rayos X , Concentración de Iones de Hidrógeno , Hierro/química , Modelos Moleculares , Unión Proteica , Conformación ProteicaRESUMEN
p-Hydroxybenzoate hydroxylase (PHBH) is a flavoprotein monooxygenase that catalyzes the hydroxylation of p-hydroxybenzoate (p-OHB) to 3,4-dihydroxybenzoate (3,4-DOHB). PHBH can bind to other benzoate derivatives in addition to p-OHB; however, hydroxylation does not occur on 3,4-DOHB. Replacement of Tyr385 with Phe forms a mutant, which enables the production of 3,4,5-trihydroxybenzonate (gallic acid) from 3,4-DOHB, although the catalytic activity of the mutant is quite low. In this study, we report how the L199V/Y385F double mutant exhibits activity for producing gallic acid 4.3-fold higher than that of the Y385F single mutant. This improvement in catalytic activity is primarily due to the suppression of a shunt reaction that wastes reduced nicotinamide adenine dinucleotide phosphate by producing H2O2. To further elucidate the molecular mechanism underlying this higher catalytic activity, we performed molecular dynamics simulations and quantum mechanics/molecular mechanics calculations, in addition to determining the crystal structure of the Y385F·3,4-DOHB complex. The simulations showed that the Y385F mutation facilitates the deprotonation of the 4-hydroxy group of 3,4-DOHB, which is necessary for initiating hydroxylation. Moreover, the L199V mutation in addition to the Y385F mutation allows the OH moiety in the peroxide group of C-(4a)-flavin hydroperoxide to come into the proximity of the C5 atom of 3,4-DOHB. Overall, this study provides a consistent explanation for the change in the catalytic activity of PHBH caused by mutations, which will enable us to better design an enzyme with different activities.
Asunto(s)
4-Hidroxibenzoato-3-Monooxigenasa/metabolismo , Proteínas Bacterianas/metabolismo , Ácido Gálico/metabolismo , Pseudomonas aeruginosa/metabolismo , 4-Hidroxibenzoato-3-Monooxigenasa/química , 4-Hidroxibenzoato-3-Monooxigenasa/genética , Proteínas Bacterianas/química , Proteínas Bacterianas/genética , Cristalografía por Rayos X , Hidroxilación , Simulación de Dinámica Molecular , Mutación Puntual , Conformación Proteica , Pseudomonas aeruginosa/química , Pseudomonas aeruginosa/genética , TermodinámicaRESUMEN
Molecular recognition features (MoRFs) are key functional regions of intrinsically disordered proteins (IDPs), which play important roles in the molecular interaction network of cells and are implicated in many serious human diseases. Identifying MoRFs is essential for both functional studies of IDPs and drug design. This study adopts the cutting-edge machine learning method of artificial intelligence to develop a powerful model for improving MoRFs prediction. We proposed a method, named as en_DCNNMoRF (ensemble deep convolutional neural network-based MoRF predictor). It combines the outcomes of two independent deep convolutional neural network (DCNN) classifiers that take advantage of different features. The first, DCNNMoRF1, employs position-specific scoring matrix (PSSM) and 22 types of amino acid-related factors to describe protein sequences. The second, DCNNMoRF2, employs PSSM and 13 types of amino acid indexes to describe protein sequences. For both single classifiers, DCNN with a novel two-dimensional attention mechanism was adopted, and an average strategy was added to further process the output probabilities of each DCNN model. Finally, en_DCNNMoRF combined the two models by averaging their final scores. When compared with other well-known tools applied to the same datasets, the accuracy of the novel proposed method was comparable with that of state-of-the-art methods. The related web server can be accessed freely via http://vivace.bi.a.u-tokyo.ac.jp:8008/fang/en_MoRFs.php .
Asunto(s)
Proteínas Intrínsecamente Desordenadas/química , Redes Neurales de la Computación , Secuencia de Aminoácidos , Sitios de Unión , Biología Computacional/métodos , Bases de Datos de Proteínas/estadística & datos numéricos , Aprendizaje Profundo , Humanos , Proteínas Intrínsecamente Desordenadas/genética , Proteínas Intrínsecamente Desordenadas/metabolismo , Proteínas de la Membrana/química , Proteínas de la Membrana/genética , Proteínas de la Membrana/metabolismo , Unión ProteicaRESUMEN
Molecular recognition features (MoRFs) usually act as "hub" sites in the interaction networks of intrinsically disordered proteins (IDPs). Because an increasing number of serious diseases have been found to be associated with disordered proteins, identifying MoRFs has become increasingly important. In this study, we propose an ensemble learning strategy, named MoRFPred_en, to predict MoRFs from protein sequences. This approach combines four submodels that utilize different sequence-derived features for the prediction, including a multichannel one-dimensional convolutional neural network (CNN_1D multichannel) based model, two deep two-dimensional convolutional neural network (DCNN_2D) based models, and a support vector machine (SVM) based model. When compared with other methods on the same datasets, the MoRFPred_en approach produced better results than existing state-of-the-art MoRF prediction methods, achieving an AUC of 0.762 on the VALIDATION419 dataset, 0.795 on the TEST45 dataset, and 0.776 on the TEST49 dataset. Availability: http://vivace.bi.a.u-tokyo.ac.jp:8008/fang/MoRFPred_en.php.
Asunto(s)
Biología Computacional/métodos , Proteínas Intrínsecamente Desordenadas/química , Proteínas Intrínsecamente Desordenadas/metabolismo , Secuencia de Aminoácidos , Sitios de Unión , Bases de Datos de Proteínas , Aprendizaje Automático , Redes Neurales de la Computación , Máquina de Vectores de SoporteRESUMEN
Target-specific monoclonal antibodies can be routinely acquired, but the sequences of naturally acquired antibodies are not always affinity-matured and methods that increase antigen affinity are desirable. Most biophysical studies have focused on the complementary determining region (CDR), which directly contacts the antigen; however, it remains difficult to increase the affinity as much as desired. While strategies to alter the CDR to increase antibody affinity are abundant, those that target non-CDR regions are scarce. Here we describe a new method, designated fluctuation editing, which identifies potential mutation sites and engineers a high-affinity antibody based on conformational fluctuations observed by NMR relaxation dispersion. Our data show that relaxation dispersion detects important fluctuating residues that are not located in the CDR and that increase antigen-antibody affinity by point mutation. The affinity-increased mutants are shown to fluctuate less in their free form and to form a more packed structure in their antigen-bound form.