Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 78.466
Filtrar
1.
Mol Cell ; 84(9): 1802-1810.e4, 2024 May 02.
Artículo en Inglés | MEDLINE | ID: mdl-38701741

RESUMEN

Polyphosphate (polyP) is a chain of inorganic phosphate that is present in all domains of life and affects diverse cellular phenomena, ranging from blood clotting to cancer. A study by Azevedo et al. described a protein modification whereby polyP is attached to lysine residues within polyacidic serine and lysine (PASK) motifs via what the authors claimed to be covalent phosphoramidate bonding. This was based largely on the remarkable ability of the modification to survive extreme denaturing conditions. Our study demonstrates that lysine polyphosphorylation is non-covalent, based on its sensitivity to ionic strength and lysine protonation and absence of phosphoramidate bond formation, as analyzed via 31P NMR. Ionic interaction with lysine residues alone is sufficient for polyP modification, and we present a new list of non-PASK lysine repeat proteins that undergo polyP modification. This work clarifies the biochemistry of polyP-lysine modification, with important implications for both studying and modulating this phenomenon. This Matters Arising paper is in response to Azevedo et al. (2015), published in Molecular Cell. See also the Matters Arising Response by Azevedo et al. (2024), published in this issue.


Asunto(s)
Amidas , Lisina , Ácidos Fosfóricos , Polifosfatos , Lisina/metabolismo , Lisina/química , Polifosfatos/química , Polifosfatos/metabolismo , Fosforilación , Humanos , Procesamiento Proteico-Postraduccional , Proteínas/química , Proteínas/metabolismo , Proteínas/genética
2.
Sci Rep ; 14(1): 10475, 2024 05 07.
Artículo en Inglés | MEDLINE | ID: mdl-38714683

RESUMEN

To ensure that an external force can break the interaction between a protein and a ligand, the steered molecular dynamics simulation requires a harmonic restrained potential applied to the protein backbone. A usual practice is that all or a certain number of protein's heavy atoms or Cα atoms are fixed, being restrained by a small force. This present study reveals that while fixing both either all heavy atoms and or all Cα atoms is not a good approach, while fixing a too small number of few atoms sometimes cannot prevent the protein from rotating under the influence of the bulk water layer, and the pulled molecule may smack into the wall of the active site. We found that restraining the Cα atoms under certain conditions is more relevant. Thus, we would propose an alternative solution in which only the Cα atoms of the protein at a distance larger than 1.2 nm from the ligand are restrained. A more flexible, but not too flexible, protein will be expected to lead to a more natural release of the ligand.


Asunto(s)
Simulación de Dinámica Molecular , Unión Proteica , Proteínas , Ligandos , Proteínas/química , Proteínas/metabolismo , Conformación Proteica
3.
Protein Sci ; 33(6): e5001, 2024 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-38723111

RESUMEN

De novo protein design expands the protein universe by creating new sequences to accomplish tailor-made enzymes in the future. A promising topology to implement diverse enzyme functions is the ubiquitous TIM-barrel fold. Since the initial de novo design of an idealized four-fold symmetric TIM barrel, the family of de novo TIM barrels is expanding rapidly. Despite this and in contrast to natural TIM barrels, these novel proteins lack cavities and structural elements essential for the incorporation of binding sites or enzymatic functions. In this work, we diversified a de novo TIM barrel by extending multiple ßα-loops using constrained hallucination. Experimentally tested designs were found to be soluble upon expression in Escherichia coli and well-behaved. Biochemical characterization and crystal structures revealed successful extensions with defined α-helical structures. These diversified de novo TIM barrels provide a framework to explore a broad spectrum of functions based on the potential of natural TIM barrels.


Asunto(s)
Modelos Moleculares , Escherichia coli/genética , Escherichia coli/metabolismo , Cristalografía por Rayos X , Pliegue de Proteína , Ingeniería de Proteínas/métodos , Proteínas/química , Proteínas/metabolismo
4.
Brief Bioinform ; 25(3)2024 Mar 27.
Artículo en Inglés | MEDLINE | ID: mdl-38706316

RESUMEN

Protein-ligand interactions (PLIs) are essential for cellular activities and drug discovery. But due to the complexity and high cost of experimental methods, there is a great demand for computational approaches to recognize PLI patterns, such as protein-ligand docking. In recent years, more and more models based on machine learning have been developed to directly predict the root mean square deviation (RMSD) of a ligand docking pose with reference to its native binding pose. However, new scoring methods are pressingly needed in methodology for more accurate RMSD prediction. We present a new deep learning-based scoring method for RMSD prediction of protein-ligand docking poses based on a Graphormer method and Shell-like graph architecture, named GSScore. To recognize near-native conformations from a set of poses, GSScore takes atoms as nodes and then establishes the docking interface of protein-ligand into multiple bipartite graphs within different shell ranges. Benefiting from the Graphormer and Shell-like graph architecture, GSScore can effectively capture the subtle differences between energetically favorable near-native conformations and unfavorable non-native poses without extra information. GSScore was extensively evaluated on diverse test sets including a subset of PDBBind version 2019, CASF2016 as well as DUD-E, and obtained significant improvements over existing methods in terms of RMSE, $R$ (Pearson correlation coefficient), Spearman correlation coefficient and Docking power.


Asunto(s)
Simulación del Acoplamiento Molecular , Proteínas , Ligandos , Proteínas/química , Proteínas/metabolismo , Unión Proteica , Programas Informáticos , Algoritmos , Biología Computacional/métodos , Conformación Proteica , Bases de Datos de Proteínas , Aprendizaje Profundo
5.
BMC Bioinformatics ; 25(1): 174, 2024 May 02.
Artículo en Inglés | MEDLINE | ID: mdl-38698340

RESUMEN

BACKGROUND: In last two decades, the use of high-throughput sequencing technologies has accelerated the pace of discovery of proteins. However, due to the time and resource limitations of rigorous experimental functional characterization, the functions of a vast majority of them remain unknown. As a result, computational methods offering accurate, fast and large-scale assignment of functions to new and previously unannotated proteins are sought after. Leveraging the underlying associations between the multiplicity of features that describe proteins could reveal functional insights into the diverse roles of proteins and improve performance on the automatic function prediction task. RESULTS: We present GO-LTR, a multi-view multi-label prediction model that relies on a high-order tensor approximation of model weights combined with non-linear activation functions. The model is capable of learning high-order relationships between multiple input views representing the proteins and predicting high-dimensional multi-label output consisting of protein functional categories. We demonstrate the competitiveness of our method on various performance measures. Experiments show that GO-LTR learns polynomial combinations between different protein features, resulting in improved performance. Additional investigations establish GO-LTR's practical potential in assigning functions to proteins under diverse challenging scenarios: very low sequence similarity to previously observed sequences, rarely observed and highly specific terms in the gene ontology. IMPLEMENTATION: The code and data used for training GO-LTR is available at https://github.com/aalto-ics-kepaco/GO-LTR-prediction .


Asunto(s)
Biología Computacional , Proteínas , Proteínas/química , Proteínas/metabolismo , Biología Computacional/métodos , Bases de Datos de Proteínas , Algoritmos
6.
Brief Bioinform ; 25(3)2024 Mar 27.
Artículo en Inglés | MEDLINE | ID: mdl-38701416

RESUMEN

Predicting protein function is crucial for understanding biological life processes, preventing diseases and developing new drug targets. In recent years, methods based on sequence, structure and biological networks for protein function annotation have been extensively researched. Although obtaining a protein in three-dimensional structure through experimental or computational methods enhances the accuracy of function prediction, the sheer volume of proteins sequenced by high-throughput technologies presents a significant challenge. To address this issue, we introduce a deep neural network model DeepSS2GO (Secondary Structure to Gene Ontology). It is a predictor incorporating secondary structure features along with primary sequence and homology information. The algorithm expertly combines the speed of sequence-based information with the accuracy of structure-based features while streamlining the redundant data in primary sequences and bypassing the time-consuming challenges of tertiary structure analysis. The results show that the prediction performance surpasses state-of-the-art algorithms. It has the ability to predict key functions by effectively utilizing secondary structure information, rather than broadly predicting general Gene Ontology terms. Additionally, DeepSS2GO predicts five times faster than advanced algorithms, making it highly applicable to massive sequencing data. The source code and trained models are available at https://github.com/orca233/DeepSS2GO.


Asunto(s)
Algoritmos , Biología Computacional , Redes Neurales de la Computación , Estructura Secundaria de Proteína , Proteínas , Proteínas/química , Proteínas/metabolismo , Proteínas/genética , Biología Computacional/métodos , Bases de Datos de Proteínas , Ontología de Genes , Análisis de Secuencia de Proteína/métodos , Programas Informáticos
7.
Molecules ; 29(9)2024 Apr 23.
Artículo en Inglés | MEDLINE | ID: mdl-38731411

RESUMEN

Fullerenes, particularly C60, exhibit unique properties that make them promising candidates for various applications, including drug delivery and nanomedicine. However, their interactions with biomolecules, especially proteins, remain not fully understood. This study implements both explicit and implicit C60 models into the UNRES coarse-grained force field, enabling the investigation of fullerene-protein interactions without the need for restraints to stabilize protein structures. The UNRES force field offers computational efficiency, allowing for longer timescale simulations while maintaining accuracy. Five model proteins were studied: FK506 binding protein, HIV-1 protease, intestinal fatty acid binding protein, PCB-binding protein, and hen egg-white lysozyme. Molecular dynamics simulations were performed with and without C60 to assess protein stability and investigate the impact of fullerene interactions. Analysis of contact probabilities reveals distinct interaction patterns for each protein. FK506 binding protein (1FKF) shows specific binding sites, while intestinal fatty acid binding protein (1ICN) and uteroglobin (1UTR) exhibit more generalized interactions. The explicit C60 model shows good agreement with all-atom simulations in predicting protein flexibility, the position of C60 in the binding pocket, and the estimation of effective binding energies. The integration of explicit and implicit C60 models into the UNRES force field, coupled with recent advances in coarse-grained modeling and multiscale approaches, provides a powerful framework for investigating protein-nanoparticle interactions at biologically relevant scales without the need to use restraints stabilizing the protein, thus allowing for large conformational changes to occur. These computational tools, in synergy with experimental techniques, can aid in understanding the mechanisms and consequences of nanoparticle-biomolecule interactions, guiding the design of nanomaterials for biomedical applications.


Asunto(s)
Fulerenos , Simulación de Dinámica Molecular , Muramidasa , Unión Proteica , Fulerenos/química , Muramidasa/química , Muramidasa/metabolismo , Sitios de Unión , Proteínas de Unión a Tacrolimus/química , Proteínas de Unión a Tacrolimus/metabolismo , Proteínas de Unión a Ácidos Grasos/química , Proteínas de Unión a Ácidos Grasos/metabolismo , Proteínas/química , Proteínas/metabolismo , Proteasa del VIH
8.
Commun Biol ; 7(1): 529, 2024 May 04.
Artículo en Inglés | MEDLINE | ID: mdl-38704509

RESUMEN

Intra-organism biodiversity is thought to arise from epigenetic modification of constituent genes and post-translational modifications of translated proteins. Here, we show that post-transcriptional modifications, like RNA editing, may also contribute. RNA editing enzymes APOBEC3A and APOBEC3G catalyze the deamination of cytosine to uracil. RNAsee (RNA site editing evaluation) is a computational tool developed to predict the cytosines edited by these enzymes. We find that 4.5% of non-synonymous DNA single nucleotide polymorphisms that result in cytosine to uracil changes in RNA are probable sites for APOBEC3A/G RNA editing; the variant proteins created by such polymorphisms may also result from transient RNA editing. These polymorphisms are associated with over 20% of Medical Subject Headings across ten categories of disease, including nutritional and metabolic, neoplastic, cardiovascular, and nervous system diseases. Because RNA editing is transient and not organism-wide, future work is necessary to confirm the extent and effects of such editing in humans.


Asunto(s)
Desaminasas APOBEC , Citidina Desaminasa , Edición de ARN , Humanos , Citidina Desaminasa/metabolismo , Citidina Desaminasa/genética , Polimorfismo de Nucleótido Simple , Citosina/metabolismo , Desaminasa APOBEC-3G/metabolismo , Desaminasa APOBEC-3G/genética , Uracilo/metabolismo , Proteínas/genética , Proteínas/metabolismo , Citosina Desaminasa/genética , Citosina Desaminasa/metabolismo
9.
BMC Genomics ; 25(1): 406, 2024 May 09.
Artículo en Inglés | MEDLINE | ID: mdl-38724906

RESUMEN

Most proteins exert their functions by interacting with other proteins, making the identification of protein-protein interactions (PPI) crucial for understanding biological activities, pathological mechanisms, and clinical therapies. Developing effective and reliable computational methods for predicting PPI can significantly reduce the time-consuming and labor-intensive associated traditional biological experiments. However, accurately identifying the specific categories of protein-protein interactions and improving the prediction accuracy of the computational methods remain dual challenges. To tackle these challenges, we proposed a novel graph neural network method called GNNGL-PPI for multi-category prediction of PPI based on global graphs and local subgraphs. GNNGL-PPI consisted of two main components: using Graph Isomorphism Network (GIN) to extract global graph features from PPI network graph, and employing GIN As Kernel (GIN-AK) to extract local subgraph features from the subgraphs of protein vertices. Additionally, considering the imbalanced distribution of samples in each category within the benchmark datasets, we introduced an Asymmetric Loss (ASL) function to further enhance the predictive performance of the method. Through evaluations on six benchmark test sets formed by three different dataset partitioning algorithms (Random, BFS, DFS), GNNGL-PPI outperformed the state-of-the-art multi-category prediction methods of PPI, as measured by the comprehensive performance evaluation metric F1-measure. Furthermore, interpretability analysis confirmed the effectiveness of GNNGL-PPI as a reliable multi-category prediction method for predicting protein-protein interactions.


Asunto(s)
Algoritmos , Biología Computacional , Redes Neurales de la Computación , Mapeo de Interacción de Proteínas , Mapeo de Interacción de Proteínas/métodos , Biología Computacional/métodos , Mapas de Interacción de Proteínas , Humanos , Proteínas/metabolismo
10.
Nat Commun ; 15(1): 4217, 2024 May 17.
Artículo en Inglés | MEDLINE | ID: mdl-38760359

RESUMEN

Helix mimicry provides probes to perturb protein-protein interactions (PPIs). Helical conformations can be stabilized by joining side chains of non-terminal residues (stapling) or via capping fragments. Nature exclusively uses capping, but synthetic helical mimics are heavily biased towards stapling. This study comprises: (i) creation of a searchable database of unique helical N-caps (ASX motifs, a protein structural motif with two intramolecular hydrogen-bonds between aspartic acid/asparagine and following residues); (ii) testing trends observed in this database using linear peptides comprising only canonical L-amino acids; and, (iii) novel synthetic N-caps for helical interface mimicry. Here we show many natural ASX motifs comprise hydrophobic triangles, validate their effect in linear peptides, and further develop a biomimetic of them, Bicyclic ASX Motif Mimics (BAMMs). BAMMs are powerful helix inducing motifs. They are synthetically accessible, and potentially useful to a broad section of the community studying disruption of PPIs using secondary structure mimics.


Asunto(s)
Secuencias de Aminoácidos , Biología Computacional , Biología Computacional/métodos , Enlace de Hidrógeno , Péptidos/química , Péptidos/metabolismo , Interacciones Hidrofóbicas e Hidrofílicas , Estructura Secundaria de Proteína , Modelos Moleculares , Secuencia de Aminoácidos , Bases de Datos de Proteínas , Proteínas/química , Proteínas/metabolismo , Ácido Aspártico/química
11.
Chem Rev ; 124(10): 6501-6542, 2024 May 22.
Artículo en Inglés | MEDLINE | ID: mdl-38722769

RESUMEN

Due to advances in methods for site-specific incorporation of unnatural amino acids (UAAs) into proteins, a large number of UAAs with tailored chemical and/or physical properties have been developed and used in a wide array of biological applications. In particular, UAAs with specific spectroscopic characteristics can be used as external reporters to produce additional signals, hence increasing the information content obtainable in protein spectroscopic and/or imaging measurements. In this Review, we summarize the progress in the past two decades in the development of such UAAs and their applications in biological spectroscopy and microscopy, with a focus on UAAs that can be used as site-specific vibrational, fluorescence, electron paramagnetic resonance (EPR), or nuclear magnetic resonance (NMR) probes. Wherever applicable, we also discuss future directions.


Asunto(s)
Aminoácidos , Aminoácidos/química , Proteínas/química , Proteínas/metabolismo , Espectroscopía de Resonancia por Spin del Electrón/métodos , Microscopía/métodos , Espectroscopía de Resonancia Magnética/métodos , Humanos
12.
Proc Natl Acad Sci U S A ; 121(22): e2319094121, 2024 May 28.
Artículo en Inglés | MEDLINE | ID: mdl-38768341

RESUMEN

Protein-protein and protein-water hydrogen bonding interactions play essential roles in the way a protein passes through the transition state during folding or unfolding, but the large number of these interactions in molecular dynamics (MD) simulations makes them difficult to analyze. Here, we introduce a state space representation and associated "rarity" measure to identify and quantify transition state passage (transit) events. Applying this representation to a long MD simulation trajectory that captured multiple folding and unfolding events of the GTT WW domain, a small protein often used as a model for the folding process, we identified three transition categories: Highway (faster), Meander (slower), and Ambiguous (intermediate). We developed data sonification and visualization tools to analyze hydrogen bond dynamics before, during, and after these transition events. By means of these tools, we were able to identify characteristic hydrogen bonding patterns associated with "Highway" versus "Meander" versus "Ambiguous" transitions and to design algorithms that can identify these same folding pathways and critical protein-water interactions directly from the data. Highly cooperative hydrogen bonding can either slow down or speed up transit. Furthermore, an analysis of protein-water hydrogen bond dynamics at the surface of WW domain shows an increase in hydrogen bond lifetime from folded to unfolded conformations with Ambiguous transitions as an outlier. In summary, hydrogen bond dynamics provide a direct window into the heterogeneity of transits, which can vary widely in duration (by a factor of 10) due to a complex energy landscape.


Asunto(s)
Enlace de Hidrógeno , Simulación de Dinámica Molecular , Pliegue de Proteína , Proteínas , Proteínas/química , Proteínas/metabolismo , Agua/química , Dominios WW , Conformación Proteica , Algoritmos
13.
Proc Natl Acad Sci U S A ; 121(21): e2322428121, 2024 May 21.
Artículo en Inglés | MEDLINE | ID: mdl-38739795

RESUMEN

Protein evolution is guided by structural, functional, and dynamical constraints ensuring organismal viability. Pseudogenes are genomic sequences identified in many eukaryotes that lack translational activity due to sequence degradation and thus over time have undergone "devolution." Previously pseudogenized genes sometimes regain their protein-coding function, suggesting they may still encode robust folding energy landscapes despite multiple mutations. We study both the physical folding landscapes of protein sequences corresponding to human pseudogenes using the Associative Memory, Water Mediated, Structure and Energy Model, and the evolutionary energy landscapes obtained using direct coupling analysis (DCA) on their parent protein families. We found that generally mutations that have occurred in pseudogene sequences have disrupted their native global network of stabilizing residue interactions, making it harder for them to fold if they were translated. In some cases, however, energetic frustration has apparently decreased when the functional constraints were removed. We analyzed this unexpected situation for Cyclophilin A, Profilin-1, and Small Ubiquitin-like Modifier 2 Protein. Our analysis reveals that when such mutations in the pseudogene ultimately stabilize folding, at the same time, they likely alter the pseudogenes' former biological activity, as estimated by DCA. We localize most of these stabilizing mutations generally to normally frustrated regions required for binding to other partners.


Asunto(s)
Evolución Molecular , Pliegue de Proteína , Seudogenes , Seudogenes/genética , Humanos , Mutación , Secuencia de Aminoácidos , Proteínas/genética , Proteínas/química , Proteínas/metabolismo , Termodinámica
14.
J Am Chem Soc ; 146(20): 14307-14317, 2024 May 22.
Artículo en Inglés | MEDLINE | ID: mdl-38722189

RESUMEN

Biomolecules such as proteins and RNA could organize to form condensates with distinct microenvironments through liquid-liquid phase separation (LLPS). Recent works have demonstrated that the microenvironment of biomolecular condensates plays a crucial role in mediating biological activities, such as the partition of biomolecules, and the subphase organization of the multiphasic condensates. Ions could influence the phase transition point of LLPS, following the Hofmeister series. However, the ion-specific effect on the microenvironment of biomolecular condensates remains unknown. In this study, we utilized fluorescence lifetime imaging microscopy (FLIM), fluorescence recovery after photobleaching (FRAP), and microrheology techniques to investigate the ion effect on the microenvironment of condensates. We found that ions significantly affect the microenvironment of biomolecular condensates: salting-in ions increase micropolarity and reduce the microviscosity of the condensate, while salting-out ions induce opposing effects. Furthermore, we manipulate the miscibility and multilayering behavior of condensates through ion-specific effects. In summary, our work provides the first quantitative survey of the microenvironment of protein condensates in the presence of ions from the Hofmeister series, demonstrating how ions impact micropolarity, microviscosity, and viscoelasticity of condensates. Our results bear implications on how membrane-less organelles would exhibit varying microenvironments in the presence of continuously changing cellular conditions.


Asunto(s)
Condensados Biomoleculares , Condensados Biomoleculares/química , Iones/química , Recuperación de Fluorescencia tras Fotoblanqueo , Microscopía Fluorescente , Proteínas/química , Proteínas/metabolismo
15.
Biophys Chem ; 310: 107238, 2024 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-38733645

RESUMEN

Quantum dots (QDs) are semiconductor nanocrystals (2-10 nm) with unique optical and electronic properties due to quantum confinement effects. They offer high photostability, narrow emission spectra, broad absorption spectrum, and high quantum yields, making them versatile in various applications. Due to their highly reactive surfaces, QDs can conjugate with biomolecules while being used, produced, or unintentionally released into the environment. This systematic review delves into intricate relationship between QDs and proteins, examining their interactions that influence their physicochemical properties, enzymatic activity, ligand binding affinity, and stability. The research utilized electronic databases like PubMed, WOS, and Proquest, along with manual reviews from 2013 to 2023 using relevant keywords, to identify suitable literature. After screening titles and abstracts, only articles meeting inclusion criteria were selected for full text readings. This systematic review of 395 articles identifies 125 articles meeting the inclusion criteria, categorized into five overarching themes, encompassing various mechanisms of QDs and proteins interactions, including adsorption to covalent binding, contingent on physicochemical properties of QDs. Through a meticulous analysis of existing literature, it unravels intricate nature of interaction, significant influence on nanomaterials and biological entities, and potential for synergistic applications harnessing both specific and nonspecific interactions across various fields.


Asunto(s)
Proteínas , Puntos Cuánticos , Puntos Cuánticos/química , Puntos Cuánticos/metabolismo , Proteínas/química , Proteínas/metabolismo , Humanos , Nanotecnología , Unión Proteica
16.
Expert Opin Drug Discov ; 19(6): 649-670, 2024 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-38715415

RESUMEN

INTRODUCTION: Modern drug discovery revolves around designing ligands that target the chosen biomolecule, typically proteins. For this, the evaluation of affinities of putative ligands is crucial. This has given rise to a multitude of dedicated computational and experimental methods that are constantly being developed and improved. AREAS COVERED: In this review, the authors reassess both the industry mainstays and the newest trends among the methods for protein - small-molecule affinity determination. They discuss both computational affinity predictions and experimental techniques, describing their basic principles, main limitations, and advantages. Together, this serves as initial guide to the currently most popular and cutting-edge ligand-binding assays employed in rational drug design. EXPERT OPINION: The affinity determination methods continue to develop toward miniaturization, high-throughput, and in-cell application. Moreover, the availability of data analysis tools has been constantly increasing. Nevertheless, cross-verification of data using at least two different techniques and careful result interpretation remain of utmost importance.


Asunto(s)
Diseño de Fármacos , Descubrimiento de Drogas , Proteínas , Ligandos , Proteínas/metabolismo , Humanos , Descubrimiento de Drogas/métodos , Diseño de Fármacos/métodos , Unión Proteica , Ensayos Analíticos de Alto Rendimiento/métodos
17.
Brief Bioinform ; 25(3)2024 Mar 27.
Artículo en Inglés | MEDLINE | ID: mdl-38739759

RESUMEN

Proteins interact with diverse ligands to perform a large number of biological functions, such as gene expression and signal transduction. Accurate identification of these protein-ligand interactions is crucial to the understanding of molecular mechanisms and the development of new drugs. However, traditional biological experiments are time-consuming and expensive. With the development of high-throughput technologies, an increasing amount of protein data is available. In the past decades, many computational methods have been developed to predict protein-ligand interactions. Here, we review a comprehensive set of over 160 protein-ligand interaction predictors, which cover protein-protein, protein-nucleic acid, protein-peptide and protein-other ligands (nucleotide, heme, ion) interactions. We have carried out a comprehensive analysis of the above four types of predictors from several significant perspectives, including their inputs, feature profiles, models, availability, etc. The current methods primarily rely on protein sequences, especially utilizing evolutionary information. The significant improvement in predictions is attributed to deep learning methods. Additionally, sequence-based pretrained models and structure-based approaches are emerging as new trends.


Asunto(s)
Biología Computacional , Ácidos Nucleicos , Proteínas , Ácidos Nucleicos/metabolismo , Ácidos Nucleicos/química , Proteínas/química , Proteínas/metabolismo , Biología Computacional/métodos , Ligandos , Unión Proteica , Humanos
18.
Proc Natl Acad Sci U S A ; 121(21): e2400260121, 2024 May 21.
Artículo en Inglés | MEDLINE | ID: mdl-38743624

RESUMEN

We introduce ZEPPI (Z-score Evaluation of Protein-Protein Interfaces), a framework to evaluate structural models of a complex based on sequence coevolution and conservation involving residues in protein-protein interfaces. The ZEPPI score is calculated by comparing metrics for an interface to those obtained from randomly chosen residues. Since contacting residues are defined by the structural model, this obviates the need to account for indirect interactions. Further, although ZEPPI relies on species-paired multiple sequence alignments, its focus on interfacial residues allows it to leverage quite shallow alignments. ZEPPI can be implemented on a proteome-wide scale and is applied here to millions of structural models of dimeric complexes in the Escherichia coli and human interactomes found in the PrePPI database. PrePPI's scoring function is based primarily on the evaluation of protein-protein interfaces, and ZEPPI adds a new feature to this analysis through the incorporation of evolutionary information. ZEPPI performance is evaluated through applications to experimentally determined complexes and to decoys from the CASP-CAPRI experiment. As we discuss, the standard CAPRI scores used to evaluate docking models are based on model quality and not on the ability to give yes/no answers as to whether two proteins interact. ZEPPI is able to detect weak signals from PPI models that the CAPRI scores define as incorrect and, similarly, to identify potential PPIs defined as low confidence by the current PrePPI scoring function. A number of examples that illustrate how the combination of PrePPI and ZEPPI can yield functional hypotheses are provided.


Asunto(s)
Proteoma , Proteoma/metabolismo , Humanos , Mapeo de Interacción de Proteínas/métodos , Modelos Moleculares , Escherichia coli/metabolismo , Escherichia coli/genética , Bases de Datos de Proteínas , Unión Proteica , Proteínas de Escherichia coli/metabolismo , Proteínas de Escherichia coli/química , Proteínas de Escherichia coli/genética , Proteínas/química , Proteínas/metabolismo , Alineación de Secuencia
19.
BMC Genomics ; 25(1): 466, 2024 May 13.
Artículo en Inglés | MEDLINE | ID: mdl-38741045

RESUMEN

BACKGROUND: Protein-protein interactions (PPIs) hold significant importance in biology, with precise PPI prediction as a pivotal factor in comprehending cellular processes and facilitating drug design. However, experimental determination of PPIs is laborious, time-consuming, and often constrained by technical limitations. METHODS: We introduce a new node representation method based on initial information fusion, called FFANE, which amalgamates PPI networks and protein sequence data to enhance the precision of PPIs' prediction. A Gaussian kernel similarity matrix is initially established by leveraging protein structural resemblances. Concurrently, protein sequence similarities are gauged using the Levenshtein distance, enabling the capture of diverse protein attributes. Subsequently, to construct an initial information matrix, these two feature matrices are merged by employing weighted fusion to achieve an organic amalgamation of structural and sequence details. To gain a more profound understanding of the amalgamated features, a Stacked Autoencoder (SAE) is employed for encoding learning, thereby yielding more representative feature representations. Ultimately, classification models are trained to predict PPIs by using the well-learned fusion feature. RESULTS: When employing 5-fold cross-validation experiments on SVM, our proposed method achieved average accuracies of 94.28%, 97.69%, and 84.05% in terms of Saccharomyces cerevisiae, Homo sapiens, and Helicobacter pylori datasets, respectively. CONCLUSION: Experimental findings across various authentic datasets validate the efficacy and superiority of this fusion feature representation approach, underscoring its potential value in bioinformatics.


Asunto(s)
Biología Computacional , Mapeo de Interacción de Proteínas , Mapeo de Interacción de Proteínas/métodos , Biología Computacional/métodos , Algoritmos , Helicobacter pylori/metabolismo , Helicobacter pylori/genética , Máquina de Vectores de Soporte , Proteínas/metabolismo , Proteínas/química , Humanos , Mapas de Interacción de Proteínas , Bases de Datos de Proteínas
20.
Clin Respir J ; 18(5): e13774, 2024 May.
Artículo en Inglés | MEDLINE | ID: mdl-38742362

RESUMEN

OBJECTIVE: This study aimed to explore the application value of human epididymis protein 4 (HE4) in diagnosing and monitoring the prognosis of lung cancer. METHODS: First, TCGA (The Cancer Genome Atlas) databases were used to analyze whey-acidic-protein 4-disulfide bond core domain 2 (WFDC2) gene expression levels in lung cancer tissues. Then, a total of 160 individuals were enrolled, categorized into three groups: the lung cancer group (n = 80), the benign lesions group (n = 40), and the healthy controls group (n = 40). Serum HE4 levels and other biomarkers were quantified using an electro-chemiluminescent immunoassay. Additionally, the expression of HE4 in tissues was analyzed through immunohistochemistry (IHC). In vitro cultures of human airway epithelial (human bronchial epithelial [HBE]) cells and various lung cancer cell lines (SPC/PC9/A594/H520) were utilized to detect HE4 levels via western blot (WB). RESULTS: Analysis of the TCGA and UALCAN (The University of Alabama at Birmingham Cancer Data Analysis Portal) databases showed that WFDC2 gene expression levels were upregulated in lung cancer tissues (p < 0.01). Compared with the control group and the benign group, HE4 was significantly higher in the serum of patients with lung cancer (p < 0.001). Receiver operating characteristic (ROC) analysis confirmed that HE4 had better diagnostic efficacy than classical markers in the differential diagnosis of lung cancer and benign lesions and had the highest diagnostic value in lung adenocarcinoma (area under the ROC curve [AUC] = 0.826). HE4 increased in early lung cancer and positively correlated with poor prognosis (p < 0.001). Moreover, the results of WB and IHC revealed that the expression of HE4 was increased in lung cancer cells (SPC/A549/H520) and lung cancer tissues but decreased in PC9 cells with a lack of exon EGFR19 (p < 0.05). CONCLUSION: Serum HE4 emerges as a promising novel biomarker for the diagnosis and prognosis assessment of lung cancer.


Asunto(s)
Biomarcadores de Tumor , Neoplasias Pulmonares , Proteína 2 de Dominio del Núcleo de Cuatro Disulfuros WAP , Humanos , Proteína 2 de Dominio del Núcleo de Cuatro Disulfuros WAP/metabolismo , Proteína 2 de Dominio del Núcleo de Cuatro Disulfuros WAP/análisis , Neoplasias Pulmonares/diagnóstico , Neoplasias Pulmonares/genética , Neoplasias Pulmonares/metabolismo , Neoplasias Pulmonares/patología , Biomarcadores de Tumor/metabolismo , Biomarcadores de Tumor/sangre , Biomarcadores de Tumor/genética , Masculino , Pronóstico , Femenino , Persona de Mediana Edad , Proteínas/metabolismo , Proteínas/genética , Anciano , Regulación Neoplásica de la Expresión Génica , Línea Celular Tumoral , Inmunohistoquímica
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA