Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 2.040
Filtrar
Mais filtros

Intervalo de ano de publicação
1.
Mol Cell ; 84(2): 202-220.e15, 2024 Jan 18.
Artigo em Inglês | MEDLINE | ID: mdl-38103559

RESUMO

Compounds binding to the bromodomains of bromodomain and extra-terminal (BET) family proteins, particularly BRD4, are promising anticancer agents. Nevertheless, side effects and drug resistance pose significant obstacles in BET-based therapeutics development. Using high-throughput screening of a 200,000-compound library, we identified small molecules targeting a phosphorylated intrinsically disordered region (IDR) of BRD4 that inhibit phospho-BRD4 (pBRD4)-dependent human papillomavirus (HPV) genome replication in HPV-containing keratinocytes. Proteomic profiling identified two DNA damage response factors-53BP1 and BARD1-crucial for differentiation-associated HPV genome amplification. pBRD4-mediated recruitment of 53BP1 and BARD1 to the HPV origin of replication occurs in a spatiotemporal and BRD4 long (BRD4-L) and short (BRD4-S) isoform-specific manner. This recruitment is disrupted by phospho-IDR-targeting compounds with little perturbation of the global transcriptome and BRD4 chromatin landscape. The discovery of these protein-protein interaction inhibitors (PPIi) not only demonstrates the feasibility of developing PPIi against phospho-IDRs but also uncovers antiviral agents targeting an epigenetic regulator essential for virus-host interaction and cancer development.


Assuntos
Infecções por Papillomavirus , Fatores de Transcrição , Humanos , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo , Proteínas Nucleares/genética , Proteínas Nucleares/metabolismo , Papillomavirus Humano , Infecções por Papillomavirus/tratamento farmacológico , Infecções por Papillomavirus/genética , Proteômica , Proteínas de Ciclo Celular/genética , Proteínas de Ciclo Celular/metabolismo , Papillomaviridae/genética , Papillomaviridae/metabolismo , Proteínas Virais/genética , Replicação Viral/fisiologia , Reparo do DNA , Proteínas que Contêm Bromodomínio
2.
Trends Biochem Sci ; 48(6): 539-552, 2023 06.
Artigo em Inglês | MEDLINE | ID: mdl-36841635

RESUMO

Protein-protein interactions (PPIs) have important roles in various cellular processes, but are commonly described as 'undruggable' therapeutic targets due to their large, flat, featureless interfaces. Fragment-based drug discovery (FBDD) has achieved great success in modulating PPIs, with more than ten compounds in clinical trials. Here, we highlight the progress of FBDD in modulating PPIs for therapeutic development. Targeting hot spots that have essential roles in both fragment binding and PPIs provides a shortcut for the development of PPI modulators via FBDD. We highlight successful cases of cracking the 'undruggable' problems of PPIs using fragment-based approaches. We also introduce new technologies and future trends. Thus, we hope that this review will provide useful guidance for drug discovery targeting PPIs.


Assuntos
Descoberta de Drogas , Ligação Proteica
3.
Trends Biochem Sci ; 48(6): 527-538, 2023 06.
Artigo em Inglês | MEDLINE | ID: mdl-37061423

RESUMO

Protein-protein interactions (PPIs) drive biological processes, and disruption of PPIs can cause disease. With recent breakthroughs in structure prediction and a deluge of genomic sequence data, computational methods to predict PPIs and model spatial structures of protein complexes are now approaching the accuracy of experimental approaches for permanent interactions and show promise for elucidating transient interactions. As we describe here, the key to this success is rich evolutionary information deciphered from thousands of homologous sequences that coevolve in interacting partners. This covariation signal, revealed by sophisticated statistical and machine learning (ML) algorithms, predicts physiological interactions. Accurate artificial intelligence (AI)-based modeling of protein structures promises to provide accurate 3D models of PPIs at a proteome-wide scale.


Assuntos
Inteligência Artificial , Mapeamento de Interação de Proteínas , Mapeamento de Interação de Proteínas/métodos , Algoritmos , Aprendizado de Máquina , Proteoma , Biologia Computacional/métodos
4.
Traffic ; 25(4): e12933, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38600522

RESUMO

Macroautophagy/autophagy is an essential catabolic process that targets a wide variety of cellular components including proteins, organelles, and pathogens. ATG7, a protein involved in the autophagy process, plays a crucial role in maintaining cellular homeostasis and can contribute to the development of diseases such as cancer. ATG7 initiates autophagy by facilitating the lipidation of the ATG8 proteins in the growing autophagosome membrane. The noncanonical isoform ATG7(2) is unable to perform ATG8 lipidation; however, its cellular regulation and function are unknown. Here, we uncovered a distinct regulation and function of ATG7(2) in contrast with ATG7(1), the canonical isoform. First, affinity-purification mass spectrometry analysis revealed that ATG7(2) establishes direct protein-protein interactions (PPIs) with metabolic proteins, whereas ATG7(1) primarily interacts with autophagy machinery proteins. Furthermore, we identified that ATG7(2) mediates a decrease in metabolic activity, highlighting a novel splice-dependent function of this important autophagy protein. Then, we found a divergent expression pattern of ATG7(1) and ATG7(2) across human tissues. Conclusively, our work uncovers the divergent patterns of expression, protein interactions, and function of ATG7(2) in contrast to ATG7(1). These findings suggest a molecular switch between main catabolic processes through isoform-dependent expression of a key autophagy gene.


Assuntos
Autofagia , Metabolismo Energético , Humanos , Autofagossomos/metabolismo , Proteínas Relacionadas à Autofagia/metabolismo , Proteínas Associadas aos Microtúbulos/metabolismo , Isoformas de Proteínas/metabolismo
5.
Brief Bioinform ; 25(6)2024 Sep 23.
Artigo em Inglês | MEDLINE | ID: mdl-39318189

RESUMO

Prostate cancer (PCa) is the most prevalent cancer affecting American men. Castration-resistant prostate cancer (CRPC) can emerge during hormone therapy for PCa, manifesting with elevated serum prostate-specific antigen levels, continued disease progression, and/or metastasis to the new sites, resulting in a poor prognosis. A subset of CRPC patients shows a neuroendocrine (NE) phenotype, signifying reduced or no reliance on androgen receptor signaling and a particularly unfavorable prognosis. In this study, we incorporated computational approaches based on both gene expression profiles and protein-protein interaction networks. We identified 500 potential marker genes, which are significantly enriched in cell cycle and neuronal processes. The top 40 candidates, collectively named CDHu40, demonstrated superior performance in distinguishing NE PCa (NEPC) and non-NEPC samples based on gene expression profiles. CDHu40 outperformed most of the other published marker sets, excelling particularly at the prognostic level. Notably, some marker genes in CDHu40, absent in the other marker sets, have been reported to be associated with NEPC in the literature, such as DDC, FOLH1, BEX1, MAST1, and CACNA1A. Importantly, elevated CDHu40 scores derived from our predictive model showed a robust correlation with unfavorable survival outcomes in patients, indicating the potential of the CDHu40 score as a promising indicator for predicting the survival prognosis of those patients with the NE phenotype. Motif enrichment analysis on the top candidates suggests that REST and E2F6 may serve as key regulators in the NEPC progression.


Assuntos
Biomarcadores Tumorais , Humanos , Masculino , Biomarcadores Tumorais/genética , Prognóstico , Regulação Neoplásica da Expressão Gênica , Neoplasias da Próstata/genética , Neoplasias da Próstata/patologia , Neoplasias da Próstata/metabolismo , Mapas de Interação de Proteínas , Perfilação da Expressão Gênica , Neoplasias de Próstata Resistentes à Castração/genética , Neoplasias de Próstata Resistentes à Castração/patologia , Neoplasias de Próstata Resistentes à Castração/metabolismo , Biologia Computacional/métodos , Carcinoma Neuroendócrino/genética , Carcinoma Neuroendócrino/patologia , Carcinoma Neuroendócrino/metabolismo
6.
Brief Bioinform ; 25(4)2024 May 23.
Artigo em Inglês | MEDLINE | ID: mdl-39003530

RESUMO

Protein function prediction is critical for understanding the cellular physiological and biochemical processes, and it opens up new possibilities for advancements in fields such as disease research and drug discovery. During the past decades, with the exponential growth of protein sequence data, many computational methods for predicting protein function have been proposed. Therefore, a systematic review and comparison of these methods are necessary. In this study, we divide these methods into four different categories, including sequence-based methods, 3D structure-based methods, PPI network-based methods and hybrid information-based methods. Furthermore, their advantages and disadvantages are discussed, and then their performance is comprehensively evaluated and compared. Finally, we discuss the challenges and opportunities present in this field.


Assuntos
Biologia Computacional , Proteínas , Proteínas/química , Proteínas/metabolismo , Biologia Computacional/métodos , Humanos , Análise de Sequência de Proteína/métodos , Algoritmos
7.
Proc Natl Acad Sci U S A ; 120(21): e2221967120, 2023 05 23.
Artigo em Inglês | MEDLINE | ID: mdl-37186857

RESUMO

The structure-based design of small-molecule inhibitors targeting protein-protein interactions (PPIs) remains a huge challenge as the drug must bind typically wide and shallow protein sites. A PPI target of high interest for hematological cancer therapy is myeloid cell leukemia 1 (Mcl-1), a prosurvival guardian protein from the Bcl-2 family. Despite being previously considered undruggable, seven small-molecule Mcl-1 inhibitors have recently entered clinical trials. Here, we report the crystal structure of the clinical-stage inhibitor AMG-176 bound to Mcl-1 and analyze its interaction along with clinical inhibitors AZD5991 and S64315. Our X-ray data reveal high plasticity of Mcl-1 and a remarkable ligand-induced pocket deepening. Nuclear Magnetic Resonance (NMR)-based free ligand conformer analysis demonstrates that such unprecedented induced fit is uniquely achieved by designing highly rigid inhibitors, preorganized in their bioactive conformation. By elucidating key chemistry design principles, this work provides a roadmap for targeting the largely untapped PPI class more successfully.


Assuntos
Apoptose , Naftalenos , Modelos Moleculares , Proteína de Sequência 1 de Leucemia de Células Mieloides/metabolismo , Ligantes
8.
J Biol Chem ; : 107868, 2024 Oct 09.
Artigo em Inglês | MEDLINE | ID: mdl-39393572

RESUMO

Dependent on the light conditions, photosynthetic organisms switch between carbohydrate synthesis or breakdown, for which the reversibility of carbohydrate metabolism, including glycolysis, is essential. Kinetic regulation of phosphofructokinase (PFK), a key-control point in glycolysis, was studied in the cyanobacterium Synechocystis sp. PCC 6803. The two PFK iso-enzymes (PFK- A1, PFK-A2), were found to use ADP instead of ATP, and have similar kinetic characteristics, but differ in their allosteric regulation. PFK-A1 is inhibited by 3- phosphoglycerate, a product of the Calvin-Benson-Bassham cycle, while PFK-A2 is inhibited by ATP, which is provided by photosynthesis. This regulation enables cyanobacteria to switch PFK off in light, and on in darkness. ADP dependence has not been reported before for the PFK-A enzyme family, and was thought to be restricted to the PFK-B ribokinase superfamily. Phosphate donor specificity within the PFK-A family could be related to specific binding motifs in ATP-, ADP-, and PPi-dependent PFK-As. Phylogenetic analysis revealed a distribution of ADP-PFK-As in cyanobacteria and in a few alphaproteobacteria, which was confirmed in enzymatic studies.

9.
Brief Bioinform ; 24(6)2023 09 22.
Artigo em Inglês | MEDLINE | ID: mdl-37870286

RESUMO

The advanced language models have enabled us to recognize protein-protein interactions (PPIs) and interaction sites using protein sequences or structures. Here, we trained the MindSpore ProteinBERT (MP-BERT) model, a Bidirectional Encoder Representation from Transformers, using protein pairs as inputs, making it suitable for identifying PPIs and their respective interaction sites. The pretrained model (MP-BERT) was fine-tuned as MPB-PPI (MP-BERT on PPI) and demonstrated its superiority over the state-of-the-art models on diverse benchmark datasets for predicting PPIs. Moreover, the model's capability to recognize PPIs among various organisms was evaluated on multiple organisms. An amalgamated organism model was designed, exhibiting a high level of generalization across the majority of organisms and attaining an accuracy of 92.65%. The model was also customized to predict interaction site propensity by fine-tuning it with PPI site data as MPB-PPISP. Our method facilitates the prediction of both PPIs and their interaction sites, thereby illustrating the potency of transfer learning in dealing with the protein pair task.


Assuntos
Aprendizado de Máquina , Proteínas , Proteínas/química , Sequência de Aminoácidos
10.
Brief Bioinform ; 24(5)2023 09 20.
Artigo em Inglês | MEDLINE | ID: mdl-37480553

RESUMO

Most life activities in organisms are regulated through protein complexes, which are mainly controlled via Protein-Protein Interactions (PPIs). Discovering new interactions between proteins and revealing their biological functions are of great significance for understanding the molecular mechanisms of biological processes and identifying the potential targets in drug discovery. Current experimental methods only capture stable protein interactions, which lead to limited coverage. In addition, expensive cost and time consuming are also the obvious shortcomings. In recent years, various computational methods have been successfully developed for predicting PPIs based only on protein homology, primary sequences of protein or gene ontology information. Computational efficiency and data complexity are still the main bottlenecks for the algorithm generalization. In this study, we proposed a novel computational framework, HNSPPI, to predict PPIs. As a hybrid supervised learning model, HNSPPI comprehensively characterizes the intrinsic relationship between two proteins by integrating amino acid sequence information and connection properties of PPI network. The experimental results show that HNSPPI works very well on six benchmark datasets. Moreover, the comparison analysis proved that our model significantly outperforms other five existing algorithms. Finally, we used the HNSPPI model to explore the SARS-CoV-2-Human interaction system and found several potential regulations. In summary, HNSPPI is a promising model for predicting new protein interactions from known PPI data.


Assuntos
COVID-19 , Humanos , SARS-CoV-2 , Algoritmos , Sequência de Aminoácidos , Benchmarking
11.
Brief Bioinform ; 24(2)2023 03 19.
Artigo em Inglês | MEDLINE | ID: mdl-36880207

RESUMO

Protein-protein interactions (PPIs) carry out the cellular processes of all living organisms. Experimental methods for PPI detection suffer from high cost and false-positive rate, hence efficient computational methods are highly desirable for facilitating PPI detection. In recent years, benefiting from the enormous amount of protein data produced by advanced high-throughput technologies, machine learning models have been well developed in the field of PPI prediction. In this paper, we present a comprehensive survey of the recently proposed machine learning-based prediction methods. The machine learning models applied in these methods and details of protein data representation are also outlined. To understand the potential improvements in PPI prediction, we discuss the trend in the development of machine learning-based methods. Finally, we highlight potential directions in PPI prediction, such as the use of computationally predicted protein structures to extend the data source for machine learning models. This review is supposed to serve as a companion for further improvements in this field.


Assuntos
Aprendizado de Máquina , Mapeamento de Interação de Proteínas , Mapeamento de Interação de Proteínas/métodos , Proteínas/metabolismo , Biologia Computacional/métodos
12.
Brief Bioinform ; 24(1)2023 01 19.
Artigo em Inglês | MEDLINE | ID: mdl-36403092

RESUMO

MOTIVATION: Biological experimental approaches to protein-protein interaction (PPI) site prediction are critical for understanding the mechanisms of biochemical processes but are time-consuming and laborious. With the development of Deep Learning (DL) techniques, the most popular Convolutional Neural Networks (CNN)-based methods have been proposed to address these problems. Although significant progress has been made, these methods still have limitations in encoding the characteristics of each amino acid in protein sequences. Current methods cannot efficiently explore the nature of Position Specific Scoring Matrix (PSSM), secondary structure and raw protein sequences by processing them all together. For PPI site prediction, how to effectively model the PPI context with attention to prediction remains an open problem. In addition, the long-distance dependencies of PPI features are important, which is very challenging for many CNN-based methods because the innate ability of CNN is difficult to outperform auto-regressive models like Transformers. RESULTS: To effectively mine the properties of PPI features, a novel hybrid neural network named HN-PPISP is proposed, which integrates a Multi-layer Perceptron Mixer (MLP-Mixer) module for local feature extraction and a two-stage multi-branch module for global feature capture. The model merits Transformer, TextCNN and Bi-LSTM as a powerful alternative for PPI site prediction. On the one hand, this is the first application of an advanced Transformer (i.e. MLP-Mixer) with a hybrid network for sequence-based PPI prediction. On the other hand, unlike existing methods that treat global features altogether, the proposed two-stage multi-branch hybrid module firstly assigns different attention scores to the input features and then encodes the feature through different branch modules. In the first stage, different improved attention modules are hybridized to extract features from the raw protein sequences, secondary structure and PSSM, respectively. In the second stage, a multi-branch network is designed to aggregate information from both branches in parallel. The two branches encode the features and extract dependencies through several operations such as TextCNN, Bi-LSTM and different activation functions. Experimental results on real-world public datasets show that our model consistently achieves state-of-the-art performance over seven remarkable baselines. AVAILABILITY: The source code of HN-PPISP model is available at https://github.com/ylxu05/HN-PPISP.


Assuntos
Redes Neurais de Computação , Software , Sequência de Aminoácidos , Aminoácidos , Estrutura Secundária de Proteína
13.
Hum Genomics ; 18(1): 15, 2024 Feb 08.
Artigo em Inglês | MEDLINE | ID: mdl-38326862

RESUMO

BACKGROUND: It is valuable to analyze the genome-wide association studies (GWAS) data for a complex disease phenotype in the context of the protein-protein interaction (PPI) network, as the related pathophysiology results from the function of interacting polyprotein pathways. The analysis may include the design and curation of a phenotype-specific GWAS meta-database incorporating genotypic and eQTL data linking to PPI and other biological datasets, and the development of systematic workflows for PPI network-based data integration toward protein and pathway prioritization. Here, we pursued this analysis for blood pressure (BP) regulation. METHODS: The relational scheme of the implemented in Microsoft SQL Server BP-GWAS meta-database enabled the combined storage of: GWAS data and attributes mined from GWAS Catalog and the literature, Ensembl-defined SNP-transcript associations, and GTEx eQTL data. The BP-protein interactome was reconstructed from the PICKLE PPI meta-database, extending the GWAS-deduced network with the shortest paths connecting all GWAS-proteins into one component. The shortest-path intermediates were considered as BP-related. For protein prioritization, we combined a new integrated GWAS-based scoring scheme with two network-based criteria: one considering the protein role in the reconstructed by shortest-path (RbSP) interactome and one novel promoting the common neighbors of GWAS-prioritized proteins. Prioritized proteins were ranked by the number of satisfied criteria. RESULTS: The meta-database includes 6687 variants linked with 1167 BP-associated protein-coding genes. The GWAS-deduced PPI network includes 1065 proteins, with 672 forming a connected component. The RbSP interactome contains 1443 additional, network-deduced proteins and indicated that essentially all BP-GWAS proteins are at most second neighbors. The prioritized BP-protein set was derived from the union of the most BP-significant by any of the GWAS-based or the network-based criteria. It included 335 proteins, with ~ 2/3 deduced from the BP PPI network extension and 126 prioritized by at least two criteria. ESR1 was the only protein satisfying all three criteria, followed in the top-10 by INSR, PTN11, CDK6, CSK, NOS3, SH2B3, ATP2B1, FES and FINC, satisfying two. Pathway analysis of the RbSP interactome revealed numerous bioprocesses, which are indeed functionally supported as BP-associated, extending our understanding about BP regulation. CONCLUSIONS: The implemented workflow could be used for other multifactorial diseases.


Assuntos
Estudo de Associação Genômica Ampla , Mapas de Interação de Proteínas , Humanos , Mapas de Interação de Proteínas/genética , Estudo de Associação Genômica Ampla/métodos , Pressão Sanguínea/genética , Genótipo , Bases de Dados Factuais , ATPases Transportadoras de Cálcio da Membrana Plasmática
14.
Methods ; 223: 45-55, 2024 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-38272245

RESUMO

A fluorescent dansyl-based amphiphilic probe, 5-(dimethylamino)-N-hexadecylnaphthalene-1-sulfonamide (DLC), was synthesized and characterized to detect multiple analytes at different sensing environments. In acetonitrile, DLC detects nitro explosives such as trinitrophenol (TNP) and 2,4-dinitrophenol (2,4-DNP) by an emission "on-off" response method, and the detection limits (LOD) were estimated to be as low as 4.3 µM and 17.4 µM, respectively. Amphiphilic long chains of the probe were embedded into lipid bilayers to form nanoscale vesicles DLC.Ves. Nanovesicular probe DLC.Ves was found to be highly selective for Hg2+ among other metal ions and for pyrophosphate (PPi) ions among various anions. DLC.Ves could detect Hg2+ with a turn "on-off" emission and PPi with ratiometric change in emission at 525 nm. It is proposed that DLC.Ves could detect Hg2+ via the Hg2+-induced aggregation quenching mechanism and PPi through the Hydrogen bonding. The LODs are estimated as 6.41 µM and 70.9 µM for Hg2+ and PPi, respectively. 1H NMR, SEM, and fluorescence lifetime measurements confirmed the binding mechanism. Thus, it is believed that the simple fluorescent probe DLC could be a prominent sensor to detect multiple analytes depending on the sensing medium.


Assuntos
Mercúrio , Íons , Picratos , Mercúrio/química , Fluorescência , Corantes Fluorescentes/química
15.
Mol Cell Proteomics ; 22(10): 100637, 2023 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-37648026

RESUMO

cAMP response element-binding protein (CREB) regulated transcriptional coactivator 2 (CRTC2) is a critical transcription factor that maintains glucose homeostasis by activating CREB. Energy homeostasis is maintained through multiple pathways; therefore, CRTC2 may interact with other transcription factors, particularly under metabolic stress. CRTC2 liver-specific KO mice were created, and the global proteome, phosphoproteome, and acetylome from liver tissue under high-fat diet conditions were analyzed using liquid chromatography-tandem mass spectrometry and bioinformatics analysis. Differentially regulated proteins (DRPs) were enriched in metabolic pathways, which were subsequently corroborated through animal experiments. The consensus DRPs from these datasets were used as seed proteins to generate a protein-protein interaction network using STRING, and GeneMANIA identified fatty acid synthase as a mutually relevant protein. In an additional local-protein-protein interaction analysis of CRTC2 and fatty acid synthase with DRPs, sterol regulatory element binding transcription factor 1 (SREBF1) was the common mediator. CRTC2-CREB and SREBF1 are transcription factors, and DNA-binding motif analysis showed that multiple CRTC2-CREB-regulated genes possess SREBF1-binding motifs. This indicates the possible induction by the CRTC2-SREBF1 complex, which is validated through luciferase assay. Therefore, the CRTC2-SREBF1 complex potentially modulates the transcription of multiple proteins that fine-tune cellular metabolism under metabolic stress.

16.
BMC Bioinformatics ; 25(1): 252, 2024 Jul 31.
Artigo em Inglês | MEDLINE | ID: mdl-39085781

RESUMO

BACKGROUND: Proteins play a pivotal role in the diverse array of biological processes, making the precise prediction of protein-protein interaction (PPI) sites critical to numerous disciplines including biology, medicine and pharmacy. While deep learning methods have progressively been implemented for the prediction of PPI sites within proteins, the task of enhancing their predictive performance remains an arduous challenge. RESULTS: In this paper, we propose a novel PPI site prediction model (DGCPPISP) based on a dynamic graph convolutional neural network and a two-stage transfer learning strategy. Initially, we implement the transfer learning from dual perspectives, namely feature input and model training that serve to supply efficacious prior knowledge for our model. Subsequently, we construct a network designed for the second stage of training, which is built on the foundation of dynamic graph convolution. CONCLUSIONS: To evaluate its effectiveness, the performance of the DGCPPISP model is scrutinized using two benchmark datasets. The ensuing results demonstrate that DGCPPISP outshines competing methods in terms of performance. Specifically, DGCPPISP surpasses the second-best method, EGRET, by margins of 5.9%, 10.1%, and 13.3% for F1-measure, AUPRC, and MCC metrics respectively on Dset_186_72_PDB164. Similarly, on Dset_331, it eclipses the performance of the runner-up method, HN-PPISP, by 14.5%, 19.8%, and 29.9% respectively.


Assuntos
Redes Neurais de Computação , Mapeamento de Interação de Proteínas/métodos , Biologia Computacional/métodos , Proteínas/química , Proteínas/metabolismo , Aprendizado Profundo , Bases de Dados de Proteínas , Aprendizado de Máquina
17.
BMC Bioinformatics ; 25(1): 115, 2024 Mar 16.
Artigo em Inglês | MEDLINE | ID: mdl-38493120

RESUMO

BACKGROUND: Protein language models, inspired by the success of large language models in deciphering human language, have emerged as powerful tools for unraveling the intricate code of life inscribed within protein sequences. They have gained significant attention for their promising applications across various areas, including the sequence-based prediction of secondary and tertiary protein structure, the discovery of new functional protein sequences/folds, and the assessment of mutational impact on protein fitness. However, their utility in learning to predict protein residue properties based on scant datasets, such as protein-protein interaction (PPI)-hotspots whose mutations significantly impair PPIs, remained unclear. Here, we explore the feasibility of using protein language-learned representations as features for machine learning to predict PPI-hotspots using a dataset containing 414 experimentally confirmed PPI-hotspots and 504 PPI-nonhot spots. RESULTS: Our findings showcase the capacity of unsupervised learning with protein language models in capturing critical functional attributes of protein residues derived from the evolutionary information encoded within amino acid sequences. We show that methods relying on protein language models can compete with methods employing sequence and structure-based features to predict PPI-hotspots from the free protein structure. We observed an optimal number of features for model precision, suggesting a balance between information and overfitting. CONCLUSIONS: This study underscores the potential of transformer-based protein language models to extract critical knowledge from sparse datasets, exemplified here by the challenging realm of predicting PPI-hotspots. These models offer a cost-effective and time-efficient alternative to traditional experimental methods for predicting certain residue properties. However, the challenge of explaining why specific features are important for determining certain residue properties remains.


Assuntos
Aprendizado de Máquina , Proteínas , Humanos , Proteínas/química , Sequência de Aminoácidos
18.
J Proteome Res ; 23(8): 3269-3279, 2024 Aug 02.
Artigo em Inglês | MEDLINE | ID: mdl-38334954

RESUMO

Protein-protein interactions (PPIs) are fundamental to understanding biological systems as protein complexes are the active molecular modules critical for carrying out cellular functions. Dysfunctional PPIs have been associated with various diseases including cancer. Systems-wide PPI analysis not only sheds light on pathological mechanisms, but also represents a paradigm in identifying potential therapeutic targets. In recent years, cross-linking mass spectrometry (XL-MS) has emerged as a powerful tool for defining endogenous PPIs of cellular networks. While proteome-wide studies have been performed in cell lysates, intact cells and tissues, applications of XL-MS in clinical samples have not been reported. In this study, we adopted a DSBSO-based in vivo XL-MS platform to map interaction landscapes from two breast cancer patient-derived xenograft (PDX) models. As a result, we have generated a PDX interaction network comprising 2,557 human proteins and identified interactions unique to breast cancer subtypes. Interestingly, most of the observed differences in PPIs correlated well with protein abundance changes determined by TMT-based proteome quantitation. Collectively, this work has demonstrated the feasibility of XL-MS analysis in clinical samples, and established an analytical workflow for tissue cross-linking that can be generalized for mapping PPIs from patient samples in the future to dissect disease-relevant cellular networks.


Assuntos
Neoplasias da Mama , Mapas de Interação de Proteínas , Humanos , Neoplasias da Mama/metabolismo , Neoplasias da Mama/patologia , Feminino , Animais , Espectrometria de Massas/métodos , Camundongos , Proteoma/metabolismo , Proteoma/análise , Proteômica/métodos , Mapeamento de Interação de Proteínas/métodos
19.
J Proteome Res ; 23(1): 465-482, 2024 01 05.
Artigo em Inglês | MEDLINE | ID: mdl-38147655

RESUMO

Temozolomide (TMZ) is the first line of chemotherapy to treat primary brain tumors of the type glioblastoma multiforme (GBM). TMZ resistance (TMZR) is one of the main barriers to successful treatment and is a principal factor in relapse, resulting in a poor median survival of 15 months. The present paper focuses on proteomic analyses of cytosolic fractions from TMZ-resistant (TMZR) LN-18 cells. The experimental workflow includes an easy, cost-effective, and reproducible method to isolate subcellular fraction of cytosolic (CYTO) proteins, mitochondria, and plasma membrane proteins for proteomic studies. For this study, enriched cytoplasmic fractions were analyzed in replicates by nanoflow liquid chromatography tandem high-resolution mass spectrometry (nLC-MS/MS), and proteins identified were quantified using a label-free approach (LFQ). Statistical analysis of control (CTRL) and temozolomide-resistant (TMZR) proteomes revealed proteins that appear to be differentially controlled in the cytoplasm. The functions of these proteins are discussed as well as their roles in other cancers and TMZ resistance in GBM. Key proteins are also described through biological processes related to gene ontology (GO), molecular functions, and cellular components. For protein-protein interactions (PPI), network and pathway involvement analyses have been performed, highlighting the roles of key proteins in the TMZ resistance phenotypes. This study provides a detailed insight into methods of subcellular fractionation for proteomic analysis of TMZ-resistant GBM cells and the potential to apply this approach to future large-scale studies. Several key proteins, protein-protein interactions (PPI), and pathways have been identified, underlying the TMZ resistance phenotype and highlighting the proteins' biological functions.


Assuntos
Neoplasias Encefálicas , Glioblastoma , Humanos , Temozolomida/farmacologia , Temozolomida/uso terapêutico , Glioblastoma/patologia , Proteômica , Espectrometria de Massas em Tandem , Antineoplásicos Alquilantes/farmacologia , Antineoplásicos Alquilantes/uso terapêutico , Linhagem Celular Tumoral , Recidiva Local de Neoplasia , Citoplasma/metabolismo , Resistencia a Medicamentos Antineoplásicos , Neoplasias Encefálicas/genética
20.
J Struct Biol ; 216(4): 108118, 2024 Aug 28.
Artigo em Inglês | MEDLINE | ID: mdl-39214321

RESUMO

PPIs, or protein-protein interactions, are essential for many biological processes. According to the findings, abnormal PPIs have been linked to several diseases, such as cancer and infectious and neurological disorders. Consequently, focusing on PPIs is a path toward disease treatment and a crucial tool for producing novel medications. Many methods exist to investigate PPIs, including low- and high-throughput studies. Since many PPIs have been discovered using in vitro and in vivo experimental approaches, the use of computational methods to predict PPIs has grown due to the expanding scale of PPI data and the intrinsic complexity of interacting mechanisms. Recognizing PPI networks offers a systematic means of predicting protein functions, and pathways that are included. These investigations can help uncover the underlying molecular mechanisms of complex phenotypes and clarify the biological processes related to health and diseases. Therefore, our goal in this study is to provide an overview of the latest and most popular approaches for investigating PPIs. We also overview some important clinical approaches based on the PPIs and how these interactions can be targeted.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA