RESUMEN
With recent dramatic advances in various techniques used for protein structure research, we asked researchers to comment on the next exciting questions for the field and about how these techniques will advance our knowledge not only about proteins but also about human health and diseases.
RESUMEN
AlphaFold2 revolutionized structural biology with the ability to predict protein structures with exceptionally high accuracy. Its implementation, however, lacks the code and data required to train new models. These are necessary to (1) tackle new tasks, like protein-ligand complex structure prediction, (2) investigate the process by which the model learns and (3) assess the model's capacity to generalize to unseen regions of fold space. Here we report OpenFold, a fast, memory efficient and trainable implementation of AlphaFold2. We train OpenFold from scratch, matching the accuracy of AlphaFold2. Having established parity, we find that OpenFold is remarkably robust at generalizing even when the size and diversity of its training set is deliberately limited, including near-complete elisions of classes of secondary structure elements. By analyzing intermediate structures produced during training, we also gain insights into the hierarchical manner in which OpenFold learns to fold. In sum, our studies demonstrate the power and utility of OpenFold, which we believe will prove to be a crucial resource for the protein modeling community.
Asunto(s)
Modelos Moleculares , Pliegue de Proteína , Proteínas , Proteínas/química , Biología Computacional/métodos , Programas Informáticos , Conformación Proteica , Algoritmos , Estructura Secundaria de ProteínaRESUMEN
Deep learning using neural networks relies on a class of machine-learnable models constructed using 'differentiable programs'. These programs can combine mathematical equations specific to a particular domain of natural science with general-purpose, machine-learnable components trained on experimental data. Such programs are having a growing impact on molecular and cellular biology. In this Perspective, we describe an emerging 'differentiable biology' in which phenomena ranging from the small and specific (for example, one experimental assay) to the broad and complex (for example, protein folding) can be modeled effectively and efficiently, often by exploiting knowledge about basic natural phenomena to overcome the limitations of sparse, incomplete and noisy data. By distilling differentiable biology into a small set of conceptual primitives and illustrative vignettes, we show how it can help to address long-standing challenges in integrating multimodal data from diverse experiments across biological scales. This promises to benefit fields as diverse as biophysics and functional genomics.
Asunto(s)
Biofisica/métodos , Biología Computacional/instrumentación , Biología Computacional/métodos , Aprendizaje Profundo , Redes Neurales de la Computación , Química Computacional , Modelos Químicos , Reconocimiento de Normas Patrones Automatizadas , Conformación Proteica , Proteínas/químicaRESUMEN
BACKGROUND: Biotechnology provides a cost-effective way to produce nanomaterials such as silver oxide nanoparticles (Ag2ONPs), which have emerged as versatile entities with diverse applications. This study investigated the ability of endophytic bacteria to biosynthesize Ag2ONPs. RESULTS: A novel endophytic bacterial strain, Neobacillus niacini AUMC-B524, was isolated from Lycium shawii Roem. & Schult leaves and used to synthesize Ag2ONPS extracellularly. Plackett-Burman design and response surface approach was carried out to optimize the biosynthesis of Ag2ONPs (Bio-Ag2ONPs). Comprehensive characterization techniques, including UV-vis spectral analysis, Fourier transform infrared spectroscopy, transmission electron microscopy, X-ray diffraction, dynamic light scattering analysis, Raman microscopy, and energy dispersive X-ray analysis, confirmed the precise composition of the Ag2ONPS. Bio-Ag2ONPs were effective against multidrug-resistant wound pathogens, with minimum inhibitory concentrations (1-25 µg mL-1). Notably, Bio-Ag2ONPs demonstrated no cytotoxic effects on human skin fibroblasts (HSF) in vitro, while effectively suppressing the proliferation of human epidermoid skin carcinoma (A-431) cells, inducing apoptosis and modulating the key apoptotic genes including Bcl-2 associated X protein (Bax), B-cell lymphoma 2 (Bcl-2), Caspase-3 (Cas-3), and guardian of the genome (P53). CONCLUSIONS: These findings highlight the therapeutic potential of Bio-Ag2ONPs synthesized by endophytic N. niacini AUMC-B524, underscoring their antibacterial efficacy, anticancer activity, and biocompatibility, paving the way for novel therapeutic strategies.
Asunto(s)
Antibacterianos , Nanopartículas del Metal , Compuestos de Plata , Humanos , Nanopartículas del Metal/química , Compuestos de Plata/farmacología , Compuestos de Plata/química , Antibacterianos/farmacología , Antibacterianos/biosíntesis , Pruebas de Sensibilidad Microbiana , Bacillaceae/metabolismo , Óxidos/farmacología , Óxidos/química , Fibroblastos/efectos de los fármacos , Apoptosis/efectos de los fármacosRESUMEN
In mammalian cells, much of signal transduction is mediated by weak protein-protein interactions between globular peptide-binding domains (PBDs) and unstructured peptidic motifs in partner proteins. The number and diversity of these PBDs (over 1,800 are known), their low binding affinities and the sensitivity of binding properties to minor sequence variation represent a substantial challenge to experimental and computational analysis of PBD specificity and the networks PBDs create. Here, we introduce a bespoke machine-learning approach, hierarchical statistical mechanical modeling (HSM), capable of accurately predicting the affinities of PBD-peptide interactions across multiple protein families. By synthesizing biophysical priors within a modern machine-learning framework, HSM outperforms existing computational methods and high-throughput experimental assays. HSM models are interpretable in familiar biophysical terms at three spatial scales: the energetics of protein-peptide binding, the multidentate organization of protein-protein interactions and the global architecture of signaling networks.
Asunto(s)
Aprendizaje Automático , Péptidos/metabolismo , Proteínas/metabolismo , Transducción de Señal , Fenómenos Biofísicos , Humanos , Unión Proteica , Reproducibilidad de los Resultados , Dominios Homologos srcRESUMEN
Kinases have been the focus of drug discovery programs for three decades leading to over 70 therapeutic kinase inhibitors and biophysical affinity measurements for over 130,000 kinase-compound pairs. Nonetheless, the precise target spectrum for many kinases remains only partly understood. In this study, we describe a computational approach to unlocking qualitative and quantitative kinome-wide binding measurements for structure-based machine learning. Our study has three components: (i) a Kinase Inhibitor Complex (KinCo) data set comprising in silico predicted kinase structures paired with experimental binding constants, (ii) a machine learning loss function that integrates qualitative and quantitative data for model training, and (iii) a structure-based machine learning model trained on KinCo. We show that our approach outperforms methods trained on crystal structures alone in predicting binary and quantitative kinase-compound interaction affinities; relative to structure-free methods, our approach also captures known kinase biochemistry and more successfully generalizes to distant kinase sequences and compound scaffolds.
Asunto(s)
Descubrimiento de Drogas , Aprendizaje Automático , Inhibidores de Proteínas Quinasas/farmacologíaRESUMEN
Rational protein engineering requires a holistic understanding of protein function. Here, we apply deep learning to unlabeled amino-acid sequences to distill the fundamental features of a protein into a statistical representation that is semantically rich and structurally, evolutionarily and biophysically grounded. We show that the simplest models built on top of this unified representation (UniRep) are broadly applicable and generalize to unseen regions of sequence space. Our data-driven approach predicts the stability of natural and de novo designed proteins, and the quantitative function of molecularly diverse mutants, competitively with the state-of-the-art methods. UniRep further enables two orders of magnitude efficiency improvement in a protein engineering task. UniRep is a versatile summary of fundamental protein features that can be applied across protein engineering informatics.
Asunto(s)
Aprendizaje Profundo , Ingeniería de Proteínas/métodos , Secuencia de Aminoácidos , Mutación , Estabilidad ProteicaRESUMEN
BACKGROUND: Acute kidney injury (AKI) is associated with a severe decline in kidney function caused by abnormalities within the podocytes' glomerular matrix. Recently, AKI has been linked to alterations in glycolysis and the activity of glycolytic enzymes, including pyruvate kinase M2 (PKM2). However, the contribution of this enzyme to AKI remains largely unexplored. METHODS: Cre-loxP technology was used to examine the effects of PKM2 specific deletion in podocytes on the activation status of key signaling pathways involved in the pathophysiology of AKI by lipopolysaccharides (LPS). In addition, we used lentiviral shRNA to generate murine podocytes deficient in PKM2 and investigated the molecular mechanisms mediating PKM2 actions in vitro. RESULTS: Specific PKM2 deletion in podocytes ameliorated LPS-induced protein excretion and alleviated LPS-induced alterations in blood urea nitrogen and serum albumin levels. In addition, PKM2 deletion in podocytes alleviated LPS-induced structural and morphological alterations to the tubules and to the brush borders. At the molecular level, PKM2 deficiency in podocytes suppressed LPS-induced inflammation and apoptosis. In vitro, PKM2 knockdown in murine podocytes diminished LPS-induced apoptosis. These effects were concomitant with a reduction in LPS-induced activation of ß-catenin and the loss of Wilms' Tumor 1 (WT1) and nephrin. Notably, the overexpression of a constitutively active mutant of ß-catenin abolished the protective effect of PKM2 knockdown. Conversely, PKM2 knockdown cells reconstituted with the phosphotyrosine binding-deficient PKM2 mutant (K433E) recapitulated the effect of PKM2 depletion on LPS-induced apoptosis, ß-catenin activation, and reduction in WT1 expression. CONCLUSIONS: Taken together, our data demonstrates that PKM2 plays a key role in podocyte injury and suggests that targetting PKM2 in podocytes could serve as a promising therapeutic strategy for AKI. TRIAL REGISTRATION: Not applicable. Video abstract.
Asunto(s)
Lesión Renal Aguda , Leucemia Mieloide Aguda , Podocitos , Lesión Renal Aguda/metabolismo , Animales , Leucemia Mieloide Aguda/metabolismo , Lipopolisacáridos/farmacología , Ratones , Piruvato Quinasa/genética , Piruvato Quinasa/metabolismo , Piruvato Quinasa/farmacología , beta Catenina/metabolismoRESUMEN
Pyruvate kinase is a key regulator in glycolysis through the conversion of phosphoenolpyruvate (PEP) into pyruvate. Pyruvate kinase exists in various isoforms that can exhibit diverse biological functions and outcomes. The pyruvate kinase isoenzyme type M2 (PKM2) controls cell progression and survival through the regulation of key signaling pathways. In cancer cells, the dimer form of PKM2 predominates and plays an integral role in cancer metabolism. This predominance of the inactive dimeric form promotes the accumulation of phosphometabolites, allowing cancer cells to engage in high levels of synthetic processing to enhance their proliferative capacity. PKM2 has been recognized for its role in regulating gene expression and transcription factors critical for health and disease. This role enables PKM2 to exert profound regulatory effects that promote cancer cell metabolism, proliferation, and migration. In addition to its role in cancer, PKM2 regulates aspects essential to cellular homeostasis in non-cancer tissues and, in some cases, promotes tissue-specific pathways in health and diseases. In pursuit of understanding the diverse tissue-specific roles of PKM2, investigations targeting tissues such as the kidney, liver, adipose, and pancreas have been conducted. Findings from these studies enhance our understanding of PKM2 functions in various diseases beyond cancer. Therefore, there is substantial interest in PKM2 modulation as a potential therapeutic target for the treatment of multiple conditions. Indeed, a vast plethora of research has focused on identifying therapeutic strategies for targeting PKM2. Recently, targeting PKM2 through its regulatory microRNAs, long non-coding RNAs (lncRNAs), and circular RNAs (circRNAs) has gathered increasing interest. Thus, the goal of this review is to highlight recent advancements in PKM2 research, with a focus on PKM2 regulatory microRNAs and lncRNAs and their subsequent physiological significance.
Asunto(s)
Proteínas Portadoras/genética , Proteínas Portadoras/metabolismo , Reprogramación Celular , Metabolismo Energético , Regulación de la Expresión Génica , Proteínas de la Membrana/genética , Proteínas de la Membrana/metabolismo , Hormonas Tiroideas/genética , Hormonas Tiroideas/metabolismo , Animales , Proteínas Portadoras/antagonistas & inhibidores , Transformación Celular Neoplásica/genética , Transformación Celular Neoplásica/metabolismo , Reprogramación Celular/genética , Susceptibilidad a Enfermedades , Desarrollo de Medicamentos , Evaluación Preclínica de Medicamentos , Metabolismo Energético/genética , Inhibidores Enzimáticos/farmacología , Inhibidores Enzimáticos/uso terapéutico , Homeostasis , Humanos , Proteínas de la Membrana/antagonistas & inhibidores , Mutación , Transporte de Proteínas , Piruvato Quinasa/genética , Piruvato Quinasa/metabolismo , Interferencia de ARN , ARN Largo no Codificante/genética , Investigación , Proteínas de Unión a Hormona TiroideRESUMEN
SUMMARY: Computational prediction of protein structure from sequence is broadly viewed as a foundational problem of biochemistry and one of the most difficult challenges in bioinformatics. Once every two years the Critical Assessment of protein Structure Prediction (CASP) experiments are held to assess the state of the art in the field in a blind fashion, by presenting predictor groups with protein sequences whose structures have been solved but have not yet been made publicly available. The first CASP was organized in 1994, and the latest, CASP13, took place last December, when for the first time the industrial laboratory DeepMind entered the competition. DeepMind's entry, AlphaFold, placed first in the Free Modeling (FM) category, which assesses methods on their ability to predict novel protein folds (the Zhang group placed first in the Template-Based Modeling (TBM) category, which assess methods on predicting proteins whose folds are related to ones already in the Protein Data Bank.) DeepMind's success generated significant public interest. Their approach builds on two ideas developed in the academic community during the preceding decade: (i) the use of co-evolutionary analysis to map residue co-variation in protein sequence to physical contact in protein structure, and (ii) the application of deep neural networks to robustly identify patterns in protein sequence and co-evolutionary couplings and convert them into contact maps. In this Letter, we contextualize the significance of DeepMind's entry within the broader history of CASP, relate AlphaFold's methodological advances to prior work, and speculate on the future of this important problem.
Asunto(s)
Biología Computacional , Programas Informáticos , Bases de Datos de Proteínas , Modelos Moleculares , Conformación Proteica , ProteínasRESUMEN
BACKGROUND: Current pharmacological therapies and treatments targeting pancreatic neuroendocrine tumors (PNETs) have proven ineffective, far too often. Therefore, there is an urgent need for alternative therapeutic approaches. Zyflamend, a combination of anti-inflammatory herbal extracts, that has proven to be effective in various in vitro and in vivo cancer platforms, shows promise. However, its effects on pancreatic cancer, in particular, remain largely unexplored. METHODS: In the current study, we investigated the effects of Zyflamend on the survival of beta-TC-6 pancreatic insulinoma cells (ß-TC6) and conducted a detailed analysis of the underlying molecular mechanisms. RESULTS: Herein, we demonstrate that Zyflamend treatment decreased cell proliferation in a dose-dependent manner, concomitant with increased apoptotic cell death and cell cycle arrest at the G2/M phase. At the molecular level, treatment with Zyflamend led to the induction of ER stress, autophagy, and the activation of c-Jun N-terminal kinase (JNK) pathway. Notably, pharmacological inhibition of JNK abrogated the pro-apoptotic effects of Zyflamend. Furthermore, Zyflamend exacerbated the effects of streptozotocin and adriamycin-induced ER stress, autophagy, and apoptosis. CONCLUSION: The current study identifies Zyflamend as a potential novel adjuvant in the treatment of pancreatic cancer via modulation of the JNK pathway. Video abstract.
Asunto(s)
Apoptosis , Sistema de Señalización de MAP Quinasas , Neoplasias Pancreáticas/enzimología , Neoplasias Pancreáticas/patología , Extractos Vegetales/farmacología , Animales , Apoptosis/efectos de los fármacos , Autofagia/efectos de los fármacos , Puntos de Control del Ciclo Celular/efectos de los fármacos , Línea Celular Tumoral , Proliferación Celular/efectos de los fármacos , Doxorrubicina/farmacología , Estrés del Retículo Endoplásmico/efectos de los fármacos , Inflamación/patología , Sistema de Señalización de MAP Quinasas/efectos de los fármacos , Ratones , Modelos Biológicos , Ratas , Estreptozocina/farmacologíaRESUMEN
BACKGROUND: Rapid progress in deep learning has spurred its application to bioinformatics problems including protein structure prediction and design. In classic machine learning problems like computer vision, progress has been driven by standardized data sets that facilitate fair assessment of new methods and lower the barrier to entry for non-domain experts. While data sets of protein sequence and structure exist, they lack certain components critical for machine learning, including high-quality multiple sequence alignments and insulated training/validation splits that account for deep but only weakly detectable homology across protein space. RESULTS: We created the ProteinNet series of data sets to provide a standardized mechanism for training and assessing data-driven models of protein sequence-structure relationships. ProteinNet integrates sequence, structure, and evolutionary information in programmatically accessible file formats tailored for machine learning frameworks. Multiple sequence alignments of all structurally characterized proteins were created using substantial high-performance computing resources. Standardized data splits were also generated to emulate the difficulty of past CASP (Critical Assessment of protein Structure Prediction) experiments by resetting protein sequence and structure space to the historical states that preceded six prior CASPs. Utilizing sensitive evolution-based distance metrics to segregate distantly related proteins, we have additionally created validation sets distinct from the official CASP sets that faithfully mimic their difficulty. CONCLUSION: ProteinNet represents a comprehensive and accessible resource for training and assessing machine-learned models of protein structure.
Asunto(s)
Aprendizaje Automático , Proteínas/química , Secuencia de Aminoácidos , Bases de Datos de Proteínas , Estándares de Referencia , Alineación de SecuenciaRESUMEN
The conversion of polymer parameterization from internal coordinates (bond lengths, angles, and torsions) to Cartesian coordinates is a fundamental task in molecular modeling, often performed using the natural extension reference frame (NeRF) algorithm. NeRF can be parallelized to process multiple polymers simultaneously, but is not parallelizable along the length of a single polymer. A mathematically equivalent algorithm, pNeRF, has been derived that is parallelizable along a polymer's length. Empirical analysis demonstrates an order-of-magnitude speed up using modern GPUs and CPUs. In machine learning-based workflows, in which partial derivatives are backpropagated through NeRF equations and neural network primitives, switching to pNeRF can reduce the fractional computational cost of coordinate conversion from over two-thirds to around 10%. An optimized TensorFlow-based implementation of pNeRF is available on GitHub at https://github.com/aqlaboratory/pnerf © 2018 Wiley Periodicals, Inc.
Asunto(s)
Algoritmos , Polímeros/química , Aprendizaje Automático , Modelos MolecularesRESUMEN
BACKGROUND: Molecular interactions between proteins and DNA molecules underlie many cellular processes, including transcriptional regulation, chromosome replication, and nucleosome positioning. Computational analyses of protein-DNA interactions rely on experimental data characterizing known protein-DNA interactions structurally and biochemically. While many databases exist that contain either structural or biochemical data, few integrate these two data sources in a unified fashion. Such integration is becoming increasingly critical with the rapid growth of structural and biochemical data, and the emergence of algorithms that rely on the synthesis of multiple data types to derive computational models of molecular interactions. DESCRIPTION: We have developed an integrated affinity-structure database in which the experimental and quantitative DNA binding affinities of helix-turn-helix proteins are mapped onto the crystal structures of the corresponding protein-DNA complexes. This database provides access to: (i) protein-DNA structures, (ii) quantitative summaries of protein-DNA binding affinities using position weight matrices, and (iii) raw experimental data of protein-DNA binding instances. Critically, this database establishes a correspondence between experimental structural data and quantitative binding affinity data at the single basepair level. Furthermore, we present a novel alignment algorithm that structurally aligns the protein-DNA complexes in the database and creates a unified residue-level coordinate system for comparing the physico-chemical environments at the interface between complexes. Using this unified coordinate system, we compute the statistics of atomic interactions at the protein-DNA interface of helix-turn-helix proteins. We provide an interactive website for visualization, querying, and analyzing this database, and a downloadable version to facilitate programmatic analysis. CONCLUSIONS: This database will facilitate the analysis of protein-DNA interactions and the development of programmatic computational methods that capitalize on integration of structural and biochemical datasets. The database can be accessed at http://ProteinDNA.hms.harvard.edu.
Asunto(s)
Proteínas de Unión al ADN/genética , ADN/genética , Algoritmos , Proteínas de Unión al ADN/metabolismo , Bases de Datos Factuales , Factores de TranscripciónRESUMEN
Compressed sensing has revolutionized signal acquisition, by enabling complex signals to be measured with remarkable fidelity using a small number of so-called incoherent sensors. We show that molecular interactions, e.g., protein-DNA interactions, can be analyzed in a directly analogous manner and with similarly remarkable results. Specifically, mesoscopic molecular interactions act as incoherent sensors that measure the energies of microscopic interactions between atoms. We combine concepts from compressed sensing and statistical mechanics to determine the interatomic interaction energies of a molecular system exclusively from experimental measurements, resulting in a "de novo" energy potential. In contrast, conventional methods for estimating energy potentials are based on theoretical models premised on a priori assumptions and extensive domain knowledge. We determine the de novo energy potential for pairwise interactions between protein and DNA atoms from (i) experimental measurements of the binding affinity of protein-DNA complexes and (ii) crystal structures of the complexes. We show that the de novo energy potential can be used to predict the binding specificity of proteins to DNA with approximately 90% accuracy, compared to approximately 60% for the best performing alternative computational methods applied to this fundamental problem. This de novo potential method is directly extendable to other biomolecule interaction domains (enzymes and signaling molecule interactions) and to other classes of molecular interactions.
Asunto(s)
Simulación por Computador , Proteínas de Unión al ADN/química , ADN/química , Modelos Químicos , Cristalografía por Rayos X , Estructura Terciaria de Proteína , TermodinámicaRESUMEN
Sepsis is a potentially fatal disease that arises from an infection and is characterized by an uncontrolled immune system reaction. Global healthcare systems bear a heavy financial burden from treating sepsis. This study aimed to provide information on the effective properties of silver nanoparticles derived from pomegranate peel extract (P-AgNP) against sepsis-induced hepatic injury. P-AgNPs were spherical with a diameter of ~19 nm. The animals were placed into four groups, each with seven rats. Group 1 functioned as the control group, receiving only saline for 7 days. Group 2 received only P-AgNPs at a dose of 20 mg/kg. To induce sepsis, groups 3 and 4 were given an intraperitoneal injection of 200 mg/mL cecal slurry. Sixty min later, group 4 was given 20 mg/kg of P-AgNPs daily for 7 days. The concentrations of reduced glutathione, nitric oxide, lipid peroxidation, and superoxide dismutase in liver homogenate were measured to determine the oxidative status. In addition, enzyme activities (alanine aminotransferase, aspartate amino transferase, and alkaline phosphatase) were measured. Furthermore, we investigated the histological changes, immunohistochemical expression of nuclear factor-κB, and mRNA levels of IL1ß, IL-6, TNF-α, Bax, BCl2, and Casp-3. P-AgNPs functioned as regulators in a sepsis model, successfully controlling altered gene expression. Following treatment, P-AgNPs improved tion and oxidative state, indicating a role in sepsis management. Based on our findings, we conclude that P-AgNPs have antioxidant activity and may be useful in preventing sepsis-induced liver inflammation, oxidative damage, and apoptosis. RESEARCH HIGHLIGHTS: Pomegranate peel-derived silver nanoparticles (P-AgNPs) enhanced liver function and oxidative state in rats with sepsis-induced hepatic damage. P-AgNPs reduced oxidative stress and liver inflammation via regulating inflammatory and apoptotic gene expression. P-AgNPs enhanced liver enzyme activities, histological structure, and immunohistochemistry expression of nuclear factor-κB.
RESUMEN
Single-nucleotide variants (SNVs) in key T cell genes can drive clinical pathologies and could be repurposed to improve cellular cancer immunotherapies. Here, we perform massively parallel base-editing screens to generate thousands of variants at gene loci annotated with known or potential clinical relevance. We discover a broad landscape of putative gain-of-function (GOF) and loss-of-function (LOF) mutations, including in PIK3CD and the gene encoding its regulatory subunit, PIK3R1, LCK, SOS1, AKT1 and RHOA. Base editing of PIK3CD and PIK3R1 variants in T cells with an engineered T cell receptor specific to a melanoma epitope or in different generations of CD19 chimeric antigen receptor (CAR) T cells demonstrates that discovered GOF variants, but not LOF or silent mutation controls, enhanced signaling, cytokine production and lysis of cognate melanoma and leukemia cell models, respectively. Additionally, we show that generations of CD19 CAR T cells engineered with PIK3CD GOF mutations demonstrate enhanced antigen-specific signaling, cytokine production and leukemia cell killing, including when benchmarked against other recent strategies.
RESUMEN
Compulsive exercise is a condition characterized by uncontrollable exercise behaviour that may lead to severe and harmful physical and psychological consequences. Indeed, compulsive exercise is among the early symptoms of eating disorders that may affect different age groups. Globally and among Arab countries, compulsive exercise is common, while the screening methods used to assess compulsive exercise are limited. Thus, the Compulsive Exercise Test (CET) has emerged as a tool to assess cognitive, behavioural, and emotional factors related to compulsive exercise. The CET is a self-report, Likert-type scale comprising five distinct subscales. The increase in the CET scores is more likely associated with worsened pathology. Since the Arab countries lack such an assessment tool, we aimed to translate the CET into Arabic, validate the translated version, confirm the factor structures, and assess the internal consistency of the different subscales. Herein, we used the forward-backward translation method as recommended by the World Health Organization (WHO). The overall validity index of the translated version showed a score higher than 0.78, while the scale-level content validity index based on the average calculating method (S-CVI/Ave) and the agreement method (S-CVI/UA) were 0.91 and 0.58, respectively. Moreover, we recruited 399 Arabs living in Saudi to measure the internal consistency, and the value of the substantive internal consistency with Cronbach's α was 0.81. Subsequently, four of the Arabic-CET subscales had substantive internal consistency with Cronbach's α values higher than or equal to 0.70. Furthermore, the exploratory factor analysis results supported the substantial use of the five-subscale model. Taken together, our study supports using the Arabic-CET version to measure exercise compulsiveness among Arabs.