RESUMEN
Cystic Fibrosis Transmembrane Conductance Regulator (CFTR) is a chloride and bicarbonate channel in secretory epithelia with a critical role in maintaining fluid homeostasis. Mutations in CFTR are associated with Cystic Fibrosis (CF), the most common lethal autosomal recessive disorder in Caucasians. While remarkable treatment advances have been made recently in the form of modulator drugs directly rescuing CFTR dysfunction, there is still considerable scope for improvement of therapeutic effectiveness. Here, we report the application of a high-throughput screening variant of the Mammalian Membrane Two-Hybrid (MaMTH-HTS) to map the protein-protein interactions of wild-type (wt) and mutant CFTR (F508del), in an effort to better understand CF cellular effects and identify new drug targets for patient-specific treatments. Combined with functional validation in multiple disease models, we have uncovered candidate proteins with potential roles in CFTR function/CF pathophysiology, including Fibrinogen Like 2 (FGL2), which we demonstrate in patient-derived intestinal organoids has a significant effect on CFTR functional expression.
Asunto(s)
Regulador de Conductancia de Transmembrana de Fibrosis Quística , Fibrosis Quística , Animales , Membrana Celular/metabolismo , Fibrosis Quística/tratamiento farmacológico , Fibrosis Quística/genética , Fibrosis Quística/metabolismo , Regulador de Conductancia de Transmembrana de Fibrosis Quística/genética , Regulador de Conductancia de Transmembrana de Fibrosis Quística/metabolismo , Fibrinógeno/genética , Fibrinógeno/metabolismo , Fibrinógeno/farmacología , Ensayos Analíticos de Alto Rendimiento , Humanos , Mamíferos , MutaciónRESUMEN
Artificial intelligence (AI) is currently regaining enormous interest due to the success of machine learning (ML), and in particular deep learning (DL). Image analysis, and thus radiomics, strongly benefits from this research. However, effectively and efficiently integrating diverse clinical, imaging, and molecular profile data is necessary to understand complex diseases, and to achieve accurate diagnosis in order to provide the best possible treatment. In addition to the need for sufficient computing resources, suitable algorithms, models, and data infrastructure, three important aspects are often neglected: (1) the need for multiple independent, sufficiently large and, above all, high-quality data sets; (2) the need for domain knowledge and ontologies; and (3) the requirement for multiple networks that provide relevant relationships among biological entities. While one will always get results out of high-dimensional data, all three aspects are essential to provide robust training and validation of ML models, to provide explainable hypotheses and results, and to achieve the necessary trust in AI and confidence for clinical applications.
Asunto(s)
Inteligencia Artificial , Biología Computacional , Imagen Molecular , Biomarcadores/metabolismo , Humanos , Procesamiento de Imagen Asistido por ComputadorRESUMEN
G-protein-coupled receptors (GPCRs) are the largest family of integral membrane receptors with key roles in regulating signaling pathways targeted by therapeutics, but are difficult to study using existing proteomics technologies due to their complex biochemical features. To obtain a global view of GPCR-mediated signaling and to identify novel components of their pathways, we used a modified membrane yeast two-hybrid (MYTH) approach and identified interacting partners for 48 selected full-length human ligand-unoccupied GPCRs in their native membrane environment. The resulting GPCR interactome connects 686 proteins by 987 unique interactions, including 299 membrane proteins involved in a diverse range of cellular functions. To demonstrate the biological relevance of the GPCR interactome, we validated novel interactions of the GPR37, serotonin 5-HT4d, and adenosine ADORA2A receptors. Our data represent the first large-scale interactome mapping for human GPCRs and provide a valuable resource for the analysis of signaling pathways involving this druggable family of integral membrane proteins.
Asunto(s)
Mapeo de Interacción de Proteínas/métodos , Mapas de Interacción de Proteínas , Receptores Acoplados a Proteínas G/metabolismo , Membrana Celular/metabolismo , Humanos , Receptor de Adenosina A2A/metabolismo , Receptores de Serotonina 5-HT4/metabolismo , Transducción de Señal , Técnicas del Sistema de Dos HíbridosRESUMEN
Technological advances and high-throughput bio-chemical assays are rapidly changing ways how we formulate and test biological hypotheses, and how we treat patients. Most complex diseases arise on a background of genetics, lifestyle and environment factors, and manifest themselves as a spectrum of symptoms. To fathom intricate biological processes and their changes from healthy to disease states, we need to systematically integrate and analyze multi-omics datasets, ontologies, and diverse annotations. Without proper management of such complex biological and clinical data, artificial intelligence (AI) algorithms alone cannot be effectively trained, validated, and successfully applied to provide trustworthy and patient-centric diagnosis, prognosis and treatment. Precision medicine requires to use multi-omics approaches effectively, and offers many opportunities for using AI, "big data" analytics, and integrative computational biology workflows. Advances in optical and biochemical assay technologies including sequencing, mass spectrometry and imaging modalities have transformed research by empowering us to simultaneously view all genes expressed, identify proteome-wide changes, and assess interacting partners of each individual protein within a dynamically changing biological system, at an individual cell level. While such views are already having an impact on our understanding of healthy and disease conditions, it remains challenging to extract useful information comprehensively and systematically from individual studies, ensure that signal is separated from noise, develop models, and provide hypotheses for further research. Data remain incomplete and are often poorly connected using fragmented biological networks. In addition, statistical and machine learning models are developed at a cohort level and often not validated at the individual patient level. Combining integrative computational biology and AI has the potential to improve understanding and treatment of diseases by identifying biomarkers and building explainable models characterizing individual patients. From systematic data analysis to more specific diagnostic, prognostic and predictive biomarkers, drug mechanism of action, and patient selection, such analyses influence multiple steps from prevention to disease characterization, and from prognosis to drug discovery. Data mining, machine learning, graph theory and advanced visualization may help identify diagnostic, prognostic and predictive biomarkers, and create causal models of disease. Intertwining computational prediction and modeling with biological experiments leads to faster, more biologically and clinically relevant discoveries. However, computational analysis results and models are going to be only as accurate and useful as correct and comprehensive are the networks, ontologies and datasets used to build them. High quality, curated data portals provide the necessary foundation for translational research. They help to identify better biomarkers, new drugs, precision treatments, and should lead to improved patient outcomes and their quality of life. Intertwining computational prediction and modeling with biological experiments, efficiently and effectively leads to more useful findings faster.
RESUMEN
Introduction: Kidney transplantation is the optimal treatment for end-stage kidney disease; however, premature allograft loss remains a serious issue. While many high-throughput omics studies have analyzed patient allograft biospecimens, integration of these datasets is challenging, which represents a considerable barrier to advancing our understanding of the mechanisms of allograft loss. Methods: To facilitate integration, we have created a curated database containing all open-access high-throughput datasets from human kidney transplant studies, termed NephroDIP (Nephrology Data Integration Portal). PubMed was searched for high-throughput transcriptomic, proteomic, single nucleotide variant, metabolomic, and epigenomic studies in kidney transplantation, which yielded 9,964 studies. Results: From these, 134 studies with available data detailing 260 comparisons and 83,262 molecules were included in NephroDIP v1.0. To illustrate the capabilities of NephroDIP, we have used the database to identify common gene, protein, and microRNA networks that are disrupted in patients with chronic antibody-mediated rejection, the most important cause of late allograft loss. We have also explored the role of an immunomodulatory protein galectin-1 (LGALS1), along with its interactors and transcriptional regulators, in kidney allograft injury. We highlight the pathways enriched among LGALS1 interactors and transcriptional regulators in kidney fibrosis and during immunosuppression. Discussion: NephroDIP is an open access data portal that facilitates data visualization and will help provide new insights into existing kidney transplant data through integration of distinct studies and modules (https://ophid.utoronto.ca/NephroDIP).
Asunto(s)
Rechazo de Injerto , Trasplante de Riñón , Humanos , Trasplante de Riñón/efectos adversos , Rechazo de Injerto/inmunología , Rechazo de Injerto/genética , Aloinjertos/inmunología , Bases de Datos Factuales , Riñón/metabolismo , Riñón/patología , Riñón/inmunología , Proteómica/métodosRESUMEN
Objective: OsteoDIP aims to collect and provide, in a simple searchable format, curated high throughput RNA expression data related to osteoarthritis. Design: Datasets are collected annually by searching "osteoarthritis gene expression profile" in PubMed. Only publications containing patient data and a list of differentially expressed genes are considered. From 2020, the search has expanded to include non-coding RNAs. Moreover, a search in GEO for "osteoarthritis" datasets has been performed using 'Homo sapiens' and 'Expression profiling by array' filters. Annotations for genes linked to osteoarthritis have been downloaded from external databases. Results: Out of 1204 curated papers, 63 have been included in OsteoDIP, while GEO curation led to the collection of 28 datasets. Literature data provides a snapshot of osteoarthritis research derived from 1924 human samples, while GEO datasets provide expression for additional 1012 patients. Similar to osteoarthritis literature, OsteoDIP data has been created mostly from studies focused on knee, and the tissue most frequently investigated is cartilage. GEO data sets were fully integrated with associated clinical data. We showcase examples and use cases applicable for translational research in osteoarthritis. Conclusions: OsteoDIP is publicly available at http://ophid.utoronto.ca/OsteoDIP. The website is easy to navigate and all the data is available for download. Data consolidation allows researchers to perform comparisons across studies and to combine data from different datasets. Our examples show how OsteoDIP can integrate with and improve osteoarthritis researchers' pipelines.
RESUMEN
Discovery of non-coding RNAs continues to provide new insights into some of the key molecular drivers of musculoskeletal diseases. Among these, microRNAs have received widespread attention for their roles in osteoarthritis and rheumatoid arthritis. With evidence to suggest that long non-coding RNAs and circular RNAs function as competing endogenous RNAs to sponge microRNAs, the net effect on gene expression in specific disease contexts can be elusive. Studies to date have focused on elucidating individual long non-coding-microRNA-gene target axes and circular RNA-microRNA-gene target axes, with a paucity of data integrating experimentally validated effects of non-coding RNAs. To address this gap, we curated recent studies reporting non-coding RNA axes in chondrocytes from human osteoarthritis and in fibroblast-like synoviocytes from human rheumatoid arthritis. Using an integrative computational biology approach, we then combined the findings into cell- and disease-specific networks for in-depth interpretation. We highlight some challenges to data integration, including non-existent naming conventions and out-of-date databases for non-coding RNAs, and some successes exemplified by the International Molecular Exchange Consortium for protein interactions. In this perspective article, we suggest that data integration is a useful in silico approach for creating non-coding RNA networks in arthritis and prioritizing interactions for further in vitro and in vivo experimentation in translational research.