RESUMEN
Heritability in the immune tumor microenvironment (iTME) has been widely observed yet remains largely uncharacterized. Here, we developed a machine learning approach to map iTME modifiers within loci from genome-wide association studies (GWASs) for breast cancer (BrCa) incidence. A random forest model was trained on a positive set of immune-oncology (I-O) targets, and then used to assign I-O target probability scores to 1,362 candidate genes in linkage disequilibrium with 155 BrCa GWAS loci. Cluster analysis of the most probable candidates revealed two subfamilies of genes related to effector functions and adaptive immune responses, suggesting that iTME modifiers impact multiple aspects of anticancer immunity. Two of the top ranking BrCa candidates, LSP1 and TLR1, were orthogonally validated as iTME modifiers using BrCa patient biopsies and comparative mapping studies, respectively. Collectively, these data demonstrate a robust and flexible framework for functionally fine-mapping GWAS risk loci to identify translatable therapeutic targets.
RESUMEN
BACKGROUND: The remarkable growth of genome-wide association studies (GWAS) has created a critical need to experimentally validate the disease-associated variants, 90% of which involve non-coding variants. METHODS: To determine how the field is addressing this urgent need, we performed a comprehensive literature review identifying 36,676 articles. These were reduced to 1454 articles through a set of filters using natural language processing and ontology-based text-mining. This was followed by manual curation and cross-referencing against the GWAS catalog, yielding a final set of 286 articles. RESULTS: We identified 309 experimentally validated non-coding GWAS variants, regulating 252 genes across 130 human disease traits. These variants covered a variety of regulatory mechanisms. Interestingly, 70% (215/309) acted through cis-regulatory elements, with the remaining through promoters (22%, 70/309) or non-coding RNAs (8%, 24/309). Several validation approaches were utilized in these studies, including gene expression (n = 272), transcription factor binding (n = 175), reporter assays (n = 171), in vivo models (n = 104), genome editing (n = 96) and chromatin interaction (n = 33). CONCLUSIONS: This review of the literature is the first to systematically evaluate the status and the landscape of experimentation being used to validate non-coding GWAS-identified variants. Our results clearly underscore the multifaceted approach needed for experimental validation, have practical implications on variant prioritization and considerations of target gene nomination. While the field has a long way to go to validate the thousands of GWAS associations, we show that progress is being made and provide exemplars of validation studies covering a wide variety of mechanisms, target genes, and disease areas.
Asunto(s)
Estudio de Asociación del Genoma Completo , Secuencias Reguladoras de Ácidos Nucleicos , Humanos , Fenotipo , Regiones Promotoras GenéticasRESUMEN
The rapid emergence of pesticide resistance has given rise to a demand for herbicides with new mode of action (MoA). In the agrochemical sector, with the availability of experimental high throughput screening (HTS) data, it is now possible to utilize in silico target prediction methods in the early discovery phase to suggest the MoA of a compound via data mining of bioactivity data. While having been established in the pharmaceutical context, in the agrochemical area this approach poses rather different challenges, as we have found in this work, partially due to different chemistry, but even more so due to different (usually smaller) amounts of data, and different ways of conducting HTS. With the aim to apply computational methods for facilitating herbicide target identification, 48,000 bioactivity data against 16 herbicide targets were processed to train Laplacian modified Naïve Bayesian (NB) classification models. The herbicide target prediction model ("HerbiMod") is an ensemble of 16 binary classification models which are evaluated by internal, external and prospective validation sets. In addition to the experimental inactives, 10,000 random agrochemical inactives were included in the training process, which showed to improve the overall balanced accuracy of our models up to 40%. For all the models, performance in terms of balanced accuracy of≥80% was achieved in five-fold cross validation. Ranking target predictions was addressed by means of z-scores which improved predictivity over using raw scores alone. An external testset of 247 compounds from ChEMBL and a prospective testset of 394 compounds from BASF SE tested against five well studied herbicide targets (ACC, ALS, HPPD, PDS and PROTOX) were used for further validation. Only 4% of the compounds in the external testset lied in the applicability domain and extrapolation (and correct prediction) was hence impossible, which on one hand was surprising, and on the other hand illustrated the utilization of using applicability domains in the first place. However, performance better than 60% in balanced accuracy was achieved on the prospective testset, where all the compounds fell within the applicability domain, and which hence underlines the possibility of using target prediction also in the area of agrochemicals.
Asunto(s)
Agroquímicos/química , Descubrimiento de Drogas , Herbicidas/química , Relación Estructura-Actividad Cuantitativa , Simulación por Computador , Ensayos Analíticos de Alto Rendimiento , Estudios ProspectivosRESUMEN
The analysis of Structure-Activity-Relationships (SAR) of small molecules is a fundamental task in drug discovery. Although a large number of methods are already published, there is still a strong need for novel intuitive approaches. The inSARa (intuitive networks for Structure-Activity Relationships analysis) method introduced herein takes advantage of the synergistic combination of reduced graphs (RG) and the intuitive maximum common substructure (MCS) concept. The main feature of the inSARa concept is a hierarchical network structure of clearly defined substructure relationships based on common pharmacophoric features. Thus, straightforward SAR interpretation is possible by interactive network navigation. When focusing on a set of active molecules at one single target, the resulting inSARa networks are shown to be valuable for various essential tasks in SAR analysis, such as the identification of activity cliffs or "activity switches", bioisosteric exchanges, common pharmacophoric features, or "SAR hotspots".
Asunto(s)
Gráficos por Computador , Descubrimiento de Drogas/métodos , Relación Estructura-Actividad , Bases de Datos Farmacéuticas , Preparaciones Farmacéuticas/químicaRESUMEN
INTRODUCTION: Currently, there are many novel drugs that belong to class III or IV of the Biopharmaceutics Classification System, showing low bioavailability. Tight junction (TJ) modulation offers an approach to increase bioavailability of pharmaceutical compounds. Furthermore, some diseases are accompanied by disturbed barrier function or TJ dysregulation and thus represent a second application for TJ modulators. AREAS COVERED: This review contains a summary of three different TJ modulators: AT1002, PN159 and labradimil. Within this summary, the authors provide a description of their effects on TJs, their adverse effects and their success in clinical trials. Furthermore, the authors present the current understanding of TJ regulation and highlight opportunities to develop new TJ modulators; they also review the problems that might occur. EXPERT OPINION: The development of new mechanism-based (MB) TJ modulators is a very promising field of research. MB approaches are expected to have the best future prospects. Further elucidation of signaling pathways and TJ regulation will be necessary for advancing MB TJ modulator research.