RESUMEN
BACKGROUND: The association between cancer and multiple sclerosis has long been investigated. Several studies and reviews have examined the risk of cancer among patients with multiple sclerosis treated with disease-modifying therapies (DMTs) but with conflicting results. This study will aim to investigate the association between DMTs for multiple sclerosis and subsequent cancer risk using research synthesis methods. METHODS/DESIGN: We designed and registered a study protocol for a systematic review and meta-analysis. We will include randomised and non-randomised trials, prospective or retrospective cohort studies, and case-control studies of treatment with DMTs compared with placebo, no treatment, or another active agent. The primary outcome will be the risk of cancer (all-malignant neoplasms) in association with the exposure of DMTs. Secondary outcomes will include site-specific cancers (e.g. breast cancer). Literature searches will be conducted in multiple electronic databases (from their inception onwards), including the following: PubMed/MEDLINE, EMBASE, and Cochrane Central Register of Controlled Trials (CENTRAL). Two researchers will screen all citations, full-text articles, and abstract data independently. The risk of bias (quality) of individual studies will be appraised using an appropriate tool. If feasible, we will use a two-stage approach to evidence synthesis: (1) Peto's method for meta-analysis of data from randomised trials alone; and (2) Random-effects model for meta-analysis adding data from non-randomised studies. We will calculate odds ratios and their associated 95% confidence intervals. Potential sources of heterogeneity will be explored in additional analyses (e.g. subgroups considering different DMTs individually, mechanism of action, type of control, length of follow-up, mode of treatment). DISCUSSION: This systematic review and meta-analysis of randomised and non-randomised studies will provide an updated synthesis of the risk of cancer associated with DMTs for adult patients with multiple sclerosis. This study will also examine some factors that may explain potential variations across studies. The findings will be published in a peer-reviewed journal. SYSTEMATIC REVIEW REGISTRATION: Open Science Framework ( https://osf.io/v4sez ).
Asunto(s)
Esclerosis Múltiple , Neoplasias , Revisiones Sistemáticas como Asunto , Humanos , Esclerosis Múltiple/tratamiento farmacológico , Proyectos de Investigación , Metaanálisis como Asunto , Ensayos Clínicos Controlados Aleatorios como Asunto , Factores de RiesgoRESUMEN
Precision medicine aims at tailoring treatments to individual patient's characteristics. In this regard, recognizing the significance of sex and gender becomes indispensable for meeting the distinct healthcare needs of diverse populations. To this end, continuing a trend of improving data quality observed since 2014, the European Genome-phenome Archive (EGA) established a policy in 2018 that mandates data providers to declare the sex of donor samples, aiming to enhance data accuracy and prevent imbalance in sex classification. We analyzed sex classification imbalance in human data from EGA and the U.S. counterpart, the database of genotypes and phenotypes (dbGaP). Our findings show a significant decrease in samples classified as unknown in EGA, potentially promoting better sex reporting during data collection. Based on our findings, we raise awareness of sample imbalance problems and provide a list of recommendations for enhancing biomedical research practices.
Asunto(s)
Genoma Humano , Neoplasias , Neoplasias/genética , Humanos , Genómica/métodos , Bases de Datos GenéticasRESUMEN
MOTIVATION: The interpretation of genomic data is crucial to understand the molecular mechanisms of biological processes. Protein structures play a vital role in facilitating this interpretation by providing functional context to genetic coding variants. However, mapping genes to proteins is a tedious and error-prone task due to inconsistencies in data formats. Over the past two decades, numerous tools and databases have been developed to automatically map annotated positions and variants to protein structures. However, most of these tools are web-based and not well-suited for large-scale genomic data analysis. RESULTS: To address this issue, we introduce 3Dmapper, a stand-alone command-line tool developed in Python and R. It systematically maps annotated protein positions and variants to protein structures, providing a solution that is both efficient and reliable. AVAILABILITY AND IMPLEMENTATION: https://github.com/vicruiser/3Dmapper.
Asunto(s)
Bancos de Muestras Biológicas , Programas Informáticos , Proteínas/química , GenómicaRESUMEN
BACKGROUND: Chronic Obstructive Pulmonary Disease (COPD) is a heterogeneous condition. We hypothesized that the unbiased integration of different COPD lung omics using a novel multi-layer approach may unravel mechanisms associated with clinical characteristics. METHODS: We profiled mRNA, miRNA and methylome in lung tissue samples from 135 former smokers with COPD. For each omic (layer) we built a patient network based on molecular similarity. The three networks were used to build a multi-layer network, and optimization of multiplex-modularity was employed to identify patient communities across the three distinct layers. Uncovered communities were related to clinical features. RESULTS: We identified five patient communities in the multi-layer network which were molecularly distinct and related to clinical characteristics, such as FEV1 and blood eosinophils. Two communities (C#3 and C#4) had both similarly low FEV1 values and emphysema, but were molecularly different: C#3, but not C#4, presented B and T cell signatures and a downregulation of secretory (SCGB1A1/SCGB3A1) and ciliated cells. A machine learning model was set up to discriminate C#3 and C#4 in our cohort, and to validate them in an independent cohort. Finally, using spatial transcriptomics we characterized the small airway differences between C#3 and C#4, identifying an upregulation of T/B cell homing chemokines, and bacterial response genes in C#3. CONCLUSIONS: A novel multi-layer network analysis is able to identify clinically relevant COPD patient communities. Patients with similarly low FEV1 and emphysema can have molecularly distinct small airways and immune response patterns, indicating that different endotypes can lead to similar clinical presentation.
RESUMEN
BACKGROUND: The co-administration of drugs known to interact greatly impacts morbidity, mortality, and health economics. This study aims to examine the drug-drug interaction (DDI) phenomenon with a large-scale longitudinal analysis of age and gender differences found in drug administration data from three distinct healthcare systems. METHODS: This study analyzes drug administrations from population-wide electronic health records in Blumenau (Brazil; 133 K individuals), Catalonia (Spain; 5.5 M individuals), and Indianapolis (USA; 264 K individuals). The stratified prevalences of DDI for multiple severity levels per patient gender and age at the time of administration are computed, and null models are used to estimate the expected impact of polypharmacy on DDI prevalence. Finally, to study actionable strategies to reduce DDI prevalence, alternative polypharmacy regimens using drugs with fewer known interactions are simulated. RESULTS: A large prevalence of co-administration of drugs known to interact is found in all populations, affecting 12.51%, 12.12%, and 10.06% of individuals in Blumenau, Indianapolis, and Catalonia, respectively. Despite very different healthcare systems and drug availability, the increasing prevalence of DDI as patients age is very similar across all three populations and is not explained solely by higher co-administration rates in the elderly. In general, the prevalence of DDI is significantly higher in women - with the exception of men over 50 years old in Indianapolis. Finally, we show that using proton pump inhibitor alternatives to omeprazole (the drug involved in more co-administrations in Catalonia and Blumenau), the proportion of patients that are administered known DDI can be reduced by up to 21% in both Blumenau and Catalonia and 2% in Indianapolis. CONCLUSIONS: DDI administration has a high incidence in society, regardless of geographic, population, and healthcare management differences. Although DDI prevalence increases with age, our analysis points to a complex phenomenon that is much more prevalent than expected, suggesting comorbidities as key drivers of the increase. Furthermore, the gender differences observed in most age groups across populations are concerning in regard to gender equity in healthcare. Finally, our study exemplifies how electronic health records' analysis can lead to actionable interventions that significantly reduce the administration of known DDI and its associated human and economic costs.
Asunto(s)
Polifarmacia , Masculino , Humanos , Femenino , Anciano , Persona de Mediana Edad , Preparaciones Farmacéuticas , Prevalencia , Interacciones Farmacológicas , ComorbilidadRESUMEN
According to the Principle of Minimal Frustration, folded proteins can only have a minimal number of strong energetic conflicts in their native states. However, not all interactions are energetically optimized for folding but some remain in energetic conflict, i.e. they are highly frustrated. This remaining local energetic frustration has been shown to be statistically correlated with distinct functional aspects such as protein-protein interaction sites, allosterism and catalysis. Fuelled by the recent breakthroughs in efficient protein structure prediction that have made available good quality models for most proteins, we have developed a strategy to calculate local energetic frustration within large protein families and quantify its conservation over evolutionary time. Based on this evolutionary information we can identify how stability and functional constraints have appeared at the common ancestor of the family and have been maintained over the course of evolution. Here, we present FrustraEvo, a web server tool to calculate and quantify the conservation of local energetic frustration in protein families.
Asunto(s)
Internet , Pliegue de Proteína , Proteínas , Programas Informáticos , Proteínas/química , Termodinámica , Conformación Proteica , Evolución Molecular , Modelos MolecularesRESUMEN
Activation of the p53 tumor suppressor triggers a transcriptional program to control cellular response to stress. However, the molecular mechanisms by which p53 controls gene transcription are not completely understood. Here, we uncover the critical role of spatio-temporal genome architecture in this process. We demonstrate that p53 drives direct and indirect changes in genome compartments, topologically associating domains, and DNA loops prior to one hour of its activation, which escort the p53 transcriptional program. Focusing on p53-bound enhancers, we report 340 genes directly regulated by p53 over a median distance of 116 kb, with 74% of these genes not previously identified. Finally, we showcase that p53 controls transcription of distal genes through newly formed and pre-existing enhancer-promoter loops in a cohesin dependent manner. Collectively, our findings demonstrate a previously unappreciated architectural role of p53 as regulator at distinct topological layers and provide a reliable set of new p53 direct target genes that may help designs of cancer therapies.
Asunto(s)
Cohesinas , Proteína p53 Supresora de Tumor , Proteína p53 Supresora de Tumor/genética , Proteína p53 Supresora de Tumor/metabolismo , Secuencias Reguladoras de Ácidos Nucleicos , ADN , Cromatina/genéticaRESUMEN
Safe and effective severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) vaccines are crucial to fight against the coronavirus disease 2019 pandemic. Most vaccines are based on a mutated version of the Spike glycoprotein [K986P/V987P (S-2P)] with improved stability, yield and immunogenicity. However, S-2P is still produced at low levels. Here, we describe the V987H mutation that increases by two-fold the production of the recombinant Spike and the exposure of the receptor binding domain (RBD). S-V987H immunogenicity is similar to S-2P in mice and golden Syrian hamsters (GSH), and superior to a monomeric RBD. S-V987H immunization confer full protection against severe disease in K18-hACE2 mice and GSH upon SARS-CoV-2 challenge (D614G or B.1.351 variants). Furthermore, S-V987H immunized K18-hACE2 mice show a faster tissue viral clearance than RBD- or S-2P-vaccinated animals challenged with D614G, B.1.351 or Omicron BQ1.1 variants. Thus, S-V987H protein might be considered for future SARS-CoV-2 vaccines development.
Asunto(s)
COVID-19 , Melfalán , SARS-CoV-2 , gammaglobulinas , Cricetinae , Animales , Humanos , Ratones , Mesocricetus , Vacunas contra la COVID-19 , COVID-19/prevención & control , Glicoproteína de la Espiga del Coronavirus/genética , Inmunización , Glicoproteínas , Anticuerpos Neutralizantes , Anticuerpos AntiviralesRESUMEN
Exploring the molecular basis of disease severity in rare disease scenarios is a challenging task provided the limitations on data availability. Causative genes have been described for Congenital Myasthenic Syndromes (CMS), a group of diverse minority neuromuscular junction (NMJ) disorders; yet a molecular explanation for the phenotypic severity differences remains unclear. Here, we present a workflow to explore the functional relationships between CMS causal genes and altered genes from each patient, based on multilayer network community detection analysis of complementary biomedical information provided by relevant data sources, namely protein-protein interactions, pathways and metabolomics. Our results show that CMS severity can be ascribed to the personalized impairment of extracellular matrix components and postsynaptic modulators of acetylcholine receptor (AChR) clustering. This work showcases how coupling multilayer network analysis with personalized -omics information provides molecular explanations to the varying severity of rare diseases; paving the way for sorting out similar cases in other rare diseases.
Asunto(s)
Síndromes Miasténicos Congénitos , Humanos , Síndromes Miasténicos Congénitos/genética , Síndromes Miasténicos Congénitos/diagnóstico , Unión Neuromuscular/metabolismo , Enfermedades Raras/metabolismo , Flujo de Trabajo , Receptores Colinérgicos/genética , Receptores Colinérgicos/metabolismo , MutaciónRESUMEN
BACKGROUND: Neoantigens are patient- and tumor-specific peptides that arise from somatic mutations. They stand as promising targets for personalized therapeutic cancer vaccines. The identification process for neoantigens has evolved with the use of next-generation sequencing technologies and bioinformatic tools in tumor genomics. However, in-silico strategies for selecting immunogenic neoantigens still have very low accuracy rates, since they mainly focus on predicting peptide binding to Major Histocompatibility Complex (MHC) molecules, which is key but not the sole determinant for immunogenicity. Moreover, the therapeutic potential of neoantigen-based vaccines may be enhanced using an optimal delivery platform that elicits robust de novo immune responses. METHODS: We developed a novel neoantigen selection pipeline based on existing software combined with a novel prediction method, the Neoantigen Optimization Algorithm (NOAH), which takes into account structural features of the peptide/MHC-I interaction, as well as the interaction between the peptide/MHC-I complex and the TCR, in its prediction strategy. Moreover, to maximize neoantigens' therapeutic potential, neoantigen-based vaccines should be manufactured in an optimal delivery platform that elicits robust de novo immune responses and bypasses central and peripheral tolerance. RESULTS: We generated a highly immunogenic vaccine platform based on engineered HIV-1 Gag-based Virus-Like Particles (VLPs) expressing a high copy number of each in silico selected neoantigen. We tested different neoantigen-loaded VLPs (neoVLPs) in a B16-F10 melanoma mouse model to evaluate their capability to generate new immunogenic specificities. NeoVLPs were used in in vivo immunogenicity and tumor challenge experiments. CONCLUSIONS: Our results indicate the relevance of incorporating other immunogenic determinants beyond the binding of neoantigens to MHC-I. Thus, neoVLPs loaded with neoantigens enhancing the interaction with the TCR can promote the generation of de novo antitumor-specific immune responses, resulting in a delay in tumor growth. Vaccination with the neoVLP platform is a robust alternative to current therapeutic vaccine approaches and a promising candidate for future personalized immunotherapy.
Asunto(s)
Vacunas contra el Cáncer , Neoplasias , Vacunas , Humanos , Animales , Ratones , Neoplasias/genética , Antígenos de Neoplasias/metabolismo , Péptidos , Receptores de Antígenos de Linfocitos T/metabolismo , Inmunoterapia/métodosRESUMEN
To enable a collective effort that generates a new level of UNderstanding CANcer (UNCAN.eu) [Cancer Discov (2022) 12 (11): OF1], the European Union supports the creation of a sustainable platform that connects cancer research across Member States. A workshop hosted in Heidelberg gathered European cancer experts to identify ongoing initiatives that may contribute to building this platform and discuss the governance and long-term evolution of a European Federated Cancer Data Hub.
Asunto(s)
Neoplasias , Humanos , Investigación , Unión EuropeaRESUMEN
We introduce Promoter-Enhancer-Guided Interaction Networks (PENGUIN), a method for studying protein-protein interaction (PPI) networks within enhancer-promoter interactions. PENGUIN integrates H3K27ac-HiChIP data with tissue-specific PPIs to define enhancer-promoter PPI networks (EPINs). We validated PENGUIN using cancer (LNCaP) and benign (LHSAR) prostate cell lines. Our analysis detected EPIN clusters enriched with the architectural protein CTCF, a regulator of enhancer-promoter interactions. CTCF presence was coupled with the prevalence of prostate cancer (PrCa) single nucleotide polymorphisms (SNPs) within the same EPIN clusters, suggesting functional implications in PrCa. Within the EPINs displaying enrichments in both CTCF and PrCa SNPs, we also show enrichment in oncogenes. We substantiated our identified SNPs through CRISPR/Cas9 knockout and RNAi screens experiments. Here we show that PENGUIN provides insights into the intricate interplay between enhancer-promoter interactions and PPI networks, which are crucial for identifying key genes and potential intervention targets. A dedicated server is available at https://penguin.life.bsc.es/ .
Asunto(s)
Neoplasias de la Próstata , Spheniscidae , Masculino , Animales , Humanos , Spheniscidae/genética , Elementos de Facilitación Genéticos/genética , Regiones Promotoras Genéticas/genética , Neoplasias de la Próstata/genética , Proteínas/genéticaRESUMEN
Energetic local frustration offers a biophysical perspective to interpret the effects of sequence variability on protein families. Here we present a methodology to analyze local frustration patterns within protein families and superfamilies that allows us to uncover constraints related to stability and function, and identify differential frustration patterns in families with a common ancestry. We analyze these signals in very well studied protein families such as PDZ, SH3, É and ß globins and RAS families. Recent advances in protein structure prediction make it possible to analyze a vast majority of the protein space. An automatic and unsupervised proteome-wide analysis on the SARS-CoV-2 virus demonstrates the potential of our approach to enhance our understanding of the natural phenotypic diversity of protein families beyond single protein instances. We apply our method to modify biophysical properties of natural proteins based on their family properties, as well as perform unsupervised analysis of large datasets to shed light on the physicochemical signatures of poorly characterized proteins such as the ones belonging to emergent pathogens.
Asunto(s)
Proteínas , Proteínas/metabolismoRESUMEN
Most COVID-19 vaccines are based on the SARS-CoV-2 Spike glycoprotein (S) or their subunits. However, S shows some structural instability that limits its immunogenicity and production, hampering the development of recombinant S-based vaccines. The introduction of the K986P and V987P (S-2P) mutations increases the production and immunogenicity of the recombinant S trimer, suggesting that these two parameters are related. Nevertheless, S-2P still shows some molecular instability and it is produced with low yield. Here we described a novel set of mutations identified by molecular modeling and located in the S2 region of the S-2P that increase its production up to five-fold. Besides their immunogenicity, the efficacy of two representative S-2P-based mutants, S-29 and S-21, protecting from a heterologous SARS-CoV-2 Beta variant challenge was assayed in K18-hACE2 mice (an animal model of severe SARS-CoV-2 disease) and golden Syrian hamsters (GSH) (a moderate disease model). S-21 induced higher level of WH1 and Delta variants neutralizing antibodies than S-2P in K18-hACE2 mice three days after challenge. Viral load in nasal turbinate and oropharyngeal samples were reduced in S-21 and S-29 vaccinated mice. Despite that, only the S-29 protein protected 100% of K18-hACE2 mice from severe disease. When GSH were analyzed, all immunized animals were protected from disease development irrespectively of the immunogen they received. Therefore, the higher yield of S-29, as well as its improved immunogenicity and efficacy protecting from the highly pathogenic SARS-CoV-2 Beta variant, pinpoint the S-29 mutant as an alternative to the S-2P protein for future SARS-CoV-2 vaccine development.
Asunto(s)
COVID-19 , SARS-CoV-2 , Cricetinae , Animales , Humanos , Ratones , SARS-CoV-2/genética , Mesocricetus , COVID-19/prevención & control , Vacunas contra la COVID-19RESUMEN
It is getting increasingly challenging to efficiently exploit drug-related information described in the growing amount of scientific literature. Indeed, for drug-gene/protein interactions, the challenge is even bigger, considering the scattered information sources and types of interactions. However, their systematic, large-scale exploitation is key for developing tools, impacting knowledge fields as diverse as drug design or metabolic pathway research. Previous efforts in the extraction of drug-gene/protein interactions from the literature did not address these scalability and granularity issues. To tackle them, we have organized the DrugProt track at BioCreative VII. In the context of the track, we have released the DrugProt Gold Standard corpus, a collection of 5000 PubMed abstracts, manually annotated with granular drug-gene/protein interactions. We have proposed a novel large-scale track to evaluate the capacity of natural language processing systems to scale to the range of millions of documents, and generate with their predictions a silver standard knowledge graph of 53 993 602 nodes and 19 367 406 edges. Its use exceeds the shared task and points toward pharmacological and biological applications such as drug discovery or continuous database curation. Finally, we have created a persistent evaluation scenario on CodaLab to continuously evaluate new relation extraction systems that may arise. Thirty teams from four continents, which involved 110 people, sent 107 submission runs for the Main DrugProt track, and nine teams submitted 21 runs for the Large Scale DrugProt track. Most participants implemented deep learning approaches based on pretrained transformer-like language models (LMs) such as BERT or BioBERT, reaching precision and recall values as high as 0.9167 and 0.9542 for some relation types. Finally, some initial explorations of the applicability of the knowledge graph have shown its potential to explore the chemical-protein relations described in the literature, or chemical compound-enzyme interactions. Database URL: https://doi.org/10.5281/zenodo.4955410.
Asunto(s)
Minería de Datos , Reconocimiento de Normas Patrones Automatizadas , Humanos , Bases de Datos Factuales , Minería de Datos/métodos , Proteínas/metabolismoRESUMEN
In systems biology, mathematical models and simulations play a crucial role in understanding complex biological systems. Different modelling frameworks are employed depending on the nature and scales of the system under study. For instance, signalling and regulatory networks can be simulated using Boolean modelling, whereas multicellular systems can be studied using agent-based modelling. Herein, we present PhysiBoSS 2.0, a hybrid agent-based modelling framework that allows simulating signalling and regulatory networks within individual cell agents. PhysiBoSS 2.0 is a redesign and reimplementation of PhysiBoSS 1.0 and was conceived as an add-on that expands the PhysiCell functionalities by enabling the simulation of intracellular cell signalling using MaBoSS while keeping a decoupled, maintainable and model-agnostic design. PhysiBoSS 2.0 also expands the set of functionalities offered to the users, including custom models and cell specifications, mechanistic submodels of substrate internalisation and detailed control over simulation parameters. Together with PhysiBoSS 2.0, we introduce PCTK, a Python package developed for handling and processing simulation outputs, and generating summary plots and 3D renders. PhysiBoSS 2.0 allows studying the interplay between the microenvironment, the signalling pathways that control cellular processes and population dynamics, suitable for modelling cancer. We show different approaches for integrating Boolean networks into multi-scale simulations using strategies to study the drug effects and synergies in models of cancer cell lines and validate them using experimental data. PhysiBoSS 2.0 is open-source and publicly available on GitHub with several repositories of accompanying interoperable tools.
Asunto(s)
Modelos Biológicos , Neoplasias , Humanos , Simulación por Computador , Transducción de Señal , Modelos Teóricos , Análisis de Sistemas , Microambiente TumoralRESUMEN
Co-occurrence of diseases decreases patient quality of life, complicates treatment choices, and increases mortality. Analyses of electronic health records present a complex scenario of comorbidity relationships that vary by age, sex, and cohort under study. The study of similarities between diseases using 'omics data, such as genes altered in diseases, gene expression, proteome, and microbiome, are fundamental to uncovering the origin of, and potential treatment for, comorbidities. Recent studies have produced a first generation of genetic interpretations for as much as 46% of the comorbidities described in large cohorts. Integrating different sources of molecular information and using artificial intelligence (AI) methods are promising approaches for the study of comorbidities. They may help to improve the treatment of comorbidities, including the potential repositioning of drugs.
Asunto(s)
Inteligencia Artificial , Calidad de Vida , Humanos , ComorbilidadRESUMEN
Spatiotemporal gene transcription is tightly regulated by distal regulatory elements, such as enhancers and silencers, which rely on physical proximity with their target gene promoters to control transcription. Although these regulatory elements are easy to identify, their target genes are difficult to predict, since most of them are cell-type specific and may be separated by hundreds of kilobases in the linear genome sequence, skipping over other non-target genes. For several years, Promoter Capture Hi-C (PCHi-C) has been the gold standard for the association of distal regulatory elements to their target genes. However, PCHi-C relies on the availability of millions of cells, prohibiting the study of rare cell populations such as those commonly obtained from primary tissues. To overcome this limitation, low input Capture Hi-C (liCHi-C), a cost-effective and customizable method to identify the repertoire of distal regulatory elements controlling each gene of the genome, has been developed. liCHi-C relies on a similar experimental and computational framework as PCHi-C, but by employing minimal tube changes, modifying the reagent concentration and volumes, and swapping or eliminating steps, it accounts for minimal material loss during library construction. Collectively, liCHi-C enables the study of gene regulation and spatiotemporal genome organization in the context of developmental biology and cellular function.