RESUMEN
Disease mechanisms are usually complex and governed by the interaction of several distinct molecular processes. Complex, multidimensional datasets are a valuable resource to generate more insights into those processes, but the analysis of such datasets can be challenging due to the high dimensionality resulting, for example, from different disease conditions, timepoints, and omics capturing the process at different resolutions. Here, we showcase an approach to analyze and explore such a complex multiomics dataset in an unsupervised way by applying multi-omics factor analysis (MOFA) to a dataset generated from blood samples that capture the immune response in acute and chronic coronary syndromes. The dataset consists of several assays at differing resolutions, including sample-level cytokine data, plasma-proteomics and neutrophil prime-seq, and single-cell RNA-seq (scRNA-seq) data. Further complexity is added by having several different time points measured per patient and several patient subgroups. The analysis workflow outlines how to integrate and analyze the data in several steps: (1) Data pre-processing and harmonization, (2) Estimation of the MOFA model, (3) Downstream analysis. Step 1 outlines how to process the features of the different data types, filter out low-quality features, and normalize them to harmonize their distributions for further analysis. Step 2 shows how to apply the MOFA model and explore the major sources of variance within the dataset across all omics and features. Step 3 presents several strategies for the downstream analysis of the captured patterns, linking them to the disease conditions and potential molecular processes governing those conditions. Overall, we present a workflow for unsupervised data exploration of complex multi-omics datasets to enable the identification of major axes of variation composed of differing molecular features that can also be applied to other contexts and multi-omics datasets (including other assays as presented in the exemplary use case).
Asunto(s)
Enfermedades Cardiovasculares , Proteómica , Humanos , Enfermedades Cardiovasculares/metabolismo , Enfermedades Cardiovasculares/sangre , Enfermedades Cardiovasculares/genética , Proteómica/métodos , MultiómicaRESUMEN
BACKGROUND: The immune system's role in ST-segment-elevated myocardial infarction (STEMI) remains poorly characterized but is an important driver of recurrent cardiovascular events. While anti-inflammatory drugs show promise in reducing recurrence risk, their broad immune system impairment may induce severe side effects. To overcome these challenges, a nuanced understanding of the immune response to STEMI is needed. METHODS: For this, we compared peripheral blood mononuclear single-cell RNA-sequencing (scRNA-seq) and plasma protein expression over time (hospital admission, 24 hours, and 6-8 weeks post-STEMI) in 38 patients and 38 controls (95â 995 diseased and 33â 878 control peripheral blood mononuclear cells). RESULTS: Compared with controls, classical monocytes were increased and CD56dim natural killer cells were decreased in patients with STEMI at admission and persisted until 24 hours post-STEMI. The largest gene expression changes were observed in monocytes, associating with changes in toll-like receptor, interferon, and interleukin signaling activity. Finally, a targeted cardiovascular biomarker panel revealed expression changes in 33/92 plasma proteins post-STEMI. Interestingly, interleukin-6R, MMP9 (matrix metalloproteinase-9), and LDLR (low-density lipoprotein receptor) were affected by coronary artery disease-associated genetic risk variation, disease status, and time post-STEMI, indicating the importance of considering these aspects when defining potential future therapies. CONCLUSIONS: Our analyses revealed the immunologic pathways disturbed by STEMI, specifying affected cell types and disease stages. Additionally, we provide insights into patients expected to benefit most from anti-inflammatory treatments by identifying the genetic variants and disease stage at which these variants affect the outcome of these (drug-targeted) pathways. These findings advance our knowledge of the immune response post-STEMI and provide guidance for future therapeutic studies.
Asunto(s)
Análisis de la Célula Individual , Humanos , Masculino , Femenino , Persona de Mediana Edad , Infarto del Miocardio con Elevación del ST/inmunología , Infarto del Miocardio con Elevación del ST/genética , Infarto del Miocardio con Elevación del ST/sangre , Anciano , Monocitos/inmunología , Monocitos/metabolismo , Biomarcadores/sangre , Infarto del Miocardio/inmunología , Infarto del Miocardio/genética , Leucocitos Mononucleares/inmunología , Leucocitos Mononucleares/metabolismo , Metaloproteinasa 9 de la Matriz/genética , Metaloproteinasa 9 de la Matriz/metabolismo , Receptores de Interleucina-6/genética , Receptores de Interleucina-6/metabolismo , Células Asesinas Naturales/inmunología , Células Asesinas Naturales/metabolismo , Estudios de Casos y ControlesRESUMEN
Acute and chronic coronary syndromes (ACS and CCS) are leading causes of mortality. Inflammation is considered a key pathogenic driver of these diseases, but the underlying immune states and their clinical implications remain poorly understood. Multiomic factor analysis (MOFA) allows unsupervised data exploration across multiple data types, identifying major axes of variation and associating these with underlying molecular processes. We hypothesized that applying MOFA to multiomic data obtained from blood might uncover hidden sources of variance and provide pathophysiological insights linked to clinical needs. Here we compile a longitudinal multiomic dataset of the systemic immune landscape in both ACS and CCS (n = 62 patients in total, n = 15 women and n = 47 men) and validate this in an external cohort (n = 55 patients in total, n = 11 women and n = 44 men). MOFA reveals multicellular immune signatures characterized by distinct monocyte, natural killer and T cell substates and immune-communication pathways that explain a large proportion of inter-patient variance. We also identify specific factors that reflect disease state or associate with treatment outcome in ACS as measured using left ventricular ejection fraction. Hence, this study provides proof-of-concept evidence for the ability of MOFA to uncover multicellular immune programs in cardiovascular disease, opening new directions for mechanistic, biomarker and therapeutic studies.
Asunto(s)
Síndrome Coronario Agudo , Humanos , Femenino , Síndrome Coronario Agudo/inmunología , Masculino , Persona de Mediana Edad , Anciano , Enfermedad Crónica , Monocitos/inmunología , Células Asesinas Naturales/inmunología , Linfocitos T/inmunología , Inflamación/inmunologíaRESUMEN
BACKGROUND: Expression quantitative trait loci (eQTL) studies show how genetic variants affect downstream gene expression. Single-cell data allows reconstruction of personalized co-expression networks and therefore the identification of SNPs altering co-expression patterns (co-expression QTLs, co-eQTLs) and the affected upstream regulatory processes using a limited number of individuals. RESULTS: We conduct a co-eQTL meta-analysis across four scRNA-seq peripheral blood mononuclear cell datasets using a novel filtering strategy followed by a permutation-based multiple testing approach. Before the analysis, we evaluate the co-expression patterns required for co-eQTL identification using different external resources. We identify a robust set of cell-type-specific co-eQTLs for 72 independent SNPs affecting 946 gene pairs. These co-eQTLs are replicated in a large bulk cohort and provide novel insights into how disease-associated variants alter regulatory networks. One co-eQTL SNP, rs1131017, that is associated with several autoimmune diseases, affects the co-expression of RPS26 with other ribosomal genes. Interestingly, specifically in T cells, the SNP additionally affects co-expression of RPS26 and a group of genes associated with T cell activation and autoimmune disease. Among these genes, we identify enrichment for targets of five T-cell-activation-related transcription factors whose binding sites harbor rs1131017. This reveals a previously overlooked process and pinpoints potential regulators that could explain the association of rs1131017 with autoimmune diseases. CONCLUSION: Our co-eQTL results highlight the importance of studying context-specific gene regulation to understand the biological implications of genetic variation. With the expected growth of sc-eQTL datasets, our strategy and technical guidelines will facilitate future co-eQTL identification, further elucidating unknown disease mechanisms.