Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Resultados 1 - 20 de 289
Filtrar
1.
Cell ; 156(1-2): 343-58, 2014 Jan 16.
Artículo en Inglés | MEDLINE | ID: mdl-24439387

RESUMEN

Genome-wide association studies have revealed numerous risk loci associated with diverse diseases. However, identification of disease-causing variants within association loci remains a major challenge. Divergence in gene expression due to cis-regulatory variants in noncoding regions is central to disease susceptibility. We show that integrative computational analysis of phylogenetic conservation with a complexity assessment of co-occurring transcription factor binding sites (TFBS) can identify cis-regulatory variants and elucidate their mechanistic role in disease. Analysis of established type 2 diabetes risk loci revealed a striking clustering of distinct homeobox TFBS. We identified the PRRX1 homeobox factor as a repressor of PPARG2 expression in adipose cells and demonstrate its adverse effect on lipid metabolism and systemic insulin sensitivity, dependent on the rs4684847 risk allele that triggers PRRX1 binding. Thus, cross-species conservation analysis at the level of co-occurring TFBS provides a valuable contribution to the translation of genetic association signals to disease-related molecular mechanisms.


Asunto(s)
Diabetes Mellitus Tipo 2/genética , Polimorfismo de Nucleótido Simple , Animales , Línea Celular , Células Cultivadas , Secuencia Conservada , Regulación de la Expresión Génica , Estudio de Asociación del Genoma Completo , Proteínas de Homeodominio/metabolismo , Humanos , Resistencia a la Insulina , PPAR gamma/genética , Secuencias Reguladoras de Ácidos Nucleicos , Factores de Transcripción/metabolismo
2.
Nature ; 622(7982): 329-338, 2023 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-37794186

RESUMEN

The Pharma Proteomics Project is a precompetitive biopharmaceutical consortium characterizing the plasma proteomic profiles of 54,219 UK Biobank participants. Here we provide a detailed summary of this initiative, including technical and biological validations, insights into proteomic disease signatures, and prediction modelling for various demographic and health indicators. We present comprehensive protein quantitative trait locus (pQTL) mapping of 2,923 proteins that identifies 14,287 primary genetic associations, of which 81% are previously undescribed, alongside ancestry-specific pQTL mapping in non-European individuals. The study provides an updated characterization of the genetic architecture of the plasma proteome, contextualized with projected pQTL discovery rates as sample sizes and proteomic assay coverages increase over time. We offer extensive insights into trans pQTLs across multiple biological domains, highlight genetic influences on ligand-receptor interactions and pathway perturbations across a diverse collection of cytokines and complement networks, and illustrate long-range epistatic effects of ABO blood group and FUT2 secretor status on proteins with gastrointestinal tissue-enriched expression. We demonstrate the utility of these data for drug discovery by extending the genetic proxied effects of protein targets, such as PCSK9, on additional endpoints, and disentangle specific genes and proteins perturbed at loci associated with COVID-19 susceptibility. This public-private partnership provides the scientific community with an open-access proteomics resource of considerable breadth and depth to help to elucidate the biological mechanisms underlying proteo-genomic discoveries and accelerate the development of biomarkers, predictive models and therapeutics1.


Asunto(s)
Bancos de Muestras Biológicas , Proteínas Sanguíneas , Bases de Datos Factuales , Genómica , Salud , Proteoma , Proteómica , Humanos , Sistema del Grupo Sanguíneo ABO/genética , Proteínas Sanguíneas/análisis , Proteínas Sanguíneas/genética , COVID-19/genética , Descubrimiento de Drogas , Epistasis Genética , Fucosiltransferasas/metabolismo , Predisposición Genética a la Enfermedad , Plasma/química , Proproteína Convertasa 9/metabolismo , Proteoma/análisis , Proteoma/genética , Asociación entre el Sector Público-Privado , Sitios de Carácter Cuantitativo , Reino Unido , Galactósido 2-alfa-L-Fucosiltransferasa
3.
Nat Rev Genet ; 22(1): 19-37, 2021 01.
Artículo en Inglés | MEDLINE | ID: mdl-32860016

RESUMEN

Proteomic analysis of cells, tissues and body fluids has generated valuable insights into the complex processes influencing human biology. Proteins represent intermediate phenotypes for disease and provide insight into how genetic and non-genetic risk factors are mechanistically linked to clinical outcomes. Associations between protein levels and DNA sequence variants that colocalize with risk alleles for common diseases can expose disease-associated pathways, revealing novel drug targets and translational biomarkers. However, genome-wide, population-scale analyses of proteomic data are only now emerging. Here, we review current findings from studies of the plasma proteome and discuss their potential for advancing biomedical translation through the interpretation of genome-wide association analyses. We highlight the challenges faced by currently available technologies and provide perspectives relevant to their future application in large-scale biobank studies.


Asunto(s)
Proteínas Sanguíneas/análisis , Estudio de Asociación del Genoma Completo , Proteoma/genética , Proteómica , Biomarcadores/análisis , Humanos , Fenotipo
4.
Mol Cell Proteomics ; 23(7): 100786, 2024 May 17.
Artículo en Inglés | MEDLINE | ID: mdl-38761890

RESUMEN

Advances in proteomic assay technologies have significantly increased coverage and throughput, enabling recent increases in the number of large-scale population-based proteomic studies of human plasma and serum. Improvements in multiplexed protein assays have facilitated the quantification of thousands of proteins over a large dynamic range, a key requirement for detecting the lowest-ranging, and potentially the most disease-relevant, blood-circulating proteins. In this perspective, we examine how populational proteomic datasets in conjunction with other concurrent omic measures can be leveraged to better understand the genomic and non-genomic correlates of the soluble proteome, constructing biomarker panels for disease prediction, among others. Mass spectrometry workflows are discussed as they are becoming increasingly competitive with affinity-based array platforms in terms of speed, cost, and proteome coverage due to advances in both instrumentation and workflows. Despite much success, there remain considerable challenges such as orthogonal validation and absolute quantification. We also highlight emergent challenges associated with study design, analytical considerations, and data integration as population-scale studies are run in batches and may involve longitudinal samples collated over many years. Lastly, we take a look at the future of what the nascent next-generation proteomic technologies might provide to the analysis of large sets of blood samples, as well as the difficulties in designing large-scale studies that will likely require participation from multiple and complex funding sources and where data sharing, study designs, and financing must be solved.

5.
Hum Mol Genet ; 32(6): 907-916, 2023 03 06.
Artículo en Inglés | MEDLINE | ID: mdl-36168886

RESUMEN

Polygenic scores (PGS) can identify individuals at risk of adverse health events and guide genetics-based personalized medicine. However, it is not clear how well PGS translate between different populations, limiting their application to well-studied ethnicities. Proteins are intermediate traits linking genetic predisposition and environmental factors to disease, with numerous blood circulating protein levels representing functional readouts of disease-related processes. We hypothesized that studying the genetic architecture of a comprehensive set of blood-circulating proteins between a European and an Arab population could shed fresh light on the translatability of PGS to understudied populations. We therefore conducted a genome-wide association study with whole-genome sequencing data using 1301 proteins measured on the SOMAscan aptamer-based affinity proteomics platform in 2935 samples of Qatar Biobank and evaluated the replication of protein quantitative traits (pQTLs) from European studies in an Arab population. Then, we investigated the colocalization of shared pQTL signals between the two populations. Finally, we compared the performance of protein PGS derived from a Caucasian population in a European and an Arab cohort. We found that the majority of shared pQTL signals (81.8%) colocalized between both populations. About one-third of the genetic protein heritability was explained by protein PGS derived from a European cohort, with protein PGS performing ~20% better in Europeans when compared to Arabs. Our results are relevant for the translation of PGS to non-Caucasian populations, as well as for future efforts to extend genetic research to understudied populations.


Asunto(s)
Árabes , Sitios de Carácter Cuantitativo , Población Blanca , Humanos , Árabes/genética , Estudio de Asociación del Genoma Completo , Población Blanca/genética , Genética de Población
6.
Artículo en Inglés | MEDLINE | ID: mdl-38750824

RESUMEN

BACKGROUND: TRPM4 is a broadly expressed, calcium-activated, monovalent cation channel that regulates immune cell function in mice and cell lines. Clinically, however, partial loss- or gain-of-function mutations in TRPM4 lead to arrhythmia and heart disease, with no documentation of immunologic disorders. OBJECTIVE: To characterize functional cellular mechanisms underlying the immune dysregulation phenotype in a proband with a mutated TRPM4 gene. METHODS: We employed a combination of biochemical, cell biological, imaging, omics analyses, flow cytometry, and gene editing approaches. RESULTS: We report the first human cases to our knowledge with complete loss of the TRPM4 channel, leading to immune dysregulation with frequent bacterial and fungal infections. Single-cell and bulk RNA sequencing point to altered expression of genes affecting cell migration, specifically in monocytes. Inhibition of TRPM4 in T cells and the THP-1 monocyte cell line reduces migration. More importantly, primary T cells and monocytes from TRPM4 patients migrate poorly. Finally, CRISPR knockout of TRPM4 in THP-1 cells greatly reduces their migration potential. CONCLUSION: Our results demonstrate that TRPM4 plays a critical role in regulating immune cell migration, leading to increased susceptibility to infections.

7.
Hum Mol Genet ; 31(19): 3367-3376, 2022 09 29.
Artículo en Inglés | MEDLINE | ID: mdl-34718574

RESUMEN

In the era of personalized medicine with more and more patient-specific targeted therapies being used, we need reliable, dynamic, faster and sensitive biomarkers both to track the causes of disease and to develop and evolve therapies during the course of treatment. Metabolomics recently has shown substantial evidence to support its emerging role in disease diagnosis and prognosis. Aside from biomarkers and development of therapies, it is also an important goal to understand the involvement of mitochondrial DNA (mtDNA) in metabolic regulation, aging and disease development. Somatic mutations of the mitochondrial genome are also heavily implicated in age-related disease and aging. The general hypothesis is that an alteration in the concentration of metabolite profiles (possibly conveyed by lifestyle and environmental factors) influences the increase of mutation rate in the mtDNA and thereby contributes to a range of pathophysiological alterations observed in complex diseases. We performed an inverted mitochondrial genome-wide association analysis between mitochondrial nucleotide variants (mtSNVs) and concentration of metabolites. We used 151 metabolites and the whole sequenced mitochondrial genome from 2718 individuals to identify the genetic variants associated with metabolite profiles. Because of the high coverage, next-generation sequencing-based analysis of the mitochondrial genome allows for an accurate detection of mitochondrial heteroplasmy and for the identification of variants associated with the metabolome. The strongest association was found for mt715G > A located in the MT-12SrRNA with the metabolite ratio of C2/C10:1 (P-value = 6.82*10-09, ß = 0.909). The second most significant mtSNV was found for mt3714A > G located in the MT-ND1 with the metabolite ratio of phosphatidylcholine (PC) ae C42:5/PC ae C44:5 (P-value = 1.02*10-08, ß = 3.631). A large number of significant metabolite ratios were observed involving PC aa C36:6 and the variant mt10689G > A, located in the MT-ND4L gene. These results show an important interconnection between mitochondria and metabolite concentrations. Considering that some of the significant metabolites found in this study have been previously related to complex diseases, such as neurological disorders and metabolic conditions, these associations found here might play a crucial role for further investigations of such complex diseases. Understanding the mechanisms that control human health and disease, in particular, the role of genetic predispositions and their interaction with environmental factors is a prerequisite for the development of safe and efficient therapies for complex disorders.


Asunto(s)
Estudio de Asociación del Genoma Completo , Metabolómica , Biomarcadores/metabolismo , ADN Mitocondrial/genética , ADN Mitocondrial/metabolismo , Humanos , Metabolómica/métodos , Mitocondrias/genética , Mitocondrias/metabolismo , Nucleótidos/metabolismo , Fosfatidilcolinas/metabolismo
8.
Brief Bioinform ; 23(2)2022 03 10.
Artículo en Inglés | MEDLINE | ID: mdl-34981111

RESUMEN

Large metabolomics datasets inevitably contain unwanted technical variations which can obscure meaningful biological signals and affect how this information is applied to personalized healthcare. Many methods have been developed to handle unwanted variations. However, the underlying assumptions of many existing methods only hold for a few specific scenarios. Some tools remove technical variations with models trained on quality control (QC) samples which may not generalize well on subject samples. Additionally, almost none of the existing methods supports datasets with multiple types of QC samples, which greatly limits their performance and flexibility. To address these issues, a non-parametric method TIGER (Technical variation elImination with ensemble learninG architEctuRe) is developed in this study and released as an R package (https://CRAN.R-project.org/package=TIGERr). TIGER integrates the random forest algorithm into an adaptable ensemble learning architecture. Evaluation results show that TIGER outperforms four popular methods with respect to robustness and reliability on three human cohort datasets constructed with targeted or untargeted metabolomics data. Additionally, a case study aiming to identify age-associated metabolites is performed to illustrate how TIGER can be used for cross-kit adjustment in a longitudinal analysis with experimental data of three time-points generated by different analytical kits. A dynamic website is developed to help evaluate the performance of TIGER and examine the patterns revealed in our longitudinal analysis (https://han-siyu.github.io/TIGER_web/). Overall, TIGER is expected to be a powerful tool for metabolomics data analysis.


Asunto(s)
Algoritmos , Metabolómica , Humanos , Aprendizaje Automático , Metabolómica/métodos , Reproducibilidad de los Resultados , Proyectos de Investigación
9.
PLoS Pathog ; 18(9): e1010819, 2022 09.
Artículo en Inglés | MEDLINE | ID: mdl-36121875

RESUMEN

BACKGROUND: Acute respiratory distress syndrome (ARDS), a life-threatening condition characterized by hypoxemia and poor lung compliance, is associated with high mortality. ARDS induced by COVID-19 has similar clinical presentations and pathological manifestations as non-COVID-19 ARDS. However, COVID-19 ARDS is associated with a more protracted inflammatory respiratory failure compared to traditional ARDS. Therefore, a comprehensive molecular comparison of ARDS of different etiologies groups may pave the way for more specific clinical interventions. METHODS AND FINDINGS: In this study, we compared COVID-19 ARDS (n = 43) and bacterial sepsis-induced (non-COVID-19) ARDS (n = 24) using multi-omic plasma profiles covering 663 metabolites, 1,051 lipids, and 266 proteins. To address both between- and within- ARDS group variabilities we followed two approaches. First, we identified 706 molecules differently abundant between the two ARDS etiologies, revealing more than 40 biological processes differently regulated between the two groups. From these processes, we assembled a cascade of therapeutically relevant pathways downstream of sphingosine metabolism. The analysis suggests a possible overactivation of arginine metabolism involved in long-term sequelae of ARDS and highlights the potential of JAK inhibitors to improve outcomes in bacterial sepsis-induced ARDS. The second part of our study involved the comparison of the two ARDS groups with respect to clinical manifestations. Using a data-driven multi-omic network, we identified signatures of acute kidney injury (AKI) and thrombocytosis within each ARDS group. The AKI-associated network implicated mitochondrial dysregulation which might lead to post-ARDS renal-sequalae. The thrombocytosis-associated network hinted at a synergy between prothrombotic processes, namely IL-17, MAPK, TNF signaling pathways, and cell adhesion molecules. Thus, we speculate that combination therapy targeting two or more of these processes may ameliorate thrombocytosis-mediated hypercoagulation. CONCLUSION: We present a first comprehensive molecular characterization of differences between two ARDS etiologies-COVID-19 and bacterial sepsis. Further investigation into the identified pathways will lead to a better understanding of the pathophysiological processes, potentially enabling novel therapeutic interventions.


Asunto(s)
Lesión Renal Aguda , COVID-19 , Inhibidores de las Cinasas Janus , Síndrome de Dificultad Respiratoria , Sepsis , Trombocitosis , Arginina , COVID-19/complicaciones , Humanos , Interleucina-17 , Lípidos , Síndrome de Dificultad Respiratoria/etiología , Sepsis/complicaciones , Esfingosina
10.
Cardiovasc Diabetol ; 23(1): 181, 2024 May 29.
Artículo en Inglés | MEDLINE | ID: mdl-38811951

RESUMEN

BACKGROUND AND AIMS: Atherosclerosis is the main cause of stroke and coronary heart disease (CHD), both leading mortality causes worldwide. Proteomics, as a high-throughput method, could provide helpful insights into the pathological mechanisms underlying atherosclerosis. In this study, we characterized the associations of plasma protein levels with CHD and with carotid intima-media thickness (CIMT), as a surrogate measure of atherosclerosis. METHODS: The discovery phase included 1000 participants from the KORA F4 study, whose plasma protein levels were quantified using the aptamer-based SOMAscan proteomics platform. We evaluated the associations of plasma protein levels with CHD using logistic regression, and with CIMT using linear regression. For both outcomes we applied two models: an age-sex adjusted model, and a model additionally adjusted for body mass index, smoking status, physical activity, diabetes status, hypertension status, low density lipoprotein, high density lipoprotein, and triglyceride levels (fully-adjusted model). The replication phase included a matched case-control sample from the independent KORA F3 study, using ELISA-based measurements of galectin-4. Pathway analysis was performed with nominally associated proteins (p-value < 0.05) from the fully-adjusted model. RESULTS: In the KORA F4 sample, after Bonferroni correction, we found CHD to be associated with five proteins using the age-sex adjusted model: galectin-4 (LGALS4), renin (REN), cathepsin H (CTSH), and coagulation factors X and Xa (F10). The fully-adjusted model yielded only the positive association of galectin-4 (OR = 1.58, 95% CI = 1.30-1.93), which was successfully replicated in the KORA F3 sample (OR = 1.40, 95% CI = 1.09-1.88). For CIMT, we found four proteins to be associated using the age-sex adjusted model namely: cytoplasmic protein NCK1 (NCK1), insulin-like growth factor-binding protein 2 (IGFBP2), growth hormone receptor (GHR), and GDNF family receptor alpha-1 (GFRA1). After assessing the fully-adjusted model, only NCK1 remained significant (ß = 0.017, p-value = 1.39e-06). Upstream regulators of galectin-4 and NCK1 identified from pathway analysis were predicted to be involved in inflammation pathways. CONCLUSIONS: Our proteome-wide association study identified galectin-4 to be associated with CHD and NCK1 to be associated with CIMT. Inflammatory pathways underlying the identified associations highlight the importance of inflammation in the development and progression of CHD.


Asunto(s)
Biomarcadores , Proteínas Sanguíneas , Grosor Intima-Media Carotídeo , Enfermedad Coronaria , Valor Predictivo de las Pruebas , Proteómica , Humanos , Masculino , Femenino , Persona de Mediana Edad , Anciano , Biomarcadores/sangre , Proteínas Sanguíneas/análisis , Estudios de Casos y Controles , Enfermedad Coronaria/sangre , Enfermedad Coronaria/diagnóstico , Enfermedad Coronaria/epidemiología , Enfermedad Coronaria/diagnóstico por imagen , Enfermedades de las Arterias Carótidas/sangre , Enfermedades de las Arterias Carótidas/diagnóstico por imagen , Enfermedades de las Arterias Carótidas/epidemiología , Proteoma , Alemania/epidemiología , Factores de Riesgo , Medición de Riesgo , Enfermedad de la Arteria Coronaria/sangre , Enfermedad de la Arteria Coronaria/diagnóstico por imagen , Adulto
11.
Cardiovasc Diabetol ; 23(1): 199, 2024 Jun 12.
Artículo en Inglés | MEDLINE | ID: mdl-38867314

RESUMEN

BACKGROUND: Metformin and sodium-glucose-cotransporter-2 inhibitors (SGLT2i) are cornerstone therapies for managing hyperglycemia in diabetes. However, their detailed impacts on metabolic processes, particularly within the citric acid (TCA) cycle and its anaplerotic pathways, remain unclear. This study investigates the tissue-specific metabolic effects of metformin, both as a monotherapy and in combination with SGLT2i, on the TCA cycle and associated anaplerotic reactions in both mice and humans. METHODS: Metformin-specific metabolic changes were initially identified by comparing metformin-treated diabetic mice (MET) with vehicle-treated db/db mice (VG). These findings were then assessed in two human cohorts (KORA and QBB) and a longitudinal KORA study of metformin-naïve patients with Type 2 Diabetes (T2D). We also compared MET with db/db mice on combination therapy (SGLT2i + MET). Metabolic profiling analyzed 716 metabolites from plasma, liver, and kidney tissues post-treatment, using linear regression and Bonferroni correction for statistical analysis, complemented by pathway analyses to explore the pathophysiological implications. RESULTS: Metformin monotherapy significantly upregulated TCA cycle intermediates such as malate, fumarate, and α-ketoglutarate (α-KG) in plasma, and anaplerotic substrates including hepatic glutamate and renal 2-hydroxyglutarate (2-HG) in diabetic mice. Downregulated hepatic taurine was also observed. The addition of SGLT2i, however, reversed these effects, such as downregulating circulating malate and α-KG, and hepatic glutamate and renal 2-HG, but upregulated hepatic taurine. In human T2D patients on metformin therapy, significant systemic alterations in metabolites were observed, including increased malate but decreased citrulline. The bidirectional modulation of TCA cycle intermediates in mice influenced key anaplerotic pathways linked to glutaminolysis, tumorigenesis, immune regulation, and antioxidative responses. CONCLUSION: This study elucidates the specific metabolic consequences of metformin and SGLT2i on the TCA cycle, reflecting potential impacts on the immune system. Metformin shows promise for its anti-inflammatory properties, while the addition of SGLT2i may provide liver protection in conditions like metabolic dysfunction-associated steatotic liver disease (MASLD). These observations underscore the importance of personalized treatment strategies.


Asunto(s)
Ciclo del Ácido Cítrico , Diabetes Mellitus Tipo 2 , Hipoglucemiantes , Riñón , Hígado , Metformina , Inhibidores del Cotransportador de Sodio-Glucosa 2 , Metformina/farmacología , Animales , Ciclo del Ácido Cítrico/efectos de los fármacos , Inhibidores del Cotransportador de Sodio-Glucosa 2/farmacología , Inhibidores del Cotransportador de Sodio-Glucosa 2/uso terapéutico , Humanos , Hipoglucemiantes/farmacología , Diabetes Mellitus Tipo 2/tratamiento farmacológico , Diabetes Mellitus Tipo 2/metabolismo , Diabetes Mellitus Tipo 2/sangre , Masculino , Hígado/metabolismo , Hígado/efectos de los fármacos , Riñón/metabolismo , Riñón/efectos de los fármacos , Femenino , Quimioterapia Combinada , Ratones Endogámicos C57BL , Metabolómica , Biomarcadores/sangre , Persona de Mediana Edad , Glucemia/metabolismo , Glucemia/efectos de los fármacos , Estudios Longitudinales , Ratones , Anciano , Resultado del Tratamiento
12.
Mol Psychiatry ; 28(9): 3874-3887, 2023 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-37495887

RESUMEN

Metabolome reflects the interplay of genome and exposome at molecular level and thus can provide deep insights into the pathogenesis of a complex disease like major depression. To identify metabolites associated with depression we performed a metabolome-wide association analysis in 13,596 participants from five European population-based cohorts characterized for depression, and circulating metabolites using ultra high-performance liquid chromatography/tandem accurate mass spectrometry (UHPLC/MS/MS) based Metabolon platform. We tested 806 metabolites covering a wide range of biochemical processes including those involved in lipid, amino-acid, energy, carbohydrate, xenobiotic and vitamin metabolism for their association with depression. In a conservative model adjusting for life style factors and cardiovascular and antidepressant medication use we identified 8 metabolites, including 6 novel, significantly associated with depression. In individuals with depression, increased levels of retinol (vitamin A), 1-palmitoyl-2-palmitoleoyl-GPC (16:0/16:1) (lecithin) and mannitol/sorbitol and lower levels of hippurate, 4-hydroxycoumarin, 2-aminooctanoate (alpha-aminocaprylic acid), 10-undecenoate (11:1n1) (undecylenic acid), 1-linoleoyl-GPA (18:2) (lysophosphatidic acid; LPA 18:2) are observed. These metabolites are either directly food derived or are products of host and gut microbial metabolism of food-derived products. Our Mendelian randomization analysis suggests that low hippurate levels may be in the causal pathway leading towards depression. Our findings highlight putative actionable targets for depression prevention that are easily modifiable through diet interventions.


Asunto(s)
Depresión , Espectrometría de Masas en Tándem , Humanos , Depresión/metabolismo , Dieta , Metaboloma/genética , Vitamina A/metabolismo , Hipuratos , Metabolómica/métodos
13.
Nature ; 558(7708): 73-79, 2018 06.
Artículo en Inglés | MEDLINE | ID: mdl-29875488

RESUMEN

Although plasma proteins have important roles in biological processes and are the direct targets of many drugs, the genetic factors that control inter-individual variation in plasma protein levels are not well understood. Here we characterize the genetic architecture of the human plasma proteome in healthy blood donors from the INTERVAL study. We identify 1,927 genetic associations with 1,478 proteins, a fourfold increase on existing knowledge, including trans associations for 1,104 proteins. To understand the consequences of perturbations in plasma protein levels, we apply an integrated approach that links genetic variation with biological pathway, disease, and drug databases. We show that protein quantitative trait loci overlap with gene expression quantitative trait loci, as well as with disease-associated loci, and find evidence that protein biomarkers have causal roles in disease using Mendelian randomization analysis. By linking genetic factors to diseases via specific proteins, our analyses highlight potential therapeutic targets, opportunities for matching existing drugs with new disease indications, and potential safety concerns for drugs under development.


Asunto(s)
Proteínas Sanguíneas/genética , Genómica , Proteoma/genética , Femenino , Factor de Crecimiento de Hepatocito/genética , Humanos , Enfermedades Inflamatorias del Intestino/genética , Masculino , Mutación Missense/genética , Mieloblastina/genética , Factor 1 de Unión al Dominio 1 de Regulación Positiva/genética , Proteínas Proto-Oncogénicas/genética , Sitios de Carácter Cuantitativo/genética , Vasculitis/genética , alfa 1-Antitripsina/genética
14.
Proc Natl Acad Sci U S A ; 118(51)2021 12 21.
Artículo en Inglés | MEDLINE | ID: mdl-34916285

RESUMEN

Spina bifida (SB) is a debilitating birth defect caused by multiple gene and environment interactions. Though SB shows non-Mendelian inheritance, genetic factors contribute to an estimated 70% of cases. Nevertheless, identifying human mutations conferring SB risk is challenging due to its relative rarity, genetic heterogeneity, incomplete penetrance, and environmental influences that hamper genome-wide association studies approaches to untargeted discovery. Thus, SB genetic studies may suffer from population substructure and/or selection bias introduced by typical candidate gene searches. We report a population based, ancestry-matched whole-genome sequence analysis of SB genetic predisposition using a systems biology strategy to interrogate 298 case-control subject genomes (149 pairs). Genes that were enriched in likely gene disrupting (LGD), rare protein-coding variants were subjected to machine learning analysis to identify genes in which LGD variants occur with a different frequency in cases versus controls and so discriminate between these groups. Those genes with high discriminatory potential for SB significantly enriched pathways pertaining to carbon metabolism, inflammation, innate immunity, cytoskeletal regulation, and essential transcriptional regulation consistent with their having impact on the pathogenesis of human SB. Additionally, an interrogation of conserved noncoding sequences identified robust variant enrichment in regulatory regions of several transcription factors critical to embryonic development. This genome-wide perspective offers an effective approach to the interrogation of coding and noncoding sequence variant contributions to rare complex genetic disorders.


Asunto(s)
Genoma Humano , Disrafia Espinal/genética , Estudios de Casos y Controles , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Humanos , Biología de Sistemas , Factores de Transcripción/genética
15.
Mol Med ; 29(1): 13, 2023 01 26.
Artículo en Inglés | MEDLINE | ID: mdl-36703108

RESUMEN

BACKGROUND: Acute respiratory distress syndrome (ARDS), a life-threatening condition during critical illness, is a common complication of COVID-19. It can originate from various disease etiologies, including severe infections, major injury, or inhalation of irritants. ARDS poses substantial clinical challenges due to a lack of etiology-specific therapies, multisystem involvement, and heterogeneous, poor patient outcomes. A molecular comparison of ARDS groups holds the potential to reveal common and distinct mechanisms underlying ARDS pathogenesis. METHODS: We performed a comparative analysis of urine-based metabolomics and proteomics profiles from COVID-19 ARDS patients (n = 42) and bacterial sepsis-induced ARDS patients (n = 17). To this end, we used two different approaches, first we compared the molecular omics profiles between ARDS groups, and second, we correlated clinical manifestations within each group with the omics profiles. RESULTS: The comparison of the two ARDS etiologies identified 150 metabolites and 70 proteins that were differentially abundant between the two groups. Based on these findings, we interrogated the interplay of cell adhesion/extracellular matrix molecules, inflammation, and mitochondrial dysfunction in ARDS pathogenesis through a multi-omic network approach. Moreover, we identified a proteomic signature associated with mortality in COVID-19 ARDS patients, which contained several proteins that had previously been implicated in clinical manifestations frequently linked with ARDS pathogenesis. CONCLUSION: In summary, our results provide evidence for significant molecular differences in ARDS patients from different etiologies and a potential synergy of extracellular matrix molecules, inflammation, and mitochondrial dysfunction in ARDS pathogenesis. The proteomic mortality signature should be further investigated in future studies to develop prediction models for COVID-19 patient outcomes.


Asunto(s)
COVID-19 , Síndrome de Dificultad Respiratoria , Sepsis , Humanos , COVID-19/complicaciones , Proteómica , Multiómica , Síndrome de Dificultad Respiratoria/etiología , Sepsis/complicaciones , Inflamación
16.
Brief Bioinform ; 22(4)2021 07 20.
Artículo en Inglés | MEDLINE | ID: mdl-33063116

RESUMEN

Least absolute shrinkage and selection operator (LASSO) regression is often applied to select the most promising set of single nucleotide polymorphisms (SNPs) associated with a molecular phenotype of interest. While the penalization parameter λ restricts the number of selected SNPs and the potential model overfitting, the least-squares loss function of standard LASSO regression translates into a strong dependence of statistical results on a small number of individuals with phenotypes or genotypes divergent from the majority of the study population-typically comprised of outliers and high-leverage observations. Robust methods have been developed to constrain the influence of divergent observations and generate statistical results that apply to the bulk of study data, but they have rarely been applied to genetic association studies. In this article, we review, for newcomers to the field of robust statistics, a novel version of standard LASSO that utilizes the Huber loss function. We conduct comprehensive simulations and analyze real protein, metabolite, mRNA expression and genotype data to compare the stability of penalization, the cross-iteration concordance of the model, the false-positive and true-positive rates and the prediction accuracy of standard and robust Huber-LASSO. Although the two methods showed controlled false-positive rates ≤2.1% and similar true-positive rates, robust Huber-LASSO outperformed standard LASSO in the accuracy of predicted protein, metabolite and gene expression levels using individual SNP data. The conducted simulations and real-data analyses show that robust Huber-LASSO represents a valuable alternative to standard LASSO in genetic studies of molecular phenotypes.


Asunto(s)
Simulación por Computador , Bases de Datos de Ácidos Nucleicos , Regulación de la Expresión Génica , Estudios de Asociación Genética , Genotipo , Polimorfismo de Nucleótido Simple , Humanos
17.
Bioinformatics ; 38(2): 573-576, 2022 01 03.
Artículo en Inglés | MEDLINE | ID: mdl-34529048

RESUMEN

SUMMARY: The 'Subgroup Identification' (SGI) toolbox provides an algorithm to automatically detect clinical subgroups of samples in large-scale omics datasets. It is based on hierarchical clustering trees in combination with a specifically designed association testing and visualization framework that can process an arbitrary number of clinical parameters and outcomes in a systematic fashion. A multi-block extension allows for the simultaneous use of multiple omics datasets on the same samples. In this article, we first describe the functionality of the toolbox and then demonstrate its capabilities through application examples on a type 2 diabetes metabolomics study as well as two copy number variation datasets from The Cancer Genome Atlas. AVAILABILITY AND IMPLEMENTATION: SGI is an open-source package implemented in R. Package source codes and hands-on tutorials are available at https://github.com/krumsieklab/sgi. The QMdiab metabolomics data is included in the package and can be downloaded from https://doi.org/10.6084/m9.figshare.5904022. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Variaciones en el Número de Copia de ADN , Diabetes Mellitus Tipo 2 , Humanos , Programas Informáticos , Algoritmos , Metabolómica
18.
Bioinformatics ; 38(4): 1168-1170, 2022 01 27.
Artículo en Inglés | MEDLINE | ID: mdl-34694386

RESUMEN

This article presents maplet, an open-source R package for the creation of highly customizable, fully reproducible statistical pipelines for metabolomics data analysis. It builds on the SummarizedExperiment data structure to create a centralized pipeline framework for storing data, analysis steps, results and visualizations. maplet's key design feature is its modularity, which offers several advantages, such as ensuring code quality through the maintenance of individual functions and promoting collaborative development by removing technical barriers to code contribution. With over 90 functions, the package includes a wide range of functionalities, covering many widely used statistical approaches and data visualization techniques. AVAILABILITY AND IMPLEMENTATION: The maplet package is implemented in R and freely available at https://github.com/krumsieklab/maplet.


Asunto(s)
Metabolómica , Programas Informáticos , Análisis de Datos , Visualización de Datos
19.
Clin Proteomics ; 20(1): 31, 2023 Aug 07.
Artículo en Inglés | MEDLINE | ID: mdl-37550624

RESUMEN

BACKGROUND: Human plasma contains a wide variety of circulating proteins. These proteins can be important clinical biomarkers in disease and also possible drug targets. Large scale genomics studies of circulating proteins can identify genetic variants that lead to relative protein abundance. METHODS: We conducted a meta-analysis on genome-wide association studies of autosomal chromosomes in 22,997 individuals of primarily European ancestry across 12 cohorts to identify protein quantitative trait loci (pQTL) for 92 cardiometabolic associated plasma proteins. RESULTS: We identified 503 (337 cis and 166 trans) conditionally independent pQTLs, including several novel variants not reported in the literature. We conducted a sex-stratified analysis and found that 118 (23.5%) of pQTLs demonstrated heterogeneity between sexes. The direction of effect was preserved but there were differences in effect size and significance. Additionally, we annotate trans-pQTLs with nearest genes and report plausible biological relationships. Using Mendelian randomization, we identified causal associations for 18 proteins across 19 phenotypes, of which 10 have additional genetic colocalization evidence. We highlight proteins associated with a constellation of cardiometabolic traits including angiopoietin-related protein 7 (ANGPTL7) and Semaphorin 3F (SEMA3F). CONCLUSION: Through large-scale analysis of protein quantitative trait loci, we provide a comprehensive overview of common variants associated with plasma proteins. We highlight possible biological relationships which may serve as a basis for further investigation into possible causal roles in cardiometabolic diseases.

20.
Cardiovasc Diabetol ; 22(1): 141, 2023 06 16.
Artículo en Inglés | MEDLINE | ID: mdl-37328862

RESUMEN

BACKGROUND: Metabolic Syndrome (MetS) is characterized by risk factors such as abdominal obesity, hypertriglyceridemia, low high-density lipoprotein cholesterol (HDL-C), hypertension, and hyperglycemia, which contribute to the development of cardiovascular disease and type 2 diabetes. Here, we aim to identify candidate metabolite biomarkers of MetS and its associated risk factors to better understand the complex interplay of underlying signaling pathways. METHODS: We quantified serum samples of the KORA F4 study participants (N = 2815) and analyzed 121 metabolites. Multiple regression models adjusted for clinical and lifestyle covariates were used to identify metabolites that were Bonferroni significantly associated with MetS. These findings were replicated in the SHIP-TREND-0 study (N = 988) and further analyzed for the association of replicated metabolites with the five components of MetS. Database-driven networks of the identified metabolites and their interacting enzymes were also constructed. RESULTS: We identified and replicated 56 MetS-specific metabolites: 13 were positively associated (e.g., Val, Leu/Ile, Phe, and Tyr), and 43 were negatively associated (e.g., Gly, Ser, and 40 lipids). Moreover, the majority (89%) and minority (23%) of MetS-specific metabolites were associated with low HDL-C and hypertension, respectively. One lipid, lysoPC a C18:2, was negatively associated with MetS and all of its five components, indicating that individuals with MetS and each of the risk factors had lower concentrations of lysoPC a C18:2 compared to corresponding controls. Our metabolic networks elucidated these observations by revealing impaired catabolism of branched-chain and aromatic amino acids, as well as accelerated Gly catabolism. CONCLUSION: Our identified candidate metabolite biomarkers are associated with the pathophysiology of MetS and its risk factors. They could facilitate the development of therapeutic strategies to prevent type 2 diabetes and cardiovascular disease. For instance, elevated levels of lysoPC a C18:2 may protect MetS and its five risk components. More in-depth studies are necessary to determine the mechanism of key metabolites in the MetS pathophysiology.


Asunto(s)
Enfermedades Cardiovasculares , Diabetes Mellitus Tipo 2 , Hipertensión , Síndrome Metabólico , Humanos , Síndrome Metabólico/diagnóstico , Síndrome Metabólico/epidemiología , Metabolómica , Factores de Riesgo , Biomarcadores , Hipertensión/diagnóstico , Hipertensión/epidemiología
SELECCIÓN DE REFERENCIAS
Detalles de la búsqueda