RESUMO
High-throughput proteomics platforms measuring thousands of proteins in plasma combined with genomic and phenotypic information have the power to bridge the gap between the genome and diseases. Here we performed association studies of Olink Explore 3072 data generated by the UK Biobank Pharma Proteomics Project1 on plasma samples from more than 50,000 UK Biobank participants with phenotypic and genotypic data, stratifying on British or Irish, African and South Asian ancestries. We compared the results with those of a SomaScan v4 study on plasma from 36,000 Icelandic people2, for 1,514 of whom Olink data were also available. We found modest correlation between the two platforms. Although cis protein quantitative trait loci were detected for a similar absolute number of assays on the two platforms (2,101 on Olink versus 2,120 on SomaScan), the proportion of assays with such supporting evidence for assay performance was higher on the Olink platform (72% versus 43%). A considerable number of proteins had genomic associations that differed between the platforms. We provide examples where differences between platforms may influence conclusions drawn from the integration of protein levels with the study of diseases. We demonstrate how leveraging the diverse ancestries of participants in the UK Biobank helps to detect novel associations and refine genomic location. Our results show the value of the information provided by the two most commonly used high-throughput proteomics platforms and demonstrate the differences between them that at times provides useful complementarity.
Assuntos
Proteínas Sanguíneas , Suscetibilidade a Doenças , Genômica , Genótipo , Fenótipo , Proteômica , Humanos , África/etnologia , Ásia Meridional/etnologia , Bancos de Espécimes Biológicos , Proteínas Sanguíneas/análise , Proteínas Sanguíneas/genética , Conjuntos de Dados como Assunto , Genoma Humano/genética , Islândia/etnologia , Irlanda/etnologia , Plasma/química , Proteoma/análise , Proteoma/genética , Proteômica/métodos , Locos de Características Quantitativas , Reino UnidoRESUMO
Biosynthetic and chemical datasets are the two major pillars for microbial drug discovery in the omics era. Despite the advancement of analysis tools and platforms for multi-strain metabolomics and genomics, linking these information sources remains a considerable bottleneck in strain prioritisation and natural product discovery. In this study, molecular networking of the 100 metabolite extracts derived from applying the OSMAC approach to 25 Polar bacterial strains, showed growth media specificity and potential chemical novelty was suggested. Moreover, the metabolite extracts were screened for antibacterial activity and promising selective bioactivity against drug-persistent pathogens such as Klebsiella pneumoniae and Acinetobacter baumannii was observed. Genome sequencing data were combined with metabolomics experiments in the recently developed computational approach, NPLinker, which was used to link BGC and molecular features to prioritise strains for further investigation based on biosynthetic and chemical information. Herein, we putatively identified the known metabolites ectoine and chrloramphenicol which, through NPLinker, were linked to their associated BGCs. The metabologenomics approach followed in this study can potentially be applied to any large microbial datasets for accelerating the discovery of new (bioactive) specialised metabolites.
Assuntos
Actinobacteria/metabolismo , Genômica/métodos , Metabolômica/métodos , Clima Frio , Descoberta de Drogas , Genoma BacterianoRESUMO
Autoimmune thyroid disease (AITD) is a common autoimmune disease. In a GWAS meta-analysis of 110,945 cases and 1,084,290 controls, 290 sequence variants at 225 loci are associated with AITD. Of these variants, 115 are previously unreported. Multiomics analysis yields 235 candidate genes outside the MHC-region and the findings highlight the importance of genes involved in T-cell regulation. A rare 5'-UTR variant (rs781745126-T, MAF = 0.13% in Iceland) in LAG3 has the largest effect (OR = 3.42, P = 2.2 × 10-16) and generates a novel start codon for an open reading frame upstream of the canonical protein translation initiation site. rs781745126-T reduces mRNA and surface expression of the inhibitory immune checkpoint LAG-3 co-receptor on activated lymphocyte subsets and halves LAG-3 levels in plasma among heterozygotes. All three homozygous carriers of rs781745126-T have AITD, of whom one also has two other T-cell mediated diseases, that is vitiligo and type 1 diabetes. rs781745126-T associates nominally with vitiligo (OR = 5.1, P = 6.5 × 10-3) but not with type 1 diabetes. Thus, the effect of rs781745126-T is akin to drugs that inhibit LAG-3, which unleash immune responses and can have thyroid dysfunction and vitiligo as adverse events. This illustrates how a multiomics approach can reveal potential drug targets and safety concerns.
Assuntos
Antígenos CD , Códon de Iniciação , Predisposição Genética para Doença , Proteína do Gene 3 de Ativação de Linfócitos , Humanos , Códon de Iniciação/genética , Antígenos CD/genética , Antígenos CD/metabolismo , Diabetes Mellitus Tipo 1/genética , Diabetes Mellitus Tipo 1/imunologia , Feminino , Polimorfismo de Nucleotídeo Único , Vitiligo/genética , Masculino , Estudo de Associação Genômica Ampla , Tireoidite Autoimune/genética , Regiões 5' não Traduzidas/genética , Estudos de Casos e Controles , Islândia , AdultoRESUMO
Urticaria is a skin disorder characterized by outbreaks of raised pruritic wheals. In order to identify sequence variants associated with urticaria, we performed a meta-analysis of genome-wide association studies for urticaria with a total of 40,694 cases and 1,230,001 controls from Iceland, the UK, Finland, and Japan. We also performed transcriptome- and proteome-wide analyses in Iceland and the UK. We found nine sequence variants at nine loci associating with urticaria. The variants are at genes participating in type 2 immune responses and/or mast cell biology (CBLB, FCER1A, GCSAML, STAT6, TPSD1, ZFPM1), the innate immunity (C4), and NF-κB signaling. The most significant association was observed for the splice-donor variant rs56043070[A] (hg38: chr1:247556467) in GCSAML (MAF = 6.6%, OR = 1.24 (95%CI: 1.20-1.28), P-value = 3.6 × 10-44). We assessed the effects of the variants on transcripts, and levels of proteins relevant to urticaria pathophysiology. Our results emphasize the role of type 2 immune response and mast cell activation in the pathogenesis of urticaria. Our findings may point to an IgE-independent urticaria pathway that could help address unmet clinical need.
Assuntos
Estudo de Associação Genômica Ampla , Urticária , Humanos , Mastócitos , Urticária/genética , Splicing de RNA , ProteomaRESUMO
Secondary metabolites can be viewed as a chemical language, facilitating communication between microorganisms. From an ecological point of view, this metabolite exchange is in constant flux due to evolutionary and environmental pressures. From a biomedical perspective, the chemistry is unsurpassed for its antibiotic properties. Genome sequencing of microorganisms has revealed a large reservoir of Biosynthetic Gene Clusters (BGCs); however, linking these to the secondary metabolites they encode is currently a major bottleneck to chemical discovery. This linking of genes to metabolites with experimental validation will aid the elicitation of silent or cryptic (not expressed under normal laboratory conditions) BGCs. As a result, this will accelerate chemical dereplication, our understanding of gene transcription and provide a comprehensive resource for synthetic biology. This will ultimately provide an improved understanding of both the biosynthetic and chemical space. In recent years, integrating these complex metabolomic and genomic data sets has been achieved using a spectrum of manual and automated approaches. In this review, we cover examples of these approaches, while addressing current challenges and future directions in linking these data sets.