Your browser doesn't support javascript.
loading
Gene set proximity analysis: expanding gene set enrichment analysis through learned geometric embeddings, with drug-repurposing applications in COVID-19.
Cousins, Henry; Hall, Taryn; Guo, Yinglong; Tso, Luke; Tzeng, Kathy T H; Cong, Le; Altman, Russ B.
Afiliação
  • Cousins H; Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, CA 94305, USA.
  • Hall T; Optum Labs at UnitedHealth Group, Minneapolis, MN 55343, USA.
  • Guo Y; Optum Labs at UnitedHealth Group, Minneapolis, MN 55343, USA.
  • Tso L; Optum Labs at UnitedHealth Group, Minneapolis, MN 55343, USA.
  • Tzeng KTH; Optum Labs at UnitedHealth Group, Minneapolis, MN 55343, USA.
  • Cong L; Department of Genetics, Stanford University School of Medicine, Stanford, CA 94305, USA.
  • Altman RB; Department of Pathology, Stanford University School of Medicine, Stanford, CA 94305, USA.
Bioinformatics ; 39(1)2023 01 01.
Article em En | MEDLINE | ID: mdl-36394254
ABSTRACT
MOTIVATION Gene set analysis methods rely on knowledge-based representations of genetic interactions in the form of both gene set collections and protein-protein interaction (PPI) networks. However, explicit representations of genetic interactions often fail to capture complex interdependencies among genes, limiting the analytic power of such methods.

RESULTS:

We propose an extension of gene set enrichment analysis to a latent embedding space reflecting PPI network topology, called gene set proximity analysis (GSPA). Compared with existing methods, GSPA provides improved ability to identify disease-associated pathways in disease-matched gene expression datasets, while improving reproducibility of enrichment statistics for similar gene sets. GSPA is statistically straightforward, reducing to a version of traditional gene set enrichment analysis through a single user-defined parameter. We apply our method to identify novel drug associations with SARS-CoV-2 viral entry. Finally, we validate our drug association predictions through retrospective clinical analysis of claims data from 8 million patients, supporting a role for gabapentin as a risk factor and metformin as a protective factor for severe COVID-19. AVAILABILITY AND IMPLEMENTATION GSPA is available for download as a command-line Python package at https//github.com/henrycousins/gspa. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: COVID-19 Tipo de estudo: Prognostic_studies Limite: Humans Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2023 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: COVID-19 Tipo de estudo: Prognostic_studies Limite: Humans Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2023 Tipo de documento: Article País de afiliação: Estados Unidos