Your browser doesn't support javascript.
loading
NetActivity enhances transcriptional signals by combining gene expression into robust gene set activity scores through interpretable autoencoders.
Ruiz-Arenas, Carlos; Marín-Goñi, Irene; Wang, Liewei; Ochoa, Idoia; Pérez-Jurado, Luis A; Hernaez, Mikel.
  • Ruiz-Arenas C; Computational Biology Program, CIMA University of Navarra, idiSNA, Pamplona 31008, Spain.
  • Marín-Goñi I; Department MELIS, Universitat Pompeu Fabra (UPF), Barcelona, Spain.
  • Wang L; Computational Biology Program, CIMA University of Navarra, idiSNA, Pamplona 31008, Spain.
  • Ochoa I; Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, Rochester, MN 55905, USA.
  • Pérez-Jurado LA; Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, Rochester, MN 55905, USA.
  • Hernaez M; Department of Electrical and Electronics Engineering, Tecnun, University of Navarra, Donostia, Spain.
Nucleic Acids Res ; 52(9): e44, 2024 May 22.
Article en En | MEDLINE | ID: mdl-38597610
ABSTRACT
Grouping gene expression into gene set activity scores (GSAS) provides better biological insights than studying individual genes. However, existing gene set projection methods cannot return representative, robust, and interpretable GSAS. We developed NetActivity, a machine learning framework that generates GSAS based on a sparsely-connected autoencoder, where each neuron in the inner layer represents a gene set. We proposed a three-tier training that yielded representative, robust, and interpretable GSAS. NetActivity model was trained with 1518 GO biological processes terms and KEGG pathways and all GTEx samples. NetActivity generates GSAS robust to the initialization parameters and representative of the original transcriptome, and assigned higher importance to more biologically relevant genes. Moreover, NetActivity returns GSAS with a more consistent definition and higher interpretability than GSVA and hipathia, state-of-the-art gene set projection methods. Finally, NetActivity enables combining bulk RNA-seq and microarray datasets in a meta-analysis of prostate cancer progression, highlighting gene sets related to cell division, key for disease progression. When applied to metastatic prostate cancer, gene sets associated with cancer progression were also altered due to drug resistance, while a classical enrichment analysis identified gene sets irrelevant to the phenotype. NetActivity is publicly available in Bioconductor and GitHub.
Asunto(s)

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Neoplasias de la Próstata Límite: Humans / Male Idioma: En Año: 2024 Tipo del documento: Article

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Neoplasias de la Próstata Límite: Humans / Male Idioma: En Año: 2024 Tipo del documento: Article