Your browser doesn't support javascript.
loading
Censored Least Squares for Imputing Missing Values in PARAFAC Tensor Factorization.
Hung, Ethan C; Hodzic, Enio; Tan, Zhixin Cyrillus; Meyer, Aaron S.
Affiliation
  • Hung EC; Computational and Systems Biology, University of California, Los Angeles (UCLA), USA.
  • Hodzic E; Department of Bioengineering, UCLA, USA.
  • Tan ZC; Bioinformatics Interdepartmental Program, UCLA, USA.
  • Meyer AS; Department of Bioengineering, UCLA, USA.
bioRxiv ; 2024 Jul 10.
Article de En | MEDLINE | ID: mdl-39026852
ABSTRACT
Tensor factorization is a dimensionality reduction method applied to multidimensional arrays. These methods are useful for identifying patterns within a variety of biomedical datasets due to their ability to preserve the organizational structure of experiments and therefore aid in generating meaningful insights. However, missing data in the datasets being analyzed can impose challenges. Tensor factorization can be performed with some level of missing data and reconstruct a complete tensor. However, while tensor methods may impute these missing values, the choice of fitting algorithm may influence the fidelity of these imputations. Previous approaches, based on alternating least squares with prefilled values or direct optimization, suffer from introduced bias or slow computational performance. In this study, we propose that censored least squares can better handle missing values with data structured in tensor form. We ran censored least squares on four different biological datasets and compared its performance against alternating least squares with prefilled values and direct optimization. We used the error of imputation and the ability to infer masked values to benchmark their missing data performance. Censored least squares appeared best suited for the analysis of high-dimensional biological data by accuracy and convergence metrics across several studies.
Mots clés

Texte intégral: 1 Collection: 01-internacional Base de données: MEDLINE Langue: En Journal: BioRxiv Année: 2024 Type de document: Article Pays d'affiliation: États-Unis d'Amérique

Texte intégral: 1 Collection: 01-internacional Base de données: MEDLINE Langue: En Journal: BioRxiv Année: 2024 Type de document: Article Pays d'affiliation: États-Unis d'Amérique