Censored Least Squares for Imputing Missing Values in PARAFAC Tensor Factorization

Published: 2024-07-10

Author:

Hung Ethan C., Hodzic Enio, Tan Zhixin Cyrillus, Meyer Aaron S.

Abstract

Tensor factorization is a dimensionality reduction method applied to multidimensional arrays. These methods are useful for identifying patterns within a variety of biomedical datasets because they preserve the organizational structure of experiments and therefore aid in generating meaningful insights. However, missing data in the datasets being analyzed can pose challenges. Tensor factorization can be performed with some level of missing data and can reconstruct a complete tensor. Although tensor methods can impute these missing values, the choice of fitting algorithm may influence the fidelity of the imputations. Previous approaches, based on alternating least squares with prefilled values or direct optimization, suffer from introduced bias or slow computational performance. In this study, we propose that censored least squares can better handle missing values in data structured in tensor form. We ran censored least squares on four different biological datasets and compared its performance against alternating least squares with prefilled values and direct optimization. We benchmarked missing data performance using the error of imputation and the ability to infer masked values. Censored least squares appeared best suited for the analysis of high-dimensional biological data by accuracy and convergence metrics across several studies.
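
To illustrate the distinction the abstract draws, the sketch below contrasts a censored least squares solve with a prefilled one on a single linear subproblem of the kind that appears inside an alternating least squares update. It is a minimal illustration, not the authors' implementation: the function name `censored_lstsq`, the synthetic noise-free data, the 20% missing rate, and the choice of zero as the prefill value are all assumptions made for this example.

```python
import numpy as np


def censored_lstsq(A, B):
    """Solve A @ X ~= B column by column, skipping NaN entries of B.

    Hypothetical helper for illustration: each column of B is fit using
    only the rows where that column was actually observed.
    """
    X = np.empty((A.shape[1], B.shape[1]))
    for j in range(B.shape[1]):
        observed = ~np.isnan(B[:, j])
        X[:, j], *_ = np.linalg.lstsq(A[observed], B[observed, j], rcond=None)
    return X


# Synthetic, noise-free problem (assumed sizes, chosen only for the demo).
rng = np.random.default_rng(0)
A = rng.normal(size=(30, 3))
X_true = rng.normal(size=(3, 5))
B = A @ X_true

# Mask roughly 20% of the entries as missing.
B_missing = B.copy()
B_missing[rng.random(B.shape) < 0.2] = np.nan

# Censored solve: missing entries are excluded from the fit.
X_censored = censored_lstsq(A, B_missing)

# Prefilled solve: missing entries are replaced by zero before fitting.
X_prefilled, *_ = np.linalg.lstsq(A, np.nan_to_num(B_missing), rcond=None)

err_censored = np.abs(X_censored - X_true).max()
err_prefilled = np.abs(X_prefilled - X_true).max()
print(f"censored error:  {err_censored:.2e}")
print(f"prefilled error: {err_prefilled:.2e}")
```

In this noise-free setting the censored solve recovers the true coefficients up to floating-point error, while the prefilled solve is pulled toward the fill value. A full PARAFAC fit would apply a solve of this kind to each mode's unfolding in turn, which this sketch does not attempt.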

Publisher

Cold Spring Harbor Laboratory

References (37 articles)

1. Singular value decomposition for genome-wide expression data processing and modeling

2. Singular Value Decomposition and Principal Component Analysis

3. Tan ZC, Meyer AS. The structure is the message: preserving experimental context through tensor decomposition. February 2024. http://arxiv.org/abs/2402.16638

4. Tensor Decompositions and Applications

5. A survey of some tensor analysis techniques for biological systems