Author:
Omanović Amra, Kazan Hilal, Oblak Polona, Curk Tomaž
Abstract
Background
Matrix factorization methods are linear models, with limited capability to model complex relations. In our work, we use the tropical semiring to introduce non-linearity into matrix factorization models. We propose a method called Sparse Tropical Matrix Factorization (STMF) for the estimation of missing (unknown) values in sparse data.
Results
We evaluate the efficiency of the STMF method on both synthetic data and biological data in the form of gene expression measurements downloaded from The Cancer Genome Atlas (TCGA) database. Tests on unique synthetic data showed that the STMF approximation achieves a higher correlation than non-negative matrix factorization (NMF), which is unable to recover patterns effectively. On real data, STMF outperforms NMF on six out of nine gene expression datasets. While NMF assumes a normal distribution and tends toward the mean value, STMF can better fit extreme values and distributions.
Conclusion
STMF is the first work that uses the tropical semiring on sparse data. We show that in certain cases semirings are useful because they consider the structure, which is different and simpler to understand than it is with standard linear algebra.
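The tropical matrix product that the abstract refers to can be illustrated with a small sketch. It assumes the max-plus form of the tropical semiring, where addition is replaced by `max` and multiplication by `+`; the abstract itself says only "tropical semiring". The function names, the use of `None` for missing entries, and the absolute-error measure are hypothetical choices made for illustration and are not taken from the authors' implementation.

```python
def maxplus_product(U, V):
    """Max-plus product: result[i][j] = max over k of (U[i][k] + V[k][j])."""
    rank = len(V)
    return [
        [max(U[i][k] + V[k][j] for k in range(rank)) for j in range(len(V[0]))]
        for i in range(len(U))
    ]


def masked_abs_error(X, U, V):
    """Sum of absolute errors over observed entries only (None marks missing)."""
    approx = maxplus_product(U, V)
    return sum(
        abs(x - a)
        for row_x, row_a in zip(X, approx)
        for x, a in zip(row_x, row_a)
        if x is not None  # skip missing (unknown) values
    )


# Illustrative factor matrices (assumed values, rank 2).
U = [[0.0, 2.0],
     [1.0, 0.0]]
V = [[3.0, 1.0],
     [0.0, 4.0]]

approx = maxplus_product(U, V)

# A sparse data matrix that agrees with the product where it is observed.
X = [[3.0, None],
     [4.0, 4.0]]
error = masked_abs_error(X, U, V)
```

Because each entry of the product is determined by a single maximizing term rather than by a sum over all terms, the approximation is not pulled toward an average, which is consistent with the abstract's remark about fitting extreme values.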
Publisher
Springer Science and Business Media LLC
Subject
Applied Mathematics, Computer Science Applications, Molecular Biology, Biochemistry, Structural Biology
Cited by
5 articles.