Abstract
AbstractSpherical embedding is an important tool in several fields of data analysis, including environmental data, spatial statistics, text mining, gene expression analysis, medical research and, in general, areas in which the geodesic distance is a relevant factor. Many data acquisition technologies are related to massive data acquisition, and these high-dimensional vectors are often normalised and transformed into spherical data. In this representation of data on spherical surfaces, multidimensional scaling plays an important role. Traditionally, the methods of clustering and representation have been combined, since the precision of the representation tends to decrease when a large number of objects are involved, which makes interpretation difficult. In this paper, we present a model that partitions objects into classes while simultaneously representing the cluster centres on a spherical surface based on geodesic distances. The model combines a partition algorithm based on the approximation of dissimilarities to geodesic distances with a representation procedure for geodesic distances. In this process, the dissimilarities are transformed in order to optimise the radius of the sphere. The efficiency of the procedure described is analysed by means of an extensive Monte Carlo experiment, and its usefulness is illustrated for real data sets. Supplementary material to this paper is provided online.
Publisher
Springer Science and Business Media LLC
Reference43 articles.
1. Alegría A, Porcu E, Furrer R, Mateu Jorge (2018) Covariance functions for multivariate Gaussian fields evolving temporally over planet earth. Stoch Environ Res Risk Assess 33:1593–1608. https://doi.org/10.1007/s00477-019-01707-w. (2019)
2. Banerjee A, Dhillon IS, Ghosh J, Sra S (2005) Clustering on the unit hypersphere using von Mises–Fisher distributions. J Mach Learn Res 6:1345–1382
3. Bentler PM, Weeks DG (1978) Restricted multidimensional scaling models. J Math Psychol 17(2):138–151. https://doi.org/10.1016/0022-2496(78)90027-5
4. Bock HH (1986) Multidimensional scaling in the framework of cluster analysis. In: Hermes HJ, Optiz O, Degens PO (eds) Studien zur Klassifikation: [Classification and its environment], vol 17. INDEKS-Verlag, Frankfurt, pp 247–258
5. Bock HH (1987) On the interface between cluster analysis, principal component analysis, and multidimensional scaling. In: Bozdogan H, Gupta AK (eds) Multivariate statistical modeling and data analysis. Reidel, New York, pp 17–34
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献