Affiliation:
1. Department of Chemistry, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece
2. Science and Technology in Archaeology and Culture Research Center, The Cyprus Institute, Nicosia 2121, Cyprus
Abstract
This paper assesses algorithms proposed for constructing confidence ellipses in multidimensional scaling (MDS) solutions and proposes a new approach to interpreting these confidence ellipses via hierarchical cluster analysis (HCA). It is shown that the most effective algorithm for constructing confidence ellipses involves the generation of simulated distances based on the original multivariate dataset and then the creation of MDS maps that are scaled, reflected, rotated, translated, and finally superimposed. For this algorithm, the stability measure of the average areas tends to zero with increasing sample size n following the power model, An−B, with positive B values ranging from 0.7 to 2 and high R-squared fitting values around 0.99. This algorithm was applied to create confidence ellipses in the MDS plots of squared Euclidean and Mahalanobis distances for continuous and binary data. It was found that plotting confidence ellipses in MDS plots offers a better visualization of the distance map of the populations under study compared to plotting single points. However, the confidence ellipses cannot eliminate the subjective selection of clusters in the MDS plot based simply on the proximity of the MDS points. To overcome this subjective selection, we should quantify the formation of clusters of proximal samples. Thus, in addition to the algorithm assessment, we propose a new approach that estimates all possible cluster probabilities associated with the confidence ellipses by applying HCA using distance matrices derived from these ellipses.
Subject
Computational Mathematics,Computational Theory and Mathematics,Numerical Analysis,Theoretical Computer Science
Reference35 articles.
1. Borg, I., and Groenen, P. (2005). Modern Multidimensional Scaling: Theory and Applications, Springer. [2nd ed.].
2. Cox, T.F., and Cox, M.A.A. (2001). Multidimensional Scaling, Chapman and Hall. [2nd ed.].
3. Johnson, R.A., and Wichern, D.W. (1998). Applied Multivariate Statistical Analysis, Prentice-Hall. [4th ed.].
4. More on Multidimensional Scaling and Unfolding in R: Smacof Version 2;Mair;J. Stat. Softw.,2022
5. De Leeuw, J. (2023, October 15). Pseudo Confidence Regions for MDS. Available online: https://rpubs.com/deleeuw/292595.