Abstract
AbstractThe concept of depth has proved very important for multivariate and functional data analysis, as it essentially acts as a surrogate for the notion of ranking of observations which is absent in more than one dimension. Motivated by the rapid development of technology, in particular the advent of ‘Big Data’, we extend here that concept to general metric spaces, propose a natural depth measure and explore its properties as a statistical depth function. Working in a general metric space allows the depth to be tailored to the data at hand and to the ultimate goal of the analysis, a very desirable property given the polymorphic nature of modern data sets. This flexibility is thoroughly illustrated by several real data analyses.
Funder
Ministerio de Ciencia Tecnología y Telecomunicaciones
Publisher
Springer Science and Business Media LLC
Subject
Computational Theory and Mathematics,Statistics, Probability and Uncertainty,Statistics and Probability,Theoretical Computer Science
Reference63 articles.
1. Arcones, M.A., Giné, E.: Limit theorems for U-processes. Ann. Probab. 21(3), 1494–1542 (1993)
2. Bartoszynski, R., Pearl, D.K., Lawrence, J.: A multidimensional goodness-of-fit test based on interpoint distances. J. Am. Stat. Assoc. 92, 577–586 (1997)
3. Bickel, P.J., Freedman, D.A.: Some asymptotic theory for the bootstrap. Ann. Stat. 9, 1196–1217 (1981)
4. Billard, L., Diday, E.: From the statistics of data to the statistics of knowledge: Symbolic Data Analysis. J. Am. Stat. Assoc. 98, 470–487 (2003)
5. Billard, L., Diday, E.: Symbolic Data Analysis: Conceptual Statistics and Data Mining, Wiley Series in Computational Statistics. Wiley (2007)
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献