Approximate calculation of Tukey's depth and median with high-dimensional data
-
Published:2018
Issue:4
Volume:28
Page:475-499
-
ISSN:0354-0243
-
Container-title:Yugoslav Journal of Operations Research
-
language:en
-
Short-container-title:YUGOSLAV J OPERATION
Author:
Bogicevic Milica1,
Merkle Milan1
Affiliation:
1. School of Electrical Engineering, Belgrade
Abstract
We present a new fast approximate algorithm for Tukey (halfspace) depth level sets and its implementation-ABCDepth. Given a d-dimensional data set for any d ? 1, the algorithm is based on a representation of level sets as intersections of balls in Rd. Our approach does not need calculations of projections of sample points to directions. This novel idea enables calculations of approximate level sets in very high dimensions with complexity that is linear in d, which provides a great advantage over all other approximate algorithms. Using different versions of this algorithm, we demonstrate approximate calculations of the deepest set of points ("Tukey median") and Tukey's depth of a sample point or out-of-sample point, all with a linear in d complexity. An additional theoretical advantage of this approach is that the data points are not assumed to be in "general position". Examples with real and synthetic data show that the executing time of the algorithm in all mentioned versions in high dimensions is much smaller than the time of other implemented algorithms. Also, our algorithms can be used with thousands of multidimensional observations.
Funder
Ministry of Education, Science and Technological Development of the Republic of Serbia
Publisher
National Library of Serbia
Subject
Management Science and Operations Research
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Tukey’s Depth for Object Data;Journal of the American Statistical Association;2022-02-03
2. Subsampling Halfspace Depth;SSRN Electronic Journal;2022