Affiliation:
1. University of Maryland, College Park, MD
2. University of Minnesota, Minneapolis, MN
Abstract
Mapping of spatial hotspots, i.e., regions with significantly higher rates of generating cases of certain events (e.g., disease or crime cases), is an important task in diverse societal domains, including public health, public safety, transportation, agriculture, environmental science, and so on. Clustering techniques required by these domains differ from traditional clustering methods due to the high economic and social costs of spurious results (e.g., false alarms of crime clusters). As a result, statistical rigor is needed explicitly to control the rate of spurious detections. To address this challenge, techniques for statistically-robust clustering (e.g., scan statistics) have been extensively studied by the data mining and statistics communities. In this survey, we present an up-to-date and detailed review of the models and algorithms developed by this field. We first present a general taxonomy for statistically-robust clustering, covering key steps of data and statistical modeling, region enumeration and maximization, and significance testing. We further discuss different paradigms and methods within each of the key steps. Finally, we highlight research gaps and potential future directions, which may serve as a stepping stone in generating new ideas and thoughts in this growing field and beyond.
Funder
NSF
USDOD
ARPA-E
USDA
NIH
Google AI for Social Good
University of Maryland
Publisher
Association for Computing Machinery (ACM)
Subject
General Computer Science,Theoretical Computer Science
Reference228 articles.
1. https://surveillance.cancer.gov// 2017 National Cancer Institute Surveillance Research Program
2. https://www.satscan.org/datasets/nebenchmark/index.html 2021 Northeastern US benchmark
3. https://www.safegraph.com/ 2021 SafeGraph
4. https://www.satscan.org/ 2021 SaTScan
5. https://www.satscan.org/datasets.html 2021 SaTScan datasets
Cited by
18 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献