Exploring predictive states via Cantor embeddings and Wasserstein distance
-
Published:2022-12
Issue:12
Volume:32
Page:123115
-
ISSN:1054-1500
-
Container-title:Chaos: An Interdisciplinary Journal of Nonlinear Science
-
language:en
-
Short-container-title:Chaos
Author:
Loomis Samuel P.1ORCID,
Crutchfield James P.1ORCID
Affiliation:
1. Complexity Sciences Center and Department of Physics and Astronomy, University of California at Davis, One Shields Avenue, Davis, California 95616, USA
Abstract
Predictive states for stochastic processes are a nonparametric and interpretable construct with relevance across a multitude of modeling paradigms. Recent progress on the self-supervised reconstruction of predictive states from time-series data focused on the use of reproducing kernel Hilbert spaces. Here, we examine how Wasserstein distances may be used to detect predictive equivalences in symbolic data. We compute Wasserstein distances between distributions over sequences (“predictions”) using a finite-dimensional embedding of sequences based on the Cantor set for the underlying geometry. We show that exploratory data analysis using the resulting geometry via hierarchical clustering and dimension reduction provides insight into the temporal structure of processes ranging from the relatively simple (e.g., generated by finite-state hidden Markov models) to the very complex (e.g., generated by infinite-state indexed grammars).
Funder
Army Research Office
Templeton World Charity Foundation
Foundational Questions Institute
Subject
Applied Mathematics,General Physics and Astronomy,Mathematical Physics,Statistical and Nonlinear Physics
Reference36 articles.
1. D. R. Upper, “Theory and algorithms for hidden Markov models and generalized hidden Markov models,” Ph.D. thesis (University of California, Berkeley, 1997).
2. Inferring statistical complexity
3. Observable Operator Models for Discrete Stochastic Time Series
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献