Retrieving Continuous Time Event Sequences using Neural Temporal Point Processes with Learnable Hashing-Reference-Cited by-同舟云学术

Retrieving Continuous Time Event Sequences using Neural Temporal Point Processes with Learnable Hashing

Published:2024-09-03 Issue: Volume: Page:
ISSN:2157-6904
Container-title:ACM Transactions on Intelligent Systems and Technology
language:en
Short-container-title:ACM Trans. Intell. Syst. Technol.

Author:

Gupta Vinayak¹^ORCID,Bedathur Srikanta²^ORCID,De Abir³^ORCID

Affiliation:

1. University of Washington Seattle, USA

2. Indian Institute of Technology Delhi, India

3. Indian Institute of Technology Bombay, India

Abstract

Temporal sequences have become pervasive in various real-world applications such as finance, spatial mobility, health records, etc. Consequently, the volume of data generated in the form of continuous time-event sequence(s) or CTES(s) has increased exponentially in the past few years. Thus, a significant fraction of the ongoing research on CTES datasets involves designing models to address downstream tasks such as next-event prediction, long-term forecasting, sequence classification etc. The recent developments in predictive modeling using marked temporal point processes (MTPP) have enabled an accurate characterization of several real-world applications involving the CTESs. However, due to the complex nature of these CTES datasets, the task of large-scale retrieval of temporal sequences has been overlooked by the past literature. In detail, by CTES retrieval we mean that for an input query sequence, a retrieval system must return a ranked list of relevant sequences from a large corpus. To tackle this, we propose NeuroSeqRet , a first-of-its-kind framework designed specifically for end-to-end CTES retrieval. Specifically, NeuroSeqRet introduces multiple enhancements over standard retrieval frameworks and first applies a trainable unwarping function on the query sequence which makes it comparable with corpus sequences, especially when a relevant query-corpus pair has individually different attributes. Next, it feeds the unwarped query sequence and the corpus sequence into MTPP-guided neural relevance models. We develop four variants of the relevance model for different kinds of applications based on the trade-off between accuracy and efficiency. We also propose an optimization framework to learn binary sequence embeddings from the relevance scores, suitable for the locality-sensitive hashing leading to a significant speedup in returning top-K results for a given query sequence. Our experiments with several datasets show the significant accuracy boost of NeuroSeqRet beyond several baselines, as well as the efficacy of our hashing mechanism.

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3691349

Reference64 articles.

1. Rakesh Agrawal, Christos Faloutsos, and Arun N. Swami. 1993. Efficient Similarity Search In Sequence Databases. In FODO.

2. Sara Alaee Kaveh Kamgar and Eamonn Keogh. 2020. Matrix Profile XXII: Exact Discovery of Time Series Motifs under DTW. In ICDM.

3. The great time series classification bake off: a review and experimental evaluation of recent algorithmic advances

4. Survival of patients with severe congestive heart failure treated with oral milrinone

5. Mathieu Blondel Arthur Mensch and Jean-Philippe Vert. 2021. Differentiable Divergences Between Time Series. In AISTATS.