EQUI-VOCAL: Synthesizing Queries for Compositional Video Events from Limited User Interactions-Reference-Cited by-同舟云学术

EQUI-VOCAL: Synthesizing Queries for Compositional Video Events from Limited User Interactions

Published:2023-07 Issue:11 Volume:16 Page:2714-2727
ISSN:2150-8097
Container-title:Proceedings of the VLDB Endowment
language:en
Short-container-title:Proc. VLDB Endow.

Author:

Zhang Enhao¹,Daum Maureen¹,He Dong¹,Haynes Brandon²,Krishna Ranjay¹,Balazinska Magdalena¹

Affiliation:

1. University of Washington

2. Microsoft Gray Systems Lab

Abstract

We introduce EQUI-VOCAL: a new system that automatically synthesizes queries over videos from limited user interactions. The user only provides a handful of positive and negative examples of what they are looking for. EQUI-VOCAL utilizes these initial examples and additional ones collected through active learning to efficiently synthesize complex user queries. Our approach enables users to find events without database expertise, with limited labeling effort, and without declarative specifications or sketches. Core to EQUI-VOCAL's design is the use of spatio-temporal scene graphs in its data model and query language and a novel query synthesis approach that works on large and noisy video data. Our system outperforms two baseline systems---in terms of F1 score, synthesis time, and robustness to noise---and can flexibly synthesize complex queries that the baselines do not support.

Publisher

Association for Computing Machinery (ACM)

Subject

General Earth and Planetary Sciences,Water Science and Technology,Geography, Planning and Development

Link

https://dl.acm.org/doi/pdf/10.14778/3611479.3611482

Reference85 articles.

1. Beam search pruning in speech recognition using a posterior probability-based confidence measure

2. Real-Time Video Analytics;Ananthanarayanan Ganesh;The Killer App for Edge Computing. Computer,2017

3. Michael R. Anderson , Michael J. Cafarella , Germán Ros , and Thomas F . Wenisch . 2019 . Physical Representation-Based Predicate Optimization for a Visual Analytics Database. In ICDE. 1466--1477. Michael R. Anderson, Michael J. Cafarella, Germán Ros, and Thomas F. Wenisch. 2019. Physical Representation-Based Predicate Optimization for a Visual Analytics Database. In ICDE. 1466--1477.

4. Unmanned Aerial Aircraft Systems for transportation engineering: Current practice and future challenges;Barmpounakis Emmanouil N;IJTST,2016

5. Favyen Bastani , Songtao He , Arjun Balasingam , Karthik Gopalakrishnan , Mohammad Alizadeh , Hari Balakrishnan , Michael J. Cafarella , Tim Kraska , and Sam Madden . 2020 . MIRIS: Fast Object Track Queries in Video. In SIGMOD. 1907--1921. Favyen Bastani, Songtao He, Arjun Balasingam, Karthik Gopalakrishnan, Mohammad Alizadeh, Hari Balakrishnan, Michael J. Cafarella, Tim Kraska, and Sam Madden. 2020. MIRIS: Fast Object Track Queries in Video. In SIGMOD. 1907--1921.

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. EQUI-VOCAL Demonstration: Synthesizing Video Queries from User Interactions;Proceedings of the VLDB Endowment;2023-08

2. EQUI-VOCAL: Synthesizing Queries for Compositional Video Events from Limited User Interactions;Proceedings of the VLDB Endowment;2023-07

3. Video Situation Monitoring to Improve Quality of Life;New Trends in Database and Information Systems;2023