AIx speed: Playback Speed Optimization using Listening Comprehension of Speech Recognition Models

Published: 2022-10-28
Container-title: The Adjunct Publication of the 35th Annual ACM Symposium on User Interface Software and Technology

Author:

Kawamura Kazuki¹, Rekimoto Jun¹

Affiliation:

1. The University of Tokyo, Japan and Sony CSL Kyoto, Japan

Funder

JST Moonshot R&D

JST CREST

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3526114.3558727

References: 19 articles.

1. Alexei Baevski, Steffen Schneider, and Michael Auli. 2020. vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations. In International Conference on Learning Representations.

2. Alexei Baevski, Yuhao Zhou, Abdelrahman Mohamed, and Michael Auli. 2020. wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations. In Advances in Neural Information Processing Systems.

3. SmartPlayer

4. Why College Students Watch Streaming Drama at Higher Playback Speed: The Uses and Gratifications Perspective

5. Keita Higuchi, Ryo Yonetani, and Yoichi Sato. 2017. EgoScanning: Quickly Scanning First-Person Videos with Egocentric Elastic Timelines. In Proc. ACM Conference on Human Factors in Computing Systems.

Cited by 1 article.

1. The Three-Stage Hierarchical Logistic Model Controlling Personalized Playback of Audio Information for Intelligent Tutoring Systems. IEEE Transactions on Learning Technologies, 2024.