Affiliation:
1. Dongguan University of Technology, Dongguan, China and The Hong Kong University of Science and Technology
2. The Hong Kong University of Science and Technology
Abstract
We propose using active learning for extractive speech summarization in order to reduce human effort in generating reference summaries. Active learning chooses a selective set of samples to be labeled. We propose a combination of informativeness and representativeness criteria for selection. We further propose a semi-automatic method to generate reference summaries for presentation speech by using Relaxed Dynamic Time Warping (RDTW) alignment between presentation speech and its accompanied slides. Our summarization results show that the amount of labeled data needed for a given summarization accuracy can be reduced by more than 23% compared to random sampling.
Funder
Innovation and Technology Commmission
International Center for Advanced Communication Technologies (InterACT) at HKUST Funding
National Natural Science Foundation of China
Publisher
Association for Computing Machinery (ACM)
Subject
Computational Mathematics,Computer Science (miscellaneous)
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Youtube Transcript Synthesis;2023 5th International Conference on Advances in Computing, Communication Control and Networking (ICAC3N);2023-12-15
2. Speech Summarization for Tamil Language;Intelligent Speech Signal Processing;2019
3. Automatic Annotation of Voice Forum Content for Rural Users and Evaluation of Relevance;Proceedings of the 1st ACM SIGCAS Conference on Computing and Sustainable Societies;2018-06-20
4. Generic speech summarization of transcribed lecture videos: Using tags and their semantic relations;Journal of the Association for Information Science and Technology;2014-11-06