Affiliation:
1. School of Electronic Engineering, Beijing University of Posts and Telecommunications, Haidian, Beijing, China
Abstract
‘‘Audiobook” is a multimedia-based reading technology that has emerged in recent years. Realizing the alignment of e-book text and book audio is the most important part of its processing. This article describes an audio and text alignment algorithm using deep learning and neural network technology to improve the efficiency and quality of audiobook production. The algorithm first uses dual-threshold endpoint detection technology to segment long audio into short audio with sentence dimensions and recognizes it as short text. The threshold is calculated by AIC-FCM optimized based on simulated annealing genetic algorithm. Then the algorithm uses Doc2vec optimized by the threshold prediction method based on the average length of the short text to calculate the text similarity. Finally, proofread and output the text sequence and audio segment aligned in the time dimension to meet the needs of audiobook production. Experiments show that compared to traditional audio and text alignment algorithms, the proposed algorithm is closer to the ideal segmentation result in long audio segmentation, and the alignment effect is basically the same as Doc2vec and the time complexity is reduced by about 35%.
Publisher
Association for Computing Machinery (ACM)
Reference49 articles.
1. PMRSS: Privacy-preserving medical record searching scheme for intelligent diagnosis in IoT healthcare;Sun Y.;IEEE Transactions on Industrial Informatics,2021
2. Z. Guo Y. Shen A. K. Bashir M. Imran and K. Yu. 2020. Robust spammer detection using collaborative neural network in internet of thing applications. IEEE Internet of Things Journal 8 12 (2020) 9549–9558.
3. Non-linear MIMO for industrial internet of things in cyber-physical systems;Gong Y.;IEEE Transactions on Industrial Informatics,2020
4. High-performance isolation computing technology for smart IoT healthcare in cloud environments;Zhang Y.;IEEE Internet of Things Journal,2021
5. L. Tan H. Xiao K. Yu et al. 2021. A blockchain-empowered crowdsourcing system for 5G-enabled smart cities [J]. Computer Standards & Interfaces 76 (2021) 103517.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献