Affiliation:
1. National Defence University, Turkey
2. Bogazici University, Turkey
Abstract
In the era of data and computing, fast and reliable information retrieval has become critically important for security and military applications, and it remains so as the amount of available digital data grows every second. While text search and retrieval has produced mature products that everyone now uses daily in search engines, the retrieval of spoken content remains a young research area, especially for low-resource languages, where too little data is available to train reliable speech recognition systems. This chapter provides a thorough introduction to a speech retrieval task called “keyword search” and presents a novel approach based on similarity measure optimization. The case study was conducted on telephone conversations in three different languages, and thousands of keywords randomly selected from each language were searched for in the document. The experiments show that the technique introduced in this chapter offers a new methodology for handling terms that do not even exist in the vocabulary of the speech recognition system.