Semantic ontologies for multimedia indexing (SOMI)

Authors

Bendib Issam, Ridda Laouar Mohamed, Hacken Richard, Miles Mathew

Abstract

Purpose: Digital media is produced at a speed and scale that far outpace conventional indexing by humans. Managing Big Data for e-library speech resources requires an automated metadata solution. The paper aims to discuss these issues.

Design/methodology/approach: A conceptual model called semantic ontologies for multimedia indexing (SOMI) allows for the assembly of speech objects, the encapsulation of semantic associations between phonic units and the definition of indexing techniques designed to invoke and make full use of the semantic ontologies. A literature review and architectural overview are followed by evaluation techniques and a conclusion.

Findings: This approach is possible only because of recent innovations in automated speech recognition. The introduction of semantic keyword spotting allows for indexing models that disambiguate and prioritize meaning using probability algorithms within a word confusion network. AI error-training procedures are used to seek an optimum for each index item.

Research limitations/implications: Validation and implementation of this approach within the field of digital libraries remain under development, but rapid developments in technology and research show conceptual promise for automated speech indexing.

Practical implications: Preliminary tests of the SOMI process show that hybrid semantic-ontological approaches produce better accuracy than semantic automation alone.

Social implications: As testing proceeds on recorded conference talks at the University of Tebessa (Algeria), other digital archives can look toward similar indexing, which would mean greater access to sound file metadata.

Originality/value: Huge masses of spoken data, unmanageable for a human indexer, can prospectively receive semantically sorted and prioritized indexing (generated metadata, not transcription) automatically, quickly and accurately.
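The abstract does not give the SOMI algorithms, so the following is only a minimal sketch of the general idea it names: keyword spotting over a word confusion network, with an ontology used to group and rank the spotted terms. The network contents, the ontology, and the function names `expected_counts` and `rank_concepts` are all hypothetical, and scoring by summed posteriors is an assumed stand-in for whatever probability model the paper actually uses.

```python
from collections import defaultdict

# A word confusion network modeled as a list of time slots; each slot maps
# competing word hypotheses to posterior probabilities.
# All values here are invented for illustration.
wcn = [
    {"digital": 0.9, "visual": 0.1},
    {"library": 0.6, "liberty": 0.4},
    {"index": 0.8, "indicts": 0.2},
]

# Hypothetical ontology: concept -> surface terms that evoke it.
ontology = {
    "digital_library": ["digital", "library"],
    "indexing": ["index", "indexing"],
    "law": ["liberty", "indicts"],
}


def expected_counts(network):
    """Sum each word's posterior over all slots (its expected count)."""
    counts = defaultdict(float)
    for slot in network:
        for word, posterior in slot.items():
            counts[word] += posterior
    return dict(counts)


def rank_concepts(network, onto):
    """Score each concept by the expected counts of its terms, highest first."""
    counts = expected_counts(network)
    scores = {
        concept: sum(counts.get(term, 0.0) for term in terms)
        for concept, terms in onto.items()
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    for concept, score in rank_concepts(wcn, ontology):
        print(f"{concept}: {score:.2f}")
```

In this toy setup, a concept whose terms reinforce one another across slots outranks one supported only by low-probability recognition alternatives, which is one plausible reading of how ontological context could help disambiguate index terms.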

Publisher

Emerald

Subject

Library and Information Sciences, Information Systems

Cited by 6 articles.

1. A novel data quality framework for assessment of scientific lecture video indexing;Library Hi Tech;2023-07-14

2. Factors affecting the adoption of integrated semantic digital libraries (SDLs): a systematic review;Library Hi Tech;2022-08-05

3. A Decision Support System for Managing Demand-Driven Collection Development in University Digital Libraries;Research Anthology on Decision Support Systems and Decision Management in Healthcare, Business, and Engineering;2021

4. Quran content representation in NLP;Proceedings of the 10th International Conference on Information Systems and Technologies;2020-06-04

5. A Decision Support System for Managing Demand-Driven Collection Development in University Digital Libraries;International Journal of Information Systems and Social Change;2019-10
