OAVA: the open audio-visual archives aggregator

Published:2023-12-16
ISSN:1432-5012
Container-title:International Journal on Digital Libraries
language:en
Short-container-title:Int J Digit Libr

Author:

Charitidis Polychronis, Moschos Sotirios, Bakouras Chrysostomos, Doropoulos Stavros, Makris Giorgos, Mauropoulos Nikolas, Nitsos Ilias, Zapounidou Sofia, Malliari Afrodite

Abstract

The purpose of the current article is to provide an overview of an open-access audiovisual aggregation and search service platform developed for Greek audiovisual content during the OAVA (Open Access AudioVisual Archive) project. The platform allows the search of audiovisual resources utilizing metadata descriptions, as well as full-text search utilizing content generated from automatic speech recognition (ASR) processes through deep learning models. A dataset containing reliable Greek audiovisual content providers and their resources (1710 in total) is created. Both providers and resources are reviewed according to specific criteria already established and used for content aggregation purposes, to ensure the quality of the content and to avoid copyright infringements. Well-known aggregation services and well-established schemas for audiovisual resources have been studied and considered regarding both aggregated content and metadata. Most Greek audiovisual content providers do not use established metadata schemas when publishing their content, nor is technical cooperation with them guaranteed. Thus, a model is developed for reconciliation and aggregation. To utilize audiovisual resources, the OAVA platform makes use of state-of-the-art ASR approaches. The OAVA platform supports Greek and English speech-to-text models. Specifically for Greek, to mitigate the scarcity of available datasets, a large-scale ASR dataset is annotated to train and evaluate deep learning architectures. The result of the above-mentioned efforts, namely selection of content, metadata, development of appropriate ASR techniques, and aggregation and enrichment of content and metadata, is the OAVA platform. This unified search mechanism for Greek audiovisual content will serve teaching, research, and cultural activities. The OAVA platform is available at: https://openvideoarchives.gr/.

Funder

European Regional Development Fund

Operational Program Competitiveness, Entrepreneurship and Innovation

Publisher

Springer Science and Business Media LLC

Subject

Library and Information Sciences

Link

https://link.springer.com/content/pdf/10.1007/s00799-023-00384-z.pdf

References (66 articles)

1. Ardila, R., Branson, M., Davis, K., et al.: Common voice: a massively-multilingual speech corpus (2019). arXiv preprint arXiv:1912.06670

2. Barry, M., Sifton, D.: Towards a cross-Canadian digital library platform. In: 2017 ACM/IEEE Joint Conference on Digital Libraries (JCDL). IEEE, pp 1–2 (2017)

3. Bashir, B., Nasreen, N., Loan, F.A.: National digital library of India: an overview. Library Philosophy and Practice (e-journal) (2019). https://digitalcommons.unl.edu/libphilprac/2601 (visited April 28, 2020)

4. CIDOC: CIDOC CRM scope (n.d.). https://www.cidoc-crm.org/scope (last accessed 2023-04-02)

5. Cieri, C., Miller, D., Walker, K.: The fisher corpus: a resource for the next generations of speech-to-text. In: LREC, pp 69–71 (2004)