Building CLIA for Resource-Scarce African Languages

Author:

Tune Kula Kekeba1,Varma Vasudeva1

Affiliation:

1. International Institute of Information Technology, India

Abstract

Since most of the existing major search engines and commercial Information Retrieval (IR) systems are primarily designed for well-resourced European and Asian languages, they have paid little attention to the development of Cross-Language Information Access (CLIA) technologies for resource-scarce African languages. This paper presents the authors' experience in building CLIA for indigenous African languages, with a special focus on the development and evaluation of Oromo-English-CLIR. The authors have adopted a knowledge-based query translation approach to design and implement their initial Oromo-English CLIR (OMEN-CLIR). Apart from designing and building the first OMEN-CLIR from scratch, another major contribution of this study is assessing the performance of the proposed retrieval system at one of the well-recognized international Cross-Language Evaluation Forums like the CLEF campaign. The overall performance of OMEN-CLIR was found to be very promising and encouraging, given the limited amount of linguistic resources available for severely under-resourced African languages like Afaan Oromo.

Publisher

IGI Global

Reference40 articles.

1. Abdulsamed, M. (1994). Seerlugaa Afaan Oromoo. Finfinnee: Caffee Oromiyaa.

2. Building capacities in human language technology for African languages

3. Adugna, S., & Eisele, A. (2010). English – Oromo machine translation: An experiment using a statistical approach. In C. Nicoletta, Proceedings of the Seventh Conference on International Language Resources and Evaluation (LREC'10) (pp. 2196-2199). Valletta, Malta: European Language Resources Association (ELRA).

4. Arabic morphological analysis techniques: A comprehensive survey

5. An Amharic stemmer: Reducing words to their citation forms.;A.Alemu;Proceedings of the 5th Workshop on Important Unresolved Matters,2007

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3