A Comparison of Different Machine Transliteration Models-Reference-Cited by-同舟云学术

A Comparison of Different Machine Transliteration Models

Published:2006-10-18 Issue: Volume:27 Page:119-151
ISSN:1076-9757
Container-title:Journal of Artificial Intelligence Research
language:
Short-container-title:jair

Author:

Oh J.,Choi K.,Isahara H.

Abstract

Machine transliteration is a method for automatically converting words in one language into phonetically equivalent ones in another language. Machine transliteration plays an important role in natural language applications such as information retrieval and machine translation, especially for handling proper nouns and technical terms. Four machine transliteration models -- grapheme-based transliteration model, phoneme-based transliteration model, hybrid transliteration model, and correspondence-based transliteration model -- have been proposed by several researchers. To date, however, there has been little research on a framework in which multiple transliteration models can operate simultaneously. Furthermore, there has been no comparison of the four models within the same framework and using the same data. We addressed these problems by 1) modeling the four models within the same framework, 2) comparing them under the same conditions, and 3) developing a way to improve machine transliteration through this comparison. Our comparison showed that the hybrid and correspondence-based models were the most effective and that the four models can be used in a complementary manner to improve machine transliteration performance.

Publisher

AI Access Foundation

Subject

Artificial Intelligence

Cited by 13 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Thirukkural Alagiduthal and Kural Venba Validation using Python;Journal of Trends in Computer Science and Smart Technology;2023-03

2. LSTM-Based Encoder–Decoder Model for Transliteration of English, Marathi, and Hindi Query;Advances in Data Science and Computing Technologies;2023

3. A review of machine transliteration, translation, evaluation metrics and datasets in Indian Languages;Multimedia Tools and Applications;2022-11-25

4. Machine Transliteration for Indian languages: Survey;2022 Seventh International Conference on Parallel, Distributed and Grid Computing (PDGC);2022-11-25

5. A Hybrid Machine Transliteration Model Based on Multi-source Encoder–Decoder Framework: English to Manipuri;SN Computer Science;2022-01-11