Word Prediction Model Using an Elaborated RNN Integrated with N-gram for Efficient Text Input

Authors:

Ikegami Yukino1, Tsuruta Setsuo2, Kutics Andrea3, Damiani Ernesto4, Knauf Rainer5

Affiliations:

1. IO Inc., Tokyo, Japan

2. Tokyo Denki University

3. International Christian University

4. Khalifa University of Science and Technology

5. Ilmenau University of Technology

Abstract

Smartphone users number more than two billion worldwide. Heavy users of texting applications rely on input prediction to reduce typing effort. For languages written in the Roman alphabet, many prediction techniques are available. Japanese text, however, is based on multiple character sets such as Kanji, Hiragana, and Katakana; its input is time-intensive, and next-word prediction remains an open challenge. To tackle this, a hybrid language model is proposed that integrates a Recurrent Neural Network (RNN) with an n-gram model: RNNs are powerful at learning long sequences for next-word prediction, while n-gram models are best at completing the current word. Because the performance gain of the RNN is paid for by higher time complexity, the model is best deployed on a client-server architecture: the computationally heavy RNN language model (RNN-LM) runs on the server, while the n-gram model runs on the client. The RNN-LM consists of an input layer equipped with word embedding, hidden layers connected through LSTMs (Long Short-Term Memories), and an output layer. Training is done via BPTT (Backpropagation Through Time), which is elaborated with learning rate refinement and gradient norm scaling for robustness. To avoid overfitting, dropout is applied everywhere except within the LSTM. Owing to this synergetic elaboration, the model shows 10% lower perplexity than Zaremba's strong conventional models in our experiment. The model has been incorporated into an IME (Input Method Editor) we call Flick. In our experiments, Flick outperforms Mozc (Google Japanese Input) by 16% in input time and 34% in the number of keystrokes.
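
The model and training recipe described in the abstract can be made concrete with a short sketch. The following is a minimal PyTorch rendering, under our own assumptions, of an LSTM language model with a word-embedding input layer, dropout applied outside the LSTM recurrence, and a BPTT update with gradient norm scaling; the class and function names (RNNLM, train_step) and all hyperparameter values are illustrative, not the paper's reported settings.

```python
import torch
import torch.nn as nn

class RNNLM(nn.Module):
    """LSTM language model: word-embedding input layer, stacked LSTM
    hidden layers, and a linear output layer over the vocabulary.
    Hyperparameter values here are assumptions for illustration."""

    def __init__(self, vocab_size, embed_dim=200, hidden_dim=200,
                 num_layers=2, dropout=0.5):
        super().__init__()
        # Dropout is applied to embeddings and LSTM outputs only,
        # not inside the recurrence ("except for LSTM").
        self.drop = nn.Dropout(dropout)
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.lstm = nn.LSTM(embed_dim, hidden_dim, num_layers,
                            batch_first=True)
        self.out = nn.Linear(hidden_dim, vocab_size)

    def forward(self, tokens, state=None):
        x = self.drop(self.embed(tokens))      # (batch, steps, embed)
        h, state = self.lstm(x, state)         # (batch, steps, hidden)
        return self.out(self.drop(h)), state   # next-word logits

def train_step(model, optimizer, tokens, targets, max_norm=5.0):
    """One truncated-BPTT update with gradient norm scaling."""
    optimizer.zero_grad()
    logits, _ = model(tokens)
    loss = nn.functional.cross_entropy(
        logits.reshape(-1, logits.size(-1)), targets.reshape(-1))
    loss.backward()
    # Rescale gradients so their global norm never exceeds max_norm.
    torch.nn.utils.clip_grad_norm_(model.parameters(), max_norm)
    optimizer.step()
    return loss.item()
```

At evaluation time, perplexity is the exponential of the mean cross-entropy over held-out tokens, which is the quantity behind the abstract's 10% comparison against Zaremba's models.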

Publisher

Research Square Platform LLC
