Author:
Huang Guimin,Chen Jingru,Sun Zhenglin
Abstract
Abstract
Spell correction is already a mature field, the need to combine advantages of different methods for better performance arises in non-word problem. A combined spell correction method is proposed in this paper, which contains Levenshtein distance for comparation between misspelled words and correct spelled words in dictionary, improved Double Metaphone algorithm that includes vowel phoneme rule sets aimed at Chinese English learners, and global vectors(GloVe) for character representation that can generate vectors in order to obtain better suggestion lists for misspelled words. Result shows that the combined approach proposed in this paper is better than phonetic correction or edit distance method only, and a comparison with two generally implemented spell check tools is done for experiment which shows that this approach is better than them in correcting misspelled words, and the success rates of suggestion lists for spelling mistakes hit the spot.
Subject
General Physics and Astronomy
Reference12 articles.
1. A normalized Levenshtein distance metric;Yujian;IEEE transactions on pattern analysis and machine intelligence,2007
2. A technique for computer detection and correction of spelling errors;Damerau;Communications of the ACM,1964
3. Spell checking techniques in NLP: a survey;Gupta;International Journal of Advanced Research in Computer Science and Software Engineering,2012
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献