Author:
,Naya Daichi,Yoshimi Takehiko,
Abstract
This study proposes the following three improvements to address the shortfalls of existing methods for emotion classification based on similarity measures between Japanese sentences. (1) The proposed method utilizes, as a similarity measure, an automatic evaluation metric for machine translation based on correlation considering global word order. Similarity measures used in existing methods are too strict or lax in their consideration of the order of words during the processing of Japanese sentences. They fail to incorporate the fact that Japanese exhibits a higher degree of freedom in word order compared to English. (2) Rather than word-level alignment, the proposed method aligns two sentences at the level of a base phrase constituted by one or more content words followed by zero or more function words. Word-level alignment, used by existing methods, may cause inappropriate matching, because it independently processes content words and function words which constitute a base phrase. (3) The proposed method integrates, using linear interpolation, similarity scores by the alignment of word-level and those of base phrase-level. Integration can avoid data sparsity, namely a problem that a similarity score between two sentences becomes zero, occurring when base phrase-level alignment is made. The experimental results found that (1) the proposed method demonstrated significantly higher classification accuracy than existing alternatives; (2) the base phrase-level alignment was effective because the alignment exhibited higher classification accuracy than the word-level alignment; and (3) integration of the word-level and base phrase-level alignments was valid because the integrated method outperformed the base phrase-level alignment. Key Words: Emotion classification, Similarity measure, Japanese, Word order, Base phrase, Linear interpolation
Publisher
International Information Institute
Reference20 articles.
1. [1] 松本和幸,三品賢一,任福継,黒岩眞吾:"感情生起事象文型パターンに基づいた会話文からの感情推定手法",自然言語処理,Vol.14, No.3, pp.239-271 (2007)
2. [2] 土屋誠司,鈴木基之,芋野美紗子,吉村枝里子,渡部広一:"口語表現に対応した知識ベースと連想メカニズムによる感情判断手法",人工知能学会論文,Vol.29, No.1, pp.11-20 (2014)
3. [3] 石渡太智,安田有希,宮崎太郎,後藤淳:"発話順序に基づくGraph Attention Networksを用いた対話文における感情認識",自然言語処理,Vol.28, No.4, pp.1141-1161 (2021)
4. [4] 三品賢一,土屋誠司,鈴木基之,任福継:"コーパスごとの類似度を考慮した用例に基づく感情推定手法の改善",自然言語処理,Vol.17, No.4, pp.91-110 (2010)
5. [5] 徳久良子,乾健太郎,松本裕治:"Webから獲得した感情生起要因コーパスに基づく感情推定",情報処理学会論文誌,Vol.50, No.4, pp.1365-1374 (2009)