WordICA—emergence of linguistic representations for words by independent component analysis-Reference-Cited by-同舟云学术

WordICA—emergence of linguistic representations for words by independent component analysis

Published:2010-06-15 Issue:3 Volume:16 Page:277-308
ISSN:1351-3249
Container-title:Natural Language Engineering
language:en
Short-container-title:Nat. Lang. Eng.

Author:

HONKELA TIMO,HYVÄRINEN AAPO,VÄYRYNEN JAAKKO J.

Abstract

AbstractWe explore the use of independent component analysis (ICA) for the automatic extraction of linguistic roles or features of words. The extraction is based on the unsupervised analysis of text corpora. We contrast ICA with singular value decomposition (SVD), widely used in statistical text analysis, in general, and specifically in latent semantic analysis (LSA). However, the representations found using the SVD analysis cannot easily be interpreted by humans. In contrast, ICA applied on word context data gives distinct features which reflect linguistic categories. In this paper, we provide justification for our approach called WordICA, present the WordICA method in detail, compare the obtained results with traditional linguistic categories and with the results achieved using an SVD-based method, and discuss the use of the method in practical natural language engineering solutions such as machine translation systems. As the WordICA method is based on unsupervised learning and thus provides a general means for efficient knowledge acquisition, we foresee that the approach has a clear potential for practical applications.

Publisher

Cambridge University Press (CUP)

Subject

Artificial Intelligence,Linguistics and Language,Language and Linguistics,Software

Reference70 articles.

1. Further meta-evaluation of machine translation

2. Independent component analysis of fMRI data: Examining the assumptions

3. Unsupervised Learning

Cited by 12 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A multi-aspect approach to ontology matching based on Bayesian cluster ensembles;Journal of Intelligent Information Systems;2019-11-23

2. Social event decomposition for constructing knowledge graph;Future Generation Computer Systems;2019-11

3. Ontology Matching based on Multi-Aspect Consensus Clustering of Communities;Proceedings of the 18th International Conference on Enterprise Information Systems;2016

4. Exploratory analysis of semantic categories: comparing data-driven and human similarity judgments;Computational Cognitive Science;2015-07-07

5. Quantifying the Effect of Meaning Variation in Survey Analysis;Artificial Neural Networks and Machine Learning – ICANN 2014;2014