Affiliation:
1. Institute for Language and Speech Processing, Epidavrou & Artemidos 6, 151 25 Maroussi, Greece
Abstract
We report on the application of the Self-Organizing Map (SOM) classification method to the task of categorizing texts according to their register and the style of their author. The SOM has been selected as its performance in various data-mining applications has been found to be highly successful. Here, the method is evaluated against the task of clustering textual data which are corpora of texts written in the Greek language; the parameters used depict linguistically important structural properties of the texts. The experiments reported indicate that the SOM results are equivalent to those generated by statistical methods.
Publisher
World Scientific Pub Co Pte Lt
Subject
Computer Networks and Communications,General Medicine
Reference15 articles.
1. Corpus Linguistics
2. Stylistic Experiments in Information Retrieval
3. S. Kaski, Computing Science and Statistics 29, ed. D. W. Scott (Interface Foundation of North America, Inc., Fairfax Station, VA, 1998) pp. 281–290.
Cited by
8 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献