Author:
Toyota Tetsuya, ,Nobuhara Hajime,
Abstract
In this paper, we propose a system to visualize the relationships in huge quantities of Internet news by twodimensional self-organizing maps instead of the conventional methods of listing Internet news. In the proposed method, morphological analysis is conducted on the texts of Internet news to generate input vectors with elements of keywords. The characteristics specific to Internet news that many of the vector elements become sparse allows dimensional reductions as well as speeding up of self-organizing mapping with restricted search regions in learning. We verify through evaluation experiments with the data of 80 pieces of news that the proposed system can reduce computation time by 75% to 99% and can create more efficient SOM compared with the generally available SOM.
Publisher
Fuji Technology Press Ltd.
Subject
Artificial Intelligence,Computer Vision and Pattern Recognition,Human-Computer Interaction
Reference23 articles.
1. M. W. Berry and J. Kogan, “Text Mining: Applications and Theory,” Wiley, 2010.
2. T. Hashimoto, K. Murakami, K. Inui, K. Utsumi, and M. Ishikawa, “Topic Extraction and Social Problem Detection Based on Document Clustering,” Sociotechnica, Vol.5, pp. 216-226, 2008.
3. T. Iwata, T. Yamada, and N. Ueda, “Probabilistic Latent Semantic Visualization: Topic Model for Visualizing Documents,” Proc. of 14th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (KDD2008), pp. 363-371, 2008.
4. S. Roweis and L. Saul, “Nonlinear dimensionality reduction by locally linear embedding,” Science, Vol.290, No.5500, pp. 2323-2326, 2000.
5. M. Trampus andM. Grobelnik, “Visualization of Online Discussion Forums,” Workshop on Applications of Pattern Analysis, Windsor, UK, Vol.11, pp. 134-141, 2010.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献