Affiliation:
1. University of Kassel, Germany
Abstract
Data Mining provides approaches for the identification and discovery of non-trivial patterns and models hidden in large collections of data. In the applied natural language processing domain, data mining usually requires preprocessed data that has been extracted from textual documents. Additionally, this data is often integrated with other data sources. This chapter provides an overview on data mining focusing on approaches for pattern mining, cluster analysis, and predictive model construction. For those, we discuss exemplary techniques that are especially useful in the applied natural language processing context. Additionally, we describe how the presented data mining approaches are connected to text mining, text classification, and clustering, and discuss interesting problems and future research directions.
Reference25 articles.
1. Summarization from medical documents: a survey
2. Data mining with decision trees and decision rules.;C.Apte;Computer Systems,1997
3. Atzmueller, M., Kluegl, P., & Puppe, F. (2008). Rule-based information extraction for structured data acquisition using TextMarker. In Proceedings LWA-2008, Special Track on Knowledge Discovery and Machine Learning. Wuerzburg, Germany: University of Wuerzburg.
4. Atzmueller, M., & Puppe, F. (2006). SD-map - A fast algorithm for exhaustive subgroup discovery. In Proceedings of the 10th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD 2006), (pp. 6-17). Berlin, Germany: Springer.
5. Adaptive Control Processes
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Big Data Comes to School;AERA Open;2016-04-01