Abstract
In this book, the focus is on data mining with RapidMiner. However, it's important to note that there are other essential steps to consider when delving into the realm of data mining. This chapter serves as an introduction to the process of data pre-processing using RapidMiner, allowing readers to practice with a data set example available on the platform. With RapidMiner, data pre-processing begins with exploring the data visually and then selecting the features that will be analysed with each data mining technique. Managing missing values in a feature is also a crucial step in this process, which can be achieved by either eliminating or replacing them with appropriate values. In addition, RapidMiner allows data scientists to detect outliers and normalize features easily using diagram design, without requiring any computer programming skills. To help readers become familiar with the tools offered by RapidMiner, a classification technique will be demonstrated step-by-step in the book.
Reference14 articles.
1. A Novel Approach for Dealing with Missing Values in Machine Learning Datasets with Discrete Values
2. Classification Of a bank data set on various data mining platforms. 2018 Electric Electronics, Computer Science, Biomedical Engineerings’;M. S.Başarslan;Meeting,2018
3. Cai, S., Sun, R., Hao, S., Li, S., & Yuan, G. (2016). An efficient outlier detection approach on weighted data stream based on minimal rare pattern mining. China Communications, 16(10), 83-99.
4. Research and implementation of big data preprocessing system based on Hadoop
5. Efficiency comparison of Python and RapidMiner