Affiliation:
1. Ph.D. Student, School of Science, Edith Cowan University
2. Ph.D., Lecturer, School of Business and Law, Edith Cowan University
Abstract
The term “big data” characterizes the massive amounts of data generation by the advanced technologies in different domains using 4Vs – volume, velocity, variety, and veracity - to indicate the amount of data that can only be processed via computationally intensive analysis, the speed of their creation, the different types of data, and their accuracy. High-dimensional financial data, such as time-series and space-time data, contain a large number of features (variables) while having a small number of samples, which are used to measure various real-time business situations for financial organizations. Such datasets are normally noisy, and complex correlations may exist between their features, and many domains, including financial, lack the al analytic tools to mine the data for knowledge discovery because of the high-dimensionality. Feature selection is an optimization problem to find a minimal subset of relevant features that maximizes the classification accuracy and reduces the computations. Traditional statistical-based feature selection approaches are not adequate to deal with the curse of dimensionality associated with big data. Cooperative co-evolution, a meta-heuristic algorithm and a divide-and-conquer approach, decomposes high-dimensional problems into smaller sub-problems. Further, MapReduce, a programming model, offers a ready-to-use distributed, scalable, and fault-tolerant infrastructure for parallelizing the developed algorithm. This article presents a knowledge management overview of evolutionary feature selection approaches, state-of-the-art cooperative co-evolution and MapReduce-based feature selection techniques, and future research directions.
Publisher
LLC CPC Business Perspectives
Subject
Strategy and Management,Business and International Management,General Business, Management and Accounting,Information Systems and Management,Law,Sociology and Political Science,Public Administration
Reference143 articles.
1. Text feature selection using ant colony optimization
2. Ahmad, S. S. S., & Pedrycz, W. (2011). Feature and Instance Selection Via Cooperative PSO. 2011 IEEE International Conference on Systems, Man, and Cybernetics (SMC), 2127-2132.
3. Ahmed, S., Zhang, M. J., & Peng, L. F. (2013). Enhanced Feature Selection for Biomarker Discovery in LC-MS Data using GP. 2013 Ieee Congress on Evolutionary Computation (Cec), 584-591.
4. Ali, M. M., Rattanawiboonsom, V., Hassan, F., & Nedelea, A. M. (2019). Knowledge Management at Higher Educational Institutes in Bangladesh: The case study of self-assessed processes of two educational Institutions. Ecoforum Journal, 8(1). - http://www.ecoforumjournal.ro/index.php/eco/article/view/901/572
5. Challenges in the Analysis of Mass-Throughput Data: A Technical Commentary from the Statistical Machine Learning Perspective
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献