A critical review of multi-objective optimization in data mining

Published:2004-12 Issue:2 Volume:6 Page:77-86
ISSN:1931-0145
Container-title:ACM SIGKDD Explorations Newsletter
language:en
Short-container-title:SIGKDD Explor. Newsl.

Author:

Freitas Alex A.¹

Affiliation:

1. University of Kent, Canterbury, UK

Abstract

This paper addresses the problem of how to evaluate the quality of a model built from the data in a multi-objective optimization scenario, where two or more quality criteria must be simultaneously optimized. A typical example is a scenario where one wants to maximize both the accuracy and the simplicity of a classification model or a candidate attribute subset in attribute selection. One reviews three very different approaches to cope with this problem, namely: (a) transforming the original multi-objective problem into a single-objective problem by using a weighted formula; (b) the lexicographical approach, where the objectives are ranked in order of priority; and (c) the Pareto approach, which consists of finding as many non-dominated solutions as possible and returning the set of non-dominated solutions to the user. One also presents a critical review of the case for and against each of these approaches. The general conclusions are that the weighted formula approach -- which is by far the most used in the data mining literature -- is to a large extent an ad-hoc approach for multi-objective optimization, whereas the lexicographic and the Pareto approach are more principled approaches, and therefore deserve more attention from the data mining community.
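The three approaches named in the abstract can be contrasted on a toy example. The following is a minimal sketch, not the paper's own formulation: the candidate models, their scores, the weights, and the tolerance threshold `tol` are all hypothetical values chosen for illustration, and both criteria (accuracy and simplicity) are assumed to be scaled to [0, 1] and maximized.

```python
from typing import List, Tuple

# Hypothetical candidates: (name, accuracy, simplicity), both to be maximized.
Candidate = Tuple[str, float, float]

CANDIDATES: List[Candidate] = [
    ("A", 0.90, 0.20),
    ("B", 0.88, 0.60),
    ("C", 0.80, 0.90),
    ("D", 0.79, 0.50),
]


def weighted_best(cands: List[Candidate],
                  w_acc: float = 0.7, w_simp: float = 0.3) -> Candidate:
    """(a) Weighted formula: collapse both criteria into a single score.
    The weights are an assumption; a different choice can change the winner."""
    return max(cands, key=lambda c: w_acc * c[1] + w_simp * c[2])


def lexicographic_best(cands: List[Candidate], tol: float = 0.03) -> Candidate:
    """(b) Lexicographic: accuracy has priority; simplicity only breaks ties.
    Treating candidates within `tol` of the best accuracy as tied is an
    assumption of this sketch."""
    best_acc = max(c[1] for c in cands)
    tied = [c for c in cands if best_acc - c[1] <= tol]
    return max(tied, key=lambda c: c[2])


def dominates(a: Candidate, b: Candidate) -> bool:
    """a dominates b if a is no worse on both criteria and better on one."""
    return a[1] >= b[1] and a[2] >= b[2] and (a[1] > b[1] or a[2] > b[2])


def pareto_front(cands: List[Candidate]) -> List[Candidate]:
    """(c) Pareto: return every candidate that no other candidate dominates."""
    return [c for c in cands if not any(dominates(o, c) for o in cands)]


if __name__ == "__main__":
    print("weighted (0.7/0.3):", weighted_best(CANDIDATES)[0])
    print("weighted (0.9/0.1):", weighted_best(CANDIDATES, 0.9, 0.1)[0])
    print("lexicographic:", lexicographic_best(CANDIDATES)[0])
    print("Pareto front:", [c[0] for c in pareto_front(CANDIDATES)])
```

On this toy data the weighted formula returns a different model depending on the weights, which illustrates the ad-hoc character the abstract attributes to that approach. The Pareto approach returns the whole non-dominated set and leaves the final choice to the user.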

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/1046456.1046467

References: 31 articles.

1. Evolutionary algorithms in data mining

2. The use of the area under the ROC curve in the evaluation of machine learning algorithms

3. I. Bruha and J. Tkadlec. Rule quality for multiple-rule classifier: empirical expertise and theoretical methodology. Intelligent Data Analysis 7(2), 2003, 99-124.

Cited by 104 articles.

1. Multi-objective Ant Colony Optimization: Review;Archives of Computational Methods in Engineering;2024-09-10

2. Handling Varied Objectives by Online Decision Making;Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2024-08-24

3. The Case for Hybrid Multi-Objective Optimisation in High-Stakes Machine Learning Applications;ACM SIGKDD Explorations Newsletter;2024-07-24

4. An Experimental Analysis on Automated Machine Learning for Software Defect Prediction;2024 IEEE Congress on Evolutionary Computation (CEC);2024-06-30

5. A lexicographic optimisation approach to promote more recent features on longitudinal decision-tree-based classifiers: applications to the English Longitudinal Study of Ageing;Artificial Intelligence Review;2024-03-09