Iteratively Divide-and-Conquer Learning for Nonlinear Classification and Ranking

Published:2018-03-31 Issue:2 Volume:9 Page:1-26
ISSN:2157-6904
Container-title:ACM Transactions on Intelligent Systems and Technology
language:en
Short-container-title:ACM Trans. Intell. Syst. Technol.

Author:

Wu Ou¹, Mao Xue², Hu Weiming²

Affiliation:

1. Center for Applied Mathematics, Tianjin University, China

2. NLPR, Institute of Automation, Chinese Academy of Sciences

Abstract

Nonlinear classifiers (e.g., kernel support vector machines (SVMs)) are effective for nonlinear data classification, but they are usually prohibitively expensive on large nonlinear datasets. Ensembles of linear classifiers have been proposed to address this inefficiency, a setting known as the ensemble linear classifiers for nonlinear data problem. This article introduces a new iterative learning approach with two steps at each iteration: partitioning the data into clusters according to Gaussian mixture models with local consistency, and then training a basic classifier (i.e., a linear SVM) for each cluster. The two divide-and-conquer steps are combined into a graphical model. During training, each classifier is regarded as a task, and clustered multitask learning is employed to capture the relatedness among tasks and to avoid overfitting in each task. In addition, two novel extensions of the proposed approach are introduced. First, the approach is extended to quality-aware web data classification. In this problem, web data vary in information quality, and ignoring these variations leads to poor classification models; the proposed approach can effectively integrate quality-aware factors into web data classification. Second, the approach is extended to listwise learning to rank, where it constructs an ensemble of linear ranking models, whereas most existing listwise ranking methods construct only a single linear ranking model. Experimental results on benchmark datasets show that the approach outperforms state-of-the-art algorithms. For nonlinear classification, it also achieves classification performance comparable to that of kernel SVMs, with much higher efficiency at prediction time.
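The divide-and-conquer idea in the abstract (partition the data, then train one linear classifier per cluster) can be illustrated with a minimal sketch. This is not the authors' method. It assumes plain k-means as a stand-in for the Gaussian mixture model with local consistency, and a fixed-step stochastic subgradient solver on the hinge loss as a stand-in for a linear SVM solver. It runs the two steps only once and omits the iterative joint refinement and the clustered multitask learning. All function names and the synthetic XOR-style data are hypothetical.

```python
import random

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def sq_dist(u, v):
    return sum((a - b) ** 2 for a, b in zip(u, v))

def nearest(x, centers):
    # Index of the closest cluster center.
    return min(range(len(centers)), key=lambda j: sq_dist(x, centers[j]))

def kmeans(X, k, iters=20):
    # Divide step (assumption: k-means replaces the paper's GMM).
    # Farthest-point initialisation keeps the sketch deterministic.
    centers = [X[0]]
    while len(centers) < k:
        centers.append(max(X, key=lambda x: min(sq_dist(x, c) for c in centers)))
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for x in X:
            groups[nearest(x, centers)].append(x)
        # An empty cluster keeps its previous center.
        centers = [[sum(col) / len(g) for col in zip(*g)] if g else c
                   for g, c in zip(groups, centers)]
    return centers

def train_linear_svm(X, y, lam=0.01, eta=0.01, epochs=50, seed=0):
    # Conquer step: stochastic subgradient descent on the
    # L2-regularised hinge loss with a fixed step size.
    rng = random.Random(seed)
    w, b = [0.0] * len(X[0]), 0.0
    order = list(range(len(X)))
    for _ in range(epochs):
        rng.shuffle(order)
        for i in order:
            margin = y[i] * (dot(w, X[i]) + b)
            w = [(1 - eta * lam) * wj for wj in w]
            if margin < 1:
                w = [wj + eta * y[i] * xj for wj, xj in zip(w, X[i])]
                b += eta * y[i]
    return w, b

def fit_ensemble(X, y, k):
    centers = kmeans(X, k)
    assign = [nearest(x, centers) for x in X]
    models = []
    for j in range(k):
        idx = [i for i, a in enumerate(assign) if a == j]
        if idx:
            models.append(train_linear_svm([X[i] for i in idx], [y[i] for i in idx]))
        else:
            models.append(([0.0] * len(X[0]), 0.0))  # placeholder for an empty cluster
    return centers, models

def predict(x, centers, models):
    # Route the point to its cluster, then apply that cluster's linear model.
    w, b = models[nearest(x, centers)]
    return 1 if dot(w, x) + b >= 0 else -1

def accuracy(X, y, centers, models):
    return sum(predict(x, centers, models) == t for x, t in zip(X, y)) / len(X)

def make_xor(n_per_blob, seed):
    # Four Gaussian blobs with XOR labels: not linearly separable as a whole.
    rng = random.Random(seed)
    X, y = [], []
    for cx, cy in [(2, 2), (-2, -2), (2, -2), (-2, 2)]:
        for _ in range(n_per_blob):
            X.append([rng.gauss(cx, 0.4), rng.gauss(cy, 0.4)])
            y.append(1 if cx * cy > 0 else -1)
    return X, y

X_train, y_train = make_xor(50, seed=0)
X_test, y_test = make_xor(50, seed=1)
local_acc = accuracy(X_test, y_test, *fit_ensemble(X_train, y_train, k=4))
global_acc = accuracy(X_test, y_test, *fit_ensemble(X_train, y_train, k=1))
print("ensemble of 4 linear models:", local_acc)
print("single linear model:", global_acc)
```

Prediction costs one nearest-center lookup plus one dot product, which is the source of the efficiency advantage over kernel SVMs that the abstract mentions. Setting `k=1` reduces the sketch to a single linear model, which cannot separate the XOR-style data.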

Funder

NSFC

Publisher

Association for Computing Machinery (ACM)

Subject

Artificial Intelligence, Theoretical Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3122802

References: 48 articles.

1. K. Bache and M. Lichman. 2013. UCI Machine Learning Repository. Retrieved September 8, 2017 from http://archive.ics.uci.edu/ml.

2. An Adaptive SVM Nearest Neighbor Classifier for Remotely Sensed Imagery

3. Generalized SMO Algorithm for SVM-Based Multitask Learning

4. Learning to rank

Cited by 2 articles.

1. A Survey of incremental high-utility pattern mining based on storage structure;Journal of Intelligent & Fuzzy Systems;2021-08-11

2. Stability-Based Generalization Analysis of Distributed Learning Algorithms for Big Data;IEEE Transactions on Neural Networks and Learning Systems;2020-03