On Subsampling Procedures for Support Vector Machines-Reference-Cited by-同舟云学术

On Subsampling Procedures for Support Vector Machines

Published:2022-10-13 Issue:20 Volume:10 Page:3776
ISSN:2227-7390
Container-title:Mathematics
language:en
Short-container-title:Mathematics

Author:

Bárcenas Roberto^ORCID,Gonzalez-Lima Maria,Ortega Joaquin,Quiroz Adolfo

Abstract

Herein, theoretical results are presented to provide insights into the effectiveness of subsampling methods in reducing the amount of instances required in the training stage when applying support vector machines (SVMs) for classification in big data scenarios. Our main theorem states that under some conditions, there exists, with high probability, a feasible solution to the SVM problem for a randomly chosen training subsample, with the corresponding classifier as close as desired (in terms of classification error) to the classifier obtained from training with the complete dataset. The main theorem also reflects the curse of dimensionalityin that the assumptions made for the results are much more restrictive in large dimensions; thus, subsampling methods will perform better in lower dimensions. Additionally, we propose an importance sampling and bagging subsampling method that expands the nearest-neighbors ideas presented in previous work. Using different benchmark examples, the method proposed herein presents a faster solution to the SVM problem (without significant loss in accuracy) compared with the available state-of-the-art techniques.

Publisher

MDPI AG

Subject

General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Link

https://www.mdpi.com/2227-7390/10/20/3776/pdf

Reference40 articles.

1. A Training Algorithm for Optimal Margin Classifiers;Boser;Proceedings of the COLT’92 Proceedings of the Fifth annual Workshop on Computational Learning Theory,1992

2. Support-vector networks

3. Support Vector Machines and Other Kernel-Based Learning Methods;Cristianini,2000

4. Nearest neighbors methods for support vector machines

5. LIBSVM

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Graph Classification Method Based on Support Vector Machines and Locality-Sensitive Hashing;IEEE Access;2024

2. Predicting reference evapotranspiration in semi-arid-region by regression- based machine learning methods using limited climatic inputs;2023-03-09