Parallel Distance-Based Instance Selection Algorithm for Feed-Forward Neural Network

Published:2017-04-01 Issue:2 Volume:26 Page:335-358
ISSN:2191-026X
Container-title:Journal of Intelligent Systems

Author:

Fuangkhon Piyabute¹

Affiliation:

1. Department of Business Information Systems, Assumption University, Samut Prakan 10540, Kingdom of Thailand

Abstract

Instance selection decides which instances of a data set should be retained for use during the learning process. It can improve the generalization of the learning model, shorten the learning process, or allow scaling up to large data sources. This paper presents a parallel distance-based instance selection approach for a feed-forward neural network (FFNN) that can use all available processing power to reduce the data set while obtaining classification accuracy similar to that obtained with the original data set. The algorithm identifies the instances at the decision boundary between consecutive classes of data, which are essential for placing hyperplane decision surfaces, and retains these instances in the reduced data set (subset). Each identified instance, called a prototype, is one of the representatives of the decision boundary of its class that constitutes the shape or distribution model of the data set. No feature or dimension is sacrificed in the reduction process. Regarding reduction capability, the algorithm obtains approximately 85% reduction power on non-overlapping two-class synthetic data sets, 70% on highly overlapping two-class synthetic data sets, and 77% on multiclass real-world data sets. Regarding generalization, the reduced data sets yield classification accuracy similar to that of the original data set on both FFNN and support vector machine classifiers. Regarding execution time, the speedup of the parallel algorithm over the serial algorithm is proportional to the number of threads the processor can run concurrently.
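The abstract does not give the selection rule itself, so the following is only an illustrative sketch of one common way to pick decision-boundary instances with distances, not the paper's algorithm. It assumes Euclidean distance and a "nearest enemy" rule (keep every instance that is the closest opposite-class neighbor of some other instance). The function names `nearest_enemy`, `select_boundary_instances`, and `reduction_rate` are hypothetical, and the per-instance searches are spread over a thread pool only to show where the work can be parallelized.

```python
from concurrent.futures import ThreadPoolExecutor
import math


def nearest_enemy(i, X, y):
    """Return the index of the closest instance whose label differs from y[i]."""
    best, best_d = None, math.inf
    for j, (xj, yj) in enumerate(zip(X, y)):
        if yj == y[i]:
            continue  # same class: not an "enemy"
        d = math.dist(X[i], xj)  # Euclidean distance (an assumption)
        if d < best_d:
            best, best_d = j, d
    return best


def select_boundary_instances(X, y, workers=4):
    """Keep each instance that is the nearest enemy of at least one other instance.

    Every per-instance search is independent, so the searches are mapped
    over a pool of workers.
    """
    with ThreadPoolExecutor(max_workers=workers) as pool:
        enemies = list(pool.map(lambda i: nearest_enemy(i, X, y), range(len(X))))
    return sorted({j for j in enemies if j is not None})


def reduction_rate(n_original, n_kept):
    """Fraction of the original data set that was removed."""
    return 1.0 - n_kept / n_original


# Toy two-class data on a line: class 0 on the left, class 1 on the right.
X = [(0, 0), (1, 0), (2, 0), (3, 0), (7, 0), (8, 0), (9, 0), (10, 0)]
y = [0, 0, 0, 0, 1, 1, 1, 1]

kept = select_boundary_instances(X, y)
print(kept)                               # indices of the retained instances
print(reduction_rate(len(X), len(kept)))  # share of instances removed
```

In this toy case only the two instances facing each other across the class gap are retained. In CPython, threads do not speed up pure-Python distance loops because of the global interpreter lock; a process pool or a vectorized distance computation would be needed to observe the thread-proportional speedup the abstract reports.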

Publisher

Walter de Gruyter GmbH

Subject

Artificial Intelligence, Information Systems, Software

Link

https://www.degruyter.com/document/doi/10.1515/jisys-2015-0039/pdf

References: 48 articles.

1. LIBSVM: a library for support vector machines;ACM Trans. Intell. Syst. Technol.,2011

2. Fuzzy logic approaches to structure preserving dimensionality reduction;IEEE Trans. Fuzzy Syst.,2002

3. Training data reduction to speed up SVM training;Appl. Intell.,2014

4. Asymptotic properties of nearest neighbor rules using edited data;IEEE Trans. Syst. Man Cybern.,1972

Cited by 3 articles.

1. Boosting interclass boundary preservation (BIBP): a KD-tree enhanced data reduction algorithm;International Journal of Information Technology;2024-07-18

2. Effect of the distance functions on the distance-based instance selection for the feed-forward neural network;Evolutionary Intelligence;2021-04-30

3. Recent developments of artificial intelligence in drying of fresh food: A review;Critical Reviews in Food Science and Nutrition;2018-05-22