Incremental Learning for Classification of Unstructured Data Using Extreme Learning Machine-Reference-Cited by-同舟云学术

Incremental Learning for Classification of Unstructured Data Using Extreme Learning Machine

Published:2018-10-17 Issue:10 Volume:11 Page:158
ISSN:1999-4893
Container-title:Algorithms
language:en
Short-container-title:Algorithms

Author:

Madhusudhanan Sathya,Jaganathan Suresh^ORCID,L S Jayashree

Abstract

Unstructured data are irregular information with no predefined data model. Streaming data which constantly arrives over time is unstructured, and classifying these data is a tedious task as they lack class labels and get accumulated over time. As the data keeps growing, it becomes difficult to train and create a model from scratch each time. Incremental learning, a self-adaptive algorithm uses the previously learned model information, then learns and accommodates new information from the newly arrived data providing a new model, which avoids the retraining. The incrementally learned knowledge helps to classify the unstructured data. In this paper, we propose a framework CUIL (Classification of Unstructured data using Incremental Learning) which clusters the metadata, assigns a label for each cluster and then creates a model using Extreme Learning Machine (ELM), a feed-forward neural network, incrementally for each batch of data arrived. The proposed framework trains the batches separately, reducing the memory resources, training time significantly and is tested with metadata created for the standard image datasets like MNIST, STL-10, CIFAR-10, Caltech101, and Caltech256. Based on the tabulated results, our proposed work proves to show greater accuracy and efficiency.

Publisher

MDPI AG

Subject

Computational Mathematics,Computational Theory and Mathematics,Numerical Analysis,Theoretical Computer Science

Link

http://www.mdpi.com/1999-4893/11/10/158/pdf

Reference32 articles.

1. Co-trained support vector machines for large scale unstructured document classification using unlabeled data and syntactic information

2. Using Support Vector Machine to identify imaging biomarkers of neurological and psychiatric disease: A critical review

3. Multi-label maximum entropy model for social emotion classification over short text

4. A Comprehensive Survey of Clustering Algorithms

5. Effect of different distance measures on the performance of k-means algorithm: An experimental study in matlab;Bora;Int. J. Comput. Sci. Inf. Technol.,2014

Cited by 16 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Experimental study of rehearsal-based incremental classification of document streams;International Journal on Document Analysis and Recognition (IJDAR);2024-05-11

2. Imbalanced data classification using improved synthetic minority over-sampling technique;Multiagent and Grid Systems;2023-10-06

3. Kevin Kwan’s Crazy Rich Asians: Opinion Mining and Emotion Detection on Fans’ Comments on Social Media;Advances on Intelligent Computing and Data Science;2023

4. Data Integration from Heterogeneous Control Levels for the Purposes of Analysis within Industry 4.0 Concept;Sensors;2022-12-15

5. ITL-IDS: Incremental Transfer Learning for Intrusion Detection Systems;Knowledge-Based Systems;2022-10