Abstract
Purpose
Data mining is the process of detecting knowledge from a given huge data set. Among the data set, multimedia is the data which contains diverse data such as audio, video, image, text and motion. In this growing field of video data, mining the video data plays vital role in the field of video data mining. In video data mining, video data are grouped into frames. In this vast amount of video frames, the fast retrieval of needed information is important one. This paper aims to propose a Birch-based clustering method for content-based image retrieval.
Design/methodology/approach
In image retrieval system, image segmentation plays a very important role. A text file, normally, is divided into sections, that is, piece, sentences, word and character for this information which are organized and indexed effectively like in a video, the information is dynamic in nature and this information is converted to static for easy retrieval. For this, video files are divided into a number of frames or segments. After the segmentation process, images are trained for retrieval process, and from these, unwanted images are removed from the data set. The noise or unwanted image removal pseudo-code is shown below. In the code image, pixel value represents the value of the difference between the two adjacent image pixel values. By assuming a threshold for the image value, the duplicate images are found. After finding the duplicate image, it is removed from the data set. Clustering is used in many applications as a stand-alone tool to get insight into data distribution and as a pre-processing step for other algorithms (Ester et al., 1996). Specifically, it is used in pattern recognition, spatial data analysis, image processing, economic science document classification, etc. Hierarchical clustering algorithms are classified as agglomerative or divisive. BRICH uses clustering attribute (CA) and clustering feature hierarchy (CA_Hierarchy) for the formation of clusters. It perform multidimensional data objects. Every BRICH algorithm based on the memory-oriented information, that is, memory constrains, is involved in the processing of the data sets. This information is represented in Figures 6-10. For forming clusters, they use the amount of object in the cluster (A), the sum of all points in the data set (S) and need the square value of the all objects (P).
Findings
The proposed technique brings an effective result for cluster formation.
Originality/value
BRICH uses a novel approach to model the degree of inter-connectivity and closeness between each pair of clusters that takes into account the internal characteristics of the clusters themselves.
Subject
Electrical and Electronic Engineering,Mechanical Engineering,Mechanics of Materials,Geotechnical Engineering and Engineering Geology,Civil and Structural Engineering
Reference11 articles.
1. A database interface for clustering in large spatial databases,1995
2. A density-based algorithm for discovering clusters in large spatial database with noise,1996
3. Effective multimedia content retrieval;International Journal of Applied Environmental Sciences,2015
4. Segment based indexing technique for data file;Procedia of Computer Science,2016
5. Video substance extraction using image future population based techniques;ARPN Journal of Engineering and Applied Sciences,2016
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献