Affiliation:
1. College of Computer Science and Engineering, Yulin Normal University, Yulin, Guangxi, P.R.China
Abstract
Set-valued data is a significant kind of data, such as data obtained from different search engines, market data, patients’ symptoms and behaviours. An information system (IS) based on incomplete set-valued data is called an incomplete set-valued information system (ISVIS), which generalized model of a single-valued incomplete information system. This paper gives feature selection for an ISVIS by means of uncertainty measurement. Firstly, the similarity degree between two information values on a given feature of an ISVIS is proposed. Then, the tolerance relation on the object set with respect to a given feature subset in an ISVIS is obtained. Next, λ-reduction in an ISVIS is presented. What’s more, connections between the proposed feature selection and uncertainty measurement are exhibited. Lastly, feature selection algorithms based on λ-discernibility matrix, λ-information granulation, λ-information entropy and λ-significance in an ISVIS are provided. In order to better prove the practical significance of the provided algorithms, a numerical experiment is carried out, and experiment results show the number of features and average size of features by each feature selection algorithm.
Subject
Artificial Intelligence,General Engineering,Statistics and Probability
Reference23 articles.
1. Sequential covering rule induction algorithm for variable consistency rough set approaches;Blaszczynski;Information Sciences,2011
2. Attribute selection with fuzzy decision reducts;Cornelis;Information Sciences,2010
3. Attribute reduction of set-valued information systems based on a tolerance relation;Chen;Computer Science,2010
4. Entropy measures and granularity measures for set-valued information systems;Dai;Information Sciences,2013
5. Fuzzy rough set model for set-valued data;Dai;Fuzzy Sets and Systems,2013