Open AccessThis article is
- freely available
Seeding and Harvest: A Framework for Unsupervised Feature Selection Problems
School of Electronic and Information Engineering, Xi’an Jiaotong University, No.28, Xianning West Road, Xi’an 710049, China
* Author to whom correspondence should be addressed.
Received: 30 October 2012; in revised form: 11 December 2012 / Accepted: 24 December 2012 / Published: 27 December 2012
Abstract: Feature selection, also known as attribute selection, is the technique of selecting a subset of relevant features for building robust object models. It is becoming more and more important for large-scale sensors applications with AI capabilities. The core idea of this paper is derived from a straightforward and intuitive principle saying that, if a feature subset (pattern) has more representativeness, it should be more self-organized, and as a result it should be more insensitive to artificially seeded noise points. In the light of this heuristic finding, we established the whole set of theoretical principles, based on which we proposed a two-stage framework to evaluate the relative importance of feature subsets, called seeding and harvest (S&H for short). At the first stage, we inject a number of artificial noise points into the original dataset; then at the second stage, we resort to an outlier detector to identify them under various feature patterns. The more precisely the seeded points can be extracted under a particular feature pattern, the more valuable and important the corresponding feature pattern should be. Besides, we compared our method with several state-of-the-art feature selection methods on a number of real-life datasets. The experiment results significantly confirm that our method can accomplish feature reduction tasks with high accuracy as well as low computing complexity.
Keywords: feature selection; seeding and harvest; noise injection
Citations to this Article
Cite This Article
MDPI and ACS Style
Chen, G.; Cai, Y.; Shi, J. Seeding and Harvest: A Framework for Unsupervised Feature Selection Problems. Sensors 2013, 13, 292-333.
Chen G, Cai Y, Shi J. Seeding and Harvest: A Framework for Unsupervised Feature Selection Problems. Sensors. 2013; 13(1):292-333.
Chen, Gang; Cai, Yuanli; Shi, Juan. 2013. "Seeding and Harvest: A Framework for Unsupervised Feature Selection Problems." Sensors 13, no. 1: 292-333.