Affiliation:
1. The College of Liberal Arts and Science, National University of Defense Technology, Hunan, China
2. The Institute of Big Data Science and Industry, Shanxi University, Shanxi, China
Abstract
Recently, learning and mining from data streams with incremental feature spaces have attracted extensive attention, where data may dynamically expand over time in both volume and feature dimensions. Existing approaches usually assume that the incoming instances can always receive true labels. However, in many real-world applications, e.g., environment monitoring, acquiring the true labels is costly due to the need of human effort in annotating the data. To tackle this problem, we propose a novel incremental Feature spaces Learning with Label Scarcity (FLLS) algorithm, together with its two variants. When data streams arrive with augmented features, we first leverage the margin-based online active learning to select valuable instances to be labeled and thus build superior predictive models with minimal supervision. After receiving the labels, we combine the online passive-aggressive update rule and margin-maximum principle to jointly update the dynamic classifier in the shared and augmented feature space. Finally, we use the projected truncation technique to build a sparse but efficient model. We theoretically analyze the error bounds of FLLS and its two variants. Also, we conduct experiments on synthetic data and real-world applications to further validate the effectiveness of our proposed algorithms.
Funder
Key NSF of China
NSF of China
NSF for Distinguished Young Scholars of Hunan Province
Publisher
Association for Computing Machinery (ACM)
Reference36 articles.
1. Zhenyu Zhang, Peng Zhao, Yuan Jiang, and Zhi-Hua Zhou. 2020. Learning with feature and distribution evolvable streams. In Proceedings of the 37th International Conference on Machine Learning, Vol. 119, PMLR, 11317–11327.
2. Online Learning from Trapezoidal Data Streams
3. Bo-Jian Hou, Lijun Zhang, and Zhi-Hua Zhou. 2017. Learning with feature evolvable streams. In Proceedings of the 31st International Conference on Neural Information Processing Systems. 1417–1427.
4. One-Pass Learning with Incremental and Decremental Features
5. Online Feature Selection with Capricious Streaming Features: A General Framework
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献