Traceable Group-Wise Self-Optimizing Feature Transformation Learning: A Dual Optimization Perspective

Authors:

Meng Xiao1, Dongjie Wang2, Min Wu3, Kunpeng Liu4, Hui Xiong5, Yuanchun Zhou6, Yanjie Fu7

Affiliations:

1. 1. Computer Network Information Center, Chinese Academy of Sciences, Beijing, China; 2. University of Chinese Academy of Sciences, Beijing, China

2. Department of Computer Science, University of Central Florida, Orlando, USA

3. Institute for Infocomm Research, Agency for Science, Technology and Research (A*STAR), Singapore

4. Department of Computer Science, Portland State University, USA

5. 1. The Hong Kong University of Science and Technology (Guangzhou), China; 2. Guangzhou HKUST Fok Ying Tung Research Institute, China

6. 1. Computer Network Information Center, Chinese Academy of Sciences, Beijing, China; 2. University of Chinese Academy of Sciences, Beijing, China

7. School of Computing and AI, Arizona State University, Tempe, USA

Abstract

Feature transformation aims to reconstruct an effective representation space by mathematically refining the existing features. It serves as a pivotal approach to combat the curse of dimensionality, enhance model generalization, mitigate data sparsity, and extend the applicability of classical models. Existing research predominantly focuses on domain knowledge-based feature engineering or learning latent representations. However, these methods, while insightful, lack full automation and fail to yield a traceable and optimal representation space. An essential question arises: can we concurrently address these limitations when reconstructing a feature space for a machine learning task? Our initial work took a pioneering step towards this challenge by introducing a novel self-optimizing framework, which leverages three cascading reinforced agents to automatically select candidate features and operations for generating improved feature transformation combinations. Despite the impressive strides made, there was room for enhancing its effectiveness and generalization capability. In this extended journal version, we advance our initial work from two distinct yet interconnected perspectives: 1) we refine the original framework by integrating a graph-based state representation method to capture feature interactions more effectively and by developing different Q-learning strategies to further alleviate Q-value overestimation; 2) we employ a new optimization technique (actor-critic) to train the entire self-optimizing framework, accelerating model convergence and improving feature transformation performance. Finally, to validate the improved effectiveness and generalization capability of our framework, we perform extensive experiments and conduct comprehensive analyses. These provide empirical evidence of the strides made in this journal version over the initial work, solidifying our framework's standing as a substantial contribution to the field of automated feature transformation. To improve reproducibility, we have released the associated code and data via the following GitHub link: https://github.com/coco11563/TKDD2023_code.
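The abstract mentions two technical ingredients: a cascade of reinforced agents that successively pick a head feature, an operation, and a tail feature, and Q-learning variants that curb Q-value overestimation. The sketch below is a minimal, self-contained Python illustration of those two ideas under stated assumptions; it is not the authors' released implementation, and names such as `q_head`, `q_op`, `q_tail`, `select`, and `double_q_target` are illustrative placeholders only.

```python
# Minimal sketch (not the released code) of one cascading selection step and a
# double-Q target used to mitigate Q-value overestimation.
import numpy as np

rng = np.random.default_rng(0)
n_features, n_operations, state_dim = 8, 4, 16

def make_q(n_actions):
    """Two linear Q-approximators (online / target), as in double Q-learning."""
    return {"online": rng.normal(size=(state_dim, n_actions)),
            "target": rng.normal(size=(state_dim, n_actions))}

q_head = make_q(n_features)    # agent 1: picks the head feature
q_op = make_q(n_operations)    # agent 2: picks the operation
q_tail = make_q(n_features)    # agent 3: picks the tail feature

def q_values(q, state, net="online"):
    return state @ q[net]

def select(q, state, epsilon=0.1):
    """Epsilon-greedy action selection from the online network."""
    if rng.random() < epsilon:
        return int(rng.integers(q["online"].shape[1]))
    return int(np.argmax(q_values(q, state)))

def double_q_target(q, reward, next_state, gamma=0.99):
    """Double-Q target: the online net chooses the action, the target net scores
    it, which reduces the overestimation bias of plain max-based targets."""
    a_star = int(np.argmax(q_values(q, next_state, "online")))
    return reward + gamma * q_values(q, next_state, "target")[a_star]

state = rng.normal(size=state_dim)

# One cascading transformation step: head feature -> operation -> tail feature.
head, op, tail = select(q_head, state), select(q_op, state), select(q_tail, state)
print(f"proposed transformation: f{head} {['+', '-', '*', '/'][op]} f{tail}")

# Example target for the head-feature agent after observing a downstream reward.
next_state = rng.normal(size=state_dim)
print("double-Q target:", double_q_target(q_head, reward=0.3, next_state=next_state))
```

In the same spirit, the actor-critic variant described in the abstract would replace the epsilon-greedy argmax with a learned policy and use the critic's value estimate in place of the max-based target; the specifics of the authors' design should be taken from the linked repository.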

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Cited by 1 article.

1. Resolving the Imbalance Issue in Hierarchical Disciplinary Topic Inference via LLM-based Data Augmentation. 2023 IEEE International Conference on Data Mining Workshops (ICDMW), 2023-12-04.
