Affiliation:
1. Computer Science, University of Windsor, 401 Sunset Ave, Windsor, ON N9B 3P4, Canada
Abstract
In this paper, we present a new approach to improve performance on tabular datasets by applying the lottery ticket hypothesis to tabular neural networks. Prior approaches had to train the original, large model in order to find lottery tickets. We eliminate the need to train the original model and discover lottery tickets using networks that are a fraction of its size. Moreover, we show that up to 95% of the training dataset can be removed during the search for lottery tickets while maintaining similar accuracy. The approach uses a genetic algorithm (GA) to train candidate pruned models, encoding the nodes of the original model for selection and measuring candidates by performance and weight metrics. We found that the search process does not require a large portion of the training data; once the final pruned model is selected, it can be retrained on the full dataset, although this is often not required. We propose a lottery sample hypothesis, analogous to the lottery ticket hypothesis, under which a subsample of the training set (the lottery samples) can train a model with performance equivalent to training on the original dataset. We show that finding lottery samples alongside lottery tickets can allow for faster searches and greater accuracy.
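As a rough illustration of the search described in the abstract, the following minimal sketch runs a genetic algorithm over boolean node masks of an untrained hidden layer and scores each mask on a 5% subsample of the training data. It is not the authors' implementation. The synthetic data, the closed-form ridge readout used as a cheap stand-in for training each candidate, the node-count penalty used in place of the paper's weight metrics, and all function names and hyperparameters are assumptions made for illustration.

```python
import numpy as np

def make_data(rng, n=2000, d=10):
    # Synthetic tabular data (hypothetical stand-in for a real dataset).
    X = rng.normal(size=(n, d))
    y = (X[:, 0] + 0.5 * X[:, 1] - X[:, 2] > 0).astype(float)
    return X, y

def fitness(mask, H, y, size_penalty=0.001):
    # Assumed fitness: fit a ridge readout on the kept hidden nodes only,
    # then reward accuracy and lightly penalize the number of kept nodes.
    if mask.sum() == 0:
        return 0.0
    Hk = H[:, mask]
    A = Hk.T @ Hk + 1e-3 * np.eye(Hk.shape[1])
    w = np.linalg.solve(A, Hk.T @ (2 * y - 1))
    acc = np.mean((Hk @ w > 0) == (y > 0.5))
    return acc - size_penalty * mask.sum()

def ga_search(H, y, rng, pop_size=20, generations=15, mutation_rate=0.05):
    n_nodes = H.shape[1]
    pop = rng.random((pop_size, n_nodes)) < 0.5      # random boolean node masks
    for _ in range(generations):
        scores = np.array([fitness(m, H, y) for m in pop])
        order = np.argsort(scores)[::-1]
        parents = pop[order[: pop_size // 2]]        # truncation selection
        children = []
        while len(children) < pop_size - len(parents):
            a, b = parents[rng.integers(len(parents), size=2)]
            cross = rng.random(n_nodes) < 0.5        # uniform crossover
            child = np.where(cross, a, b)
            flip = rng.random(n_nodes) < mutation_rate
            children.append(child ^ flip)            # bit-flip mutation
        pop = np.vstack([parents, np.array(children)])
    scores = np.array([fitness(m, H, y) for m in pop])
    return pop[np.argmax(scores)]

rng = np.random.default_rng(0)
X, y = make_data(rng)
W = rng.normal(size=(X.shape[1], 64))                # untrained hidden-layer weights
idx = rng.choice(len(X), size=len(X) // 20, replace=False)  # keep 5% of the samples
H_small = np.maximum(X[idx] @ W, 0)                  # ReLU activations on the subsample
best_mask = ga_search(H_small, y[idx], rng)
print(best_mask.sum(), "of", W.shape[1], "hidden nodes kept")
```

The mask returned by `ga_search` marks which hidden nodes survive pruning. In a fuller pipeline, the selected sub-network would then be trained normally, either on the subsample or on the full dataset, which is the optional retraining step mentioned above.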
Funder
Ontario Graduate Scholarship (OGS) and Natural Sciences and Engineering Research Council of Canada (NSERC)
Subject
General Economics, Econometrics and Finance