Abstract
Tabular data is commonly used in business and literature and can be analyzed using tree-based Machine Learning (ML) algorithms to extract meaningful information. Deep Learning (DL) excels in data such as image, sound, and text, but it is less frequently utilized with tabular data. However, it is possible to use tools to convert tabular data into images for use with Convolutional Neural Networks (CNNs) which are powerful DL models for image classification. The goal of this work is to compare the performance of converters for tabular data into images, select the best one, optimize a CNN using random search, and compare it with an optimized ML algorithm, the XGBoost. Results show that even a basic CNN, with only 1 convolutional layer, can reach comparable metrics to the XGBoost, which was trained on the original tabular data and optimized with grid search and feature selection. However, further optimization of the CNN with random search did not significantly improve its performance.
Publisher
Public Library of Science (PLoS)
Reference42 articles.
1. Big data in healthcare: management, analysis and future prospects;S Dash;Journal of Big Data,2019
2. Deep neural networks and tabular data: A survey;V Borisov;IEEE Transactions on Neural Networks and Learning Systems,2022
3. Is Deep Learning on Tabular Data Enough? An Assessment;SA Fayaz;International Journal of Advanced Computer Science and Applications,2022
4. Artificial intelligence in medicine;P Hamet;Metabolism,2017
5. Machine learning and artificial intelligence in research and healthcare;L Rubinger;Injury,2022
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献