Does Deep Learning Work Well for Categorical Datasets with Mainly Nominal Attributes?-Reference-Cited by-同舟云学术

Does Deep Learning Work Well for Categorical Datasets with Mainly Nominal Attributes?

Published:2020-11-21 Issue:11 Volume:9 Page:1966
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Hayashi Yoichi

Abstract

Given the complexity of real-world datasets, it is difficult to present data structures using existing deep learning (DL) models. Most research to date has concentrated on datasets with only one type of attribute: categorical or numerical. Categorical data are common in datasets such as the German (-categorical) credit scoring dataset, which contains numerical, ordinal, and nominal attributes. The heterogeneous structure of this dataset makes very high accuracy difficult to achieve. DL-based methods have achieved high accuracy (99.68%) for the Wisconsin Breast Cancer Dataset, whereas DL-inspired methods have achieved high accuracy (97.39%) for the Australian credit dataset. However, to our knowledge, no such method has been proposed to classify the German credit dataset. This study aimed to provide new insights into the reasons why DL-based and DL-inspired classifiers do not work well for categorical datasets, mainly consisting of nominal attributes. We also discuss the problems associated with using nominal attributes to design high-performance classifiers. Considering the expanded utility of DL, this study's findings should aid in the development of a new type of DL that can handle categorical datasets consisting of mainly nominal attributes, which are commonly used in risk evaluation, finance, banking, and marketing.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/9/11/1966/pdf

Reference46 articles.

1. Handwritten digit recognition with a back-propagation network;LeCun,1989

2. Backpropagation Applied to Handwritten Zip Code Recognition

3. Deep learning

4. Stacked generalization

5. The Existence of A Priori Distinctions Between Learning Algorithms

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Why Do Tree Ensemble Approximators Not Outperform the Recursive-Rule eXtraction Algorithm?;Machine Learning and Knowledge Extraction;2024-03-16

2. Malware Prediction Using Tabular Deep Learning Models;Advances in Intelligent Systems and Computing;2024

3. Research on SPDTRS-PNN based intelligent assistant diagnosis for breast cancer;Scientific Reports;2023-03-16

4. Deep learning models for improved reliability of tree aboveground biomass prediction in the tropical evergreen broadleaf forests;Forest Ecology and Management;2022-03