Adaptive Sparse Representation of Continuous Input for Tsetlin Machines Based on Stochastic Searching on the Line-Reference-Cited by-同舟云学术

Adaptive Sparse Representation of Continuous Input for Tsetlin Machines Based on Stochastic Searching on the Line

Published:2021-08-30 Issue:17 Volume:10 Page:2107
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Abeyrathna Kuruge Darshana,Granmo Ole-Christoffer,Goodwin Morten

Abstract

This paper introduces a novel approach to representing continuous inputs in Tsetlin Machines (TMs). Instead of using one Tsetlin Automaton (TA) for every unique threshold found when Booleanizing continuous input, we employ two Stochastic Searching on the Line (SSL) automata to learn discriminative lower and upper bounds. The two resulting Boolean features are adapted to the rest of the clause by equipping each clause with its own team of SSLs, which update the bounds during the learning process. Two standard TAs finally decide whether to include the resulting features as part of the clause. In this way, only four automata altogether represent one continuous feature (instead of potentially hundreds of them). We evaluate the performance of the new scheme empirically using five datasets, along with a study of interpretability. On average, TMs with SSL feature representation use 4.3 times fewer literals than the TM with static threshold-based features. Furthermore, in terms of average memory usage and F1-Score, our approach outperforms simple Multi-Layered Artificial Neural Networks, Decision Trees, Support Vector Machines, K-Nearest Neighbor, Random Forest, Gradient Boosted Trees (XGBoost), and Explainable Boosting Machines (EBMs), as well as the standard and real-value weighted TMs. Our approach further outperforms Neural Additive Models on Fraud Detection and StructureBoost on CA-58 in terms of the Area Under Curve while performing competitively on COMPAS.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Networks and Communications,Hardware and Architecture,Signal Processing,Control and Systems Engineering

Link

https://www.mdpi.com/2079-9292/10/17/2107/pdf

Reference54 articles.

1. Deep learning for healthcare: review, opportunities and challenges

2. Predictive data mining in clinical medicine: Current issues and guidelines

3. Acceptance of rules generated by machine learning among medical experts;Pazzani;Methods Inf. Med.,2001

4. Building intelligent credit scoring systems using decision tables;Baesens,2004

5. An empirical evaluation of the comprehensibility of decision table, tree and rule based predictive models

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Tsetlin Machine in DNA sequence classification : Application to prokaryote gene prediction / A match made in silico;2023 International Symposium on the Tsetlin Machine (ISTM);2023-08-29