X-Class-Reference-Cited by-同舟云学术

X-Class

Published:2013-01 Issue:1 Volume:31 Page:1-40
ISSN:1046-8188
Container-title:ACM Transactions on Information Systems
language:en
Short-container-title:ACM Trans. Inf. Syst.

Author:

Costa Gianni¹,Ortale Riccardo¹,Ritacco Ettore¹

Affiliation:

1. ICAR-CNR

Abstract

The supervised classification of XML documents by structure involves learning predictive models in which certain structural regularities discriminate the individual document classes. Hitherto, research has focused on the adoption of prespecified substructures. This is detrimental for classification effectiveness, since the a priori chosen substructures may not accord with the structural properties of the XML documents. Therein, an unexplored question is how to choose the type of structural regularity that best adapts to the structures of the available XML documents. We tackle this problem through X-Class, an approach that handles all types of tree-like substructures and allows for choosing the most discriminatory one. Algorithms are designed to learn compact rule-based classifiers in which the chosen substructures discriminate the classes of XML documents. X-Class is studied across various domains and types of substructures. Its classification performance is compared against several rule-based and SVM-based competitors. Empirical evidence reveals that the classifiers induced by X-Class are compact, scalable, and at least as effective as the established competitors. In particular, certain substructures allow the induction of very compact classifiers that generally outperform the rule-based competitors in terms of effectiveness over all chosen corpora of XML data. Furthermore, such classifiers are substantially as effective as the SVM-based competitor, with the additional advantage of a high-degree of interpretability.

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Science Applications,General Business, Management and Accounting,Information Systems

Link

https://dl.acm.org/doi/pdf/10.1145/2414782.2414785

Reference78 articles.

1. Xproj

2. Route kernels for trees

3. XML search

Cited by 22 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. An application for predicting phishing attacks: A case of implementing a support vector machine learning model;Cyber Security and Applications;2024

2. Rule-Based Detection of Anomalous Patterns in Device Behavior for Explainable IoT Security;IEEE Transactions on Services Computing;2023-11

3. A Review Selection Method Based on Consumer Decision Phases in E-commerce;ACM Transactions on Information Systems;2023-08-21

4. A parallel and balanced SVM algorithm on spark for data-intensive computing;Intelligent Data Analysis;2023-07-20

5. New Associative Classification Method Based on Rule Pruning for Classification of Datasets;IEEE Access;2019