Regularized Pairwise Relationship based Analytics for Structured Data

Author:

Luo Zhaojing1ORCID,Cai Shaofeng1ORCID,Wang Yatong2ORCID,Ooi Beng Chin1ORCID

Affiliation:

1. National University of Singapore, Singapore, Singapore

2. University of Electronic Science and Technology of China, Chengdu, China

Abstract

In line with the increasing machine learning model inference accuracy, deep learning (DL) models have been increasingly applied to structured data for a wide spectrum of real-world applications, including product recommendations, online advertisement, healthcare analytics and risk analysis. However, unlike unstructured data, structured data is high-dimensional and sparse and therefore engenders a large number of parameters in DL, making DL models more prone to overfitting. To alleviate the overfitting problem, various regularization methods have been designed to constrain the model parameters as a means to control the model complexity. Unfortunately, these methods are often restricted to regularizing the parameter values directly without considering the intrinsic correlations and dependencies between attribute fields of structured data which is however key to effective structured data modeling. In this paper, we re-examine DL for structured data from a new perspective of attribute interactions. In particular, we seek to explicitly model and regularize the pairwise relationships between attribute fields of structured data, in a field-adaptive manner, via a proposed attentive and interpretable framework called ATT-Reg. Specifically, in this framework, a set of attentive weight matrices are introduced to each attribute field for modeling obviously different relationships with its neighboring attribute fields. Further, we derive from the Bayesian viewpoint a novel Attentive Regularization method for imposing adaptive regularization strengths on different pairs of attribute fields, based on the informativeness of their relationship, which is calculated using both data-driven information and functional dependency (FD) knowledge. Such adaptive regularization facilitates each attribute field to learn discriminative and diversified representations for more effective predictive analytics. We also develop a feature attribution method for supporting more interpretable predictions We validate the effectiveness of our ATT-Reg on six real-world datasets. Extensive experimental results show that ATT-Reg achieves significant improvement over state-of-the-art graph models, attentive models as well as regularization methods and supports an excellent degree of interpretation.

Funder

Singapore Ministry of Education Academic Research Fund Tier 3

Publisher

Association for Computing Machinery (ACM)

Reference64 articles.

1. Yuichiro Anzai . 2012. Pattern recognition and machine learning . Elsevier . Yuichiro Anzai. 2012. Pattern recognition and machine learning. Elsevier.

2. Nabiha Asghar and Amira Ghenai . 2015. Automatic discovery of functional dependencies and conditional functional dependencies: a comparative study. university of Waterloo ( 2015 ). Nabiha Asghar and Amira Ghenai. 2015. Automatic discovery of functional dependencies and conditional functional dependencies: a comparative study. university of Waterloo (2015).

3. Or Biran and Courtenay Cotton . 2017 . Explanation and justification in machine learning: A survey . In IJCAI-17 workshop on explainable AI , Vol. 8 . 8--13. Or Biran and Courtenay Cotton. 2017. Explanation and justification in machine learning: A survey. In IJCAI-17 workshop on explainable AI, Vol. 8. 8--13.

4. Using Word Embedding to Enable Semantic Queries in Relational Databases

5. Model slicing for supporting complex analytics with elastic inference cost and resource constraints

Cited by 5 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Applications and Challenges for Large Language Models: From Data Management Perspective;2024 IEEE 40th International Conference on Data Engineering (ICDE);2024-05-13

2. DMRNet: Effective Network for Accurate Discharge Medication Recommendation;2024 IEEE 40th International Conference on Data Engineering (ICDE);2024-05-13

3. Database Native Model Selection: Harnessing Deep Neural Networks in Database Systems;Proceedings of the VLDB Endowment;2024-01

4. ECGGAN: A Framework for Effective and Interpretable Electrocardiogram Anomaly Detection;Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2023-08-04

5. MINT: Detecting Fraudulent Behaviors from Time-Series Relational Data;Proceedings of the VLDB Endowment;2023-08

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3