A multi-scenario approach to continuously learn and understand norm violations
Published:2023-08-16
Issue:2
Volume:37
ISSN:1387-2532
Container-title:Autonomous Agents and Multi-Agent Systems
language:en
Short-container-title:Auton Agent Multi-Agent Syst
Author:
Freitas dos Santos Thiago, Osman Nardine, Schorlemmer Marco
Abstract
Using norms to guide and coordinate interactions has gained tremendous attention in the multiagent community. However, new challenges arise as interest moves towards dynamic socio-technical systems, where human and software agents interact, and interactions are required to adapt to changing human needs. For instance, different agents (human or software) might not have the same understanding of what it means to violate a norm (e.g., what characterizes hate speech), or their understanding of a norm might change over time (e.g., what constitutes an acceptable response time). The challenge is to address these issues by learning to detect norm violations from limited interaction data and to explain the reasons for such violations. To do that, we propose a framework that combines Machine Learning (ML) models and incremental learning techniques. Our proposal is equipped to solve tasks in both tabular and text classification scenarios. Incremental learning is used to continuously update the base ML models as interactions unfold, ensemble learning is used to handle the imbalanced class distribution of the interaction stream, a Pre-trained Language Model (PLM) is used to learn from text sentences, and Integrated Gradients (IG) is the interpretability algorithm. We evaluate the proposed approach in the use case of Wikipedia article edits, where interactions revolve around editing articles, and the norm in question is prohibiting vandalism. Results show that the proposed framework can learn to detect norm violations in a setting with data imbalance and concept drift.
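As an illustration of the interpretability step named in the abstract, the following is a minimal sketch of Integrated Gradients on a toy logistic classifier. It is not the authors' implementation: the abstract applies IG to the framework's own models, whereas the logistic model, the function names (`predict`, `gradient`, `integrated_gradients`), the all-zeros baseline, and the feature values here are assumptions chosen only to keep the example self-contained.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def predict(weights, bias, x):
    """Probability of the 'violation' class under a toy logistic model (assumed)."""
    return sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)


def gradient(weights, bias, x):
    """Analytic gradient of predict() with respect to each input feature."""
    p = predict(weights, bias, x)
    return [p * (1.0 - p) * w for w in weights]


def integrated_gradients(weights, bias, x, baseline, steps=200):
    """Approximate IG with a midpoint Riemann sum along the straight
    path from `baseline` to `x`."""
    total = [0.0] * len(x)
    for k in range(steps):
        alpha = (k + 0.5) / steps
        point = [b + alpha * (xi - b) for xi, b in zip(x, baseline)]
        grad = gradient(weights, bias, point)
        total = [t + g for t, g in zip(total, grad)]
    # Scale the averaged gradients by the input's distance from the baseline.
    return [(xi - b) * t / steps for xi, b, t in zip(x, baseline, total)]


if __name__ == "__main__":
    # Hypothetical weights and features, for illustration only.
    weights, bias = [2.0, -1.0, 0.5], -0.3
    x, baseline = [1.0, 0.5, 2.0], [0.0, 0.0, 0.0]
    print(integrated_gradients(weights, bias, x, baseline))
```

A useful sanity check is the completeness property of IG: the attributions should sum, up to numerical error, to the difference between the model output at the input and at the baseline.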
Funder
EU's Horizon 2020; Generalitat de Catalunya; Agencia Estatal de Investigación; EU's Horizon 2022; Instituto de Investigación en Inteligencia Artificial
Publisher
Springer Science and Business Media LLC
Subject
Artificial Intelligence