Towards combining commonsense reasoning and knowledge acquisition to guide deep learning

Author:

Sridharan MohanORCID,Mota Tiago

Abstract

AbstractAlgorithms based on deep network models are being used for many pattern recognition and decision-making tasks in robotics and AI. Training these models requires a large labeled dataset and considerable computational resources, which are not readily available in many domains. Also, it is difficult to explore the internal representations and reasoning mechanisms of these models. As a step towards addressing the underlying knowledge representation, reasoning, and learning challenges, the architecture described in this paper draws inspiration from research in cognitive systems. As a motivating example, we consider an assistive robot trying to reduce clutter in any given scene by reasoning about the occlusion of objects and stability of object configurations in an image of the scene. In this context, our architecture incrementally learns and revises a grounding of the spatial relations between objects and uses this grounding to extract spatial information from input images. Non-monotonic logical reasoning with this information and incomplete commonsense domain knowledge is used to make decisions about stability and occlusion. For images that cannot be processed by such reasoning, regions relevant to the tasks at hand are automatically identified and used to train deep network models to make the desired decisions. Image regions used to train the deep networks are also used to incrementally acquire previously unknown state constraints that are merged with the existing knowledge for subsequent reasoning. Experimental evaluation performed using simulated and real-world images indicates that in comparison with baselines based just on deep networks, our architecture improves reliability of decision making and reduces the effort involved in training data-driven deep network models.

Funder

Air Force Office of Scientific Research

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence

Reference71 articles.

1. Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C., Corrado, G., Davis, A., Dean, J., Devin, M. et al. (2016). TensorFlow: Large-scale machine learning on heterogeneous distributed systems. arXiv preprint arXiv:1603.04467https://arxiv.org/abs/1603.04467

2. Assaf, R., Schumann, A. (2019). Explainable deep neural networks for multivariate time series predictions. In International Joint Conference on Artificial Intelligence.

3. Balai, E., Gelfond, M., Zhang, Y. (2013). Towards answer set programming with sorts. In International Conference on Logic Programming and Nonmonotonic Reasoning, Corunna, Spain. https://link.springer.com/chapter/10.1007/978-3-642-40564-8_14

4. Balduccini, M., Gelfond, M. (2003). Logic programs with consistency-restoring rules. In AAAI Spring Symposium on Logical Formalization of Commonsense Reasoning, pp 9–18

5. Battaglia, P. W., Hamrick, J. B., & Tenenbaum, J. B. (2013). Simulation as an engine of physical scene understanding. Proceedings of the National Academy of Sciences, 110, 18327–18332. https://doi.org/10.1073/pnas.1306572110

Cited by 2 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. ChatGPT/AI in Healthcare Management;Journal of Clinical Medical Research;2023-09-18

2. Knowledge-based Reasoning and Learning under Partial Observability in Ad Hoc Teamwork;Theory and Practice of Logic Programming;2023-06-26

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3