$$\alpha$$ILP: thinking visual scenes as differentiable logic programs-Reference-Cited by-同舟云学术

$$\alpha$$ILP: thinking visual scenes as differentiable logic programs

Published:2023-03-14 Issue:5 Volume:112 Page:1465-1497
ISSN:0885-6125
Container-title:Machine Learning
language:en
Short-container-title:Mach Learn

Author:

Shindo Hikaru,Pfanschilling Viktor,Dhami Devendra Singh,Kersting Kristian

Abstract

AbstractDeep neural learning has shown remarkable performance at learning representations for visual object categorization. However, deep neural networks such as CNNs do not explicitly encode objects and relations among them. This limits their success on tasks that require a deep logical understanding of visual scenes, such as Kandinsky patterns and Bongard problems. To overcome these limitations, we introduce

$$\alpha {\textit{ILP}}$$

α ILP , a novel differentiable inductive logic programming framework that learns to represent scenes as logic programs—intuitively, logical atoms correspond to objects, attributes, and relations, and clauses encode high-level scene information.

$$\alpha$$

α ILP has an end-to-end reasoning architecture from visual inputs. Using it,

$$\alpha$$

α ILP performs differentiable inductive logic programming on complex visual scenes, i.e., the logical rules are learned by gradient descent. Our extensive experiments on Kandinsky patterns and CLEVR-Hans benchmarks demonstrate the accuracy and efficiency of

$$\alpha {\textit{ILP}}$$

α ILP in learning complex visual-logical concepts.

Funder

SPAICER

TAILOR

AICO

Technische Universität Darmstadt

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence,Software

Link

https://link.springer.com/content/pdf/10.1007/s10994-023-06320-1.pdf

Reference74 articles.

1. Amizadeh, S., Palangi, H., Polozov, A., Huang, Y., & Koishida, K. (2020). Neuro-symbolic visual reasoning: Disentangling visual from reasoning. Proceedings of the 37th international conference on machine learning (ICML) (Vol. 119, pp. 279–290).

2. Antol, S., Agrawal, A., Lu, J., Mitchell, M., Batra, D., Zitnick, C. L., & Parikh, D. (2015). Vqa: Visual question answering. In Proceedings of the IEEE international conference on computer vision (ICCV).

3. Badreddine, S., d’Avila Garcez, A., Serafini, L., & Spranger, M. (2022). Logic tensor networks. Artificial Intelligence, 303, 103649.

4. Bellodi, E., & Riguzzi, F. (2015). Structure learning of probabilistic logic programs by searching the clause space. Theory and Practice of Logic Programming, 15(2), 169–212.

5. Besold, T. R., d’Avila Garcez, A. S., Bader, S., Bowman, H., Domingos, P. M., Hitzler, P., Kühnberger, K., Lamb, L. C., Lowd, D., Lima, P. M. V., de Penning, L., Pinkas, G., Poon, H., & Zaverucha, G. (2017). Neural-symbolic learning and reasoning: A survey and interpretation. In CoRRarXiv:1711.03902.

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Neuro-symbolic Predicate Invention: Learning relational concepts from visual scenes;Neurosymbolic Artificial Intelligence;2024-08-21

2. From statistical relational to neurosymbolic artificial intelligence: A survey;Artificial Intelligence;2024-03

3. The Role of Foundation Models in Neuro-Symbolic Learning and Reasoning;Lecture Notes in Computer Science;2024

4. Embed2Rule Scalable Neuro-Symbolic Learning via Latent Space Weak-Labelling;Lecture Notes in Computer Science;2024