Combining Common Sense Rules and Machine Learning to Understand Object Manipulation-Reference-Cited by-同舟云学术

Combining Common Sense Rules and Machine Learning to Understand Object Manipulation

Published:2019-05-21 Issue:1 Volume:24 Page:157-172
ISSN:0324-721X
Container-title:Acta Cybernetica
language:
Short-container-title:Acta Cybern

Author:

Sárkány András^ORCID,Csákvári Máté^ORCID,Olasz Mike^ORCID

Abstract

Automatic situation understanding in videos has improved remarkably in recent years. However, state-of-the-art methods still have considerable shortcomings: they usually require training data for each object class present and may have high false positive or false negative rates, making them impractical for general applications. We study a case that has a limited goal in a narrow context and argue about the complexity of the general problem. We suggest to solve this problem by including common sense rules and by exploiting various state-of-the art deep neural networks (DNNs) as the detectors of the conditions of those rules. We want to deal with the manipulation of unknown objects at a remote table. We have two action types to be detected: `picking up an object from the table' and `putting an object onto the table' and due to remote monitoring, we consider monocular observation. We quantitatively evaluate the performance of the system on manually annotated video segments, present precision and recall scores. We also discuss issues on machine reasoning. We conclude that the proposed neural-symbolic approach a) diminishes the required size of training data and b) enables new applications where labeled data are difficult or expensive to get.

Publisher

University of Szeged

Subject

Computer Vision and Pattern Recognition,Software,Computer Science (miscellaneous),Electrical and Electronic Engineering,Information Systems and Management,Management Science and Operations Research,Theoretical Computer Science

Reference23 articles.

1. Abdulla, Waleed. Mask r-cnn for object detection and instance segmentation on keras and tensorflow, 2017.

2. Ahad, Md Atiqur Rahman. Motion history images for action recognition and understanding. Springer Science & Business Media, 2012.

3. Joint discovery of object states and manipulation actions;Alayrac,2017

4. Bellman, Richard E. Adaptive control processes: a guided tour, volume 2045. Princeton University Press, 2015.

5. Bishop, Christopher. Pattern Recognition and Machine Learning. Springer-Verlag New York, 2006.