Explainable Image Similarity: Integrating Siamese Networks and Grad-CAM-Reference-Cited by-同舟云学术

Explainable Image Similarity: Integrating Siamese Networks and Grad-CAM

Published:2023-10-14 Issue:10 Volume:9 Page:224
ISSN:2313-433X
Container-title:Journal of Imaging
language:en
Short-container-title:J. Imaging

Author:

Livieris Ioannis E.¹^ORCID,Pintelas Emmanuel²^ORCID,Kiriakidou Niki³^ORCID,Pintelas Panagiotis²^ORCID

Affiliation:

1. Department of Statistics & Insurance, University of Piraeus, GR 185-34 Piraeus, Greece

2. Department of Mathematics, University of Patras, GR 265-00 Patras, Greece

3. Department of Informatics and Telematics, Harokopio University of Athens, GR 177-78 Athens, Greece

Abstract

With the proliferation of image-based applications in various domains, the need for accurate and interpretable image similarity measures has become increasingly critical. Existing image similarity models often lack transparency, making it challenging to understand the reasons why two images are considered similar. In this paper, we propose the concept of explainable image similarity, where the goal is the development of an approach, which is capable of providing similarity scores along with visual factual and counterfactual explanations. Along this line, we present a new framework, which integrates Siamese Networks and Grad-CAM for providing explainable image similarity and discuss the potential benefits and challenges of adopting this approach. In addition, we provide a comprehensive discussion about factual and counterfactual explanations provided by the proposed framework for assisting decision making. The proposed approach has the potential to enhance the interpretability, trustworthiness and user acceptance of image-based systems in real-world image similarity applications.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Computer Graphics and Computer-Aided Design,Computer Vision and Pattern Recognition,Radiology, Nuclear Medicine and imaging

Link

https://www.mdpi.com/2313-433X/9/10/224/pdf

Reference45 articles.

1. End-to-end learning of deep visual representations for image retrieval;Gordo;Int. J. Comput. Vis.,2017

2. Bell, S., Zitnick, C.L., Bala, K., and Girshick, R. (July, January 26). Inside-outside net: Detecting objects in context with skip pooling and recurrent neural networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA.

3. Gygli, M., Grabner, H., Riemenschneider, H., and Van Gool, L. (2014, January 6–12). Creating summaries from user videos. Proceedings of the Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland. Proceedings, Part VII 13.

4. Deep convolutional neural networks for computer-aided detection: CNN architectures, dataset characteristics and transfer learning;Shin;IEEE Trans. Med Imaging,2016

5. Fine-tuning CNN image retrieval with no human annotation;Tolias;IEEE Trans. Pattern Anal. Mach. Intell.,2018

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Improving flood forecast accuracy based on explainable convolutional neural network by Grad-CAM method;Journal of Hydrology;2024-10

2. ExplainLFS: Explaining neural architectures for similarity learning from local perturbations in the latent feature space;Information Fusion;2024-08

3. Lung Sound Classification for Respiratory Disease Identification Using Deep Learning: A Survey;International Journal of Online and Biomedical Engineering (iJOE);2024-07-16

4. Automatic classification and segmentation of multiclass jaw lesions in cone-beam CT using deep learning;Dentomaxillofacial Radiology;2024-06-27