Do you see what I see? Measuring the semantic differences in image‐recognition services' outputs-Reference-Cited by-同舟云学术

Do you see what I see? Measuring the semantic differences in image‐recognition services' outputs

Published:2023-09-05 Issue:11 Volume:74 Page:1307-1324
ISSN:2330-1635
Container-title:Journal of the Association for Information Science and Technology
language:en
Short-container-title:Asso for Info Science & Tech

Author:

Berg Anton¹^ORCID,Nelimarkka Matti¹²^ORCID

Affiliation:

1. University of Helsinki Helsinki Finland

2. Aalto University Espoo Finland

Abstract

AbstractAs scholars increasingly undertake large‐scale analysis of visual materials, advanced computational tools show promise for informing that process. One technique in the toolbox is image recognition, made readily accessible via Google Vision AI, Microsoft Azure Computer Vision, and Amazon's Rekognition service. However, concerns about such issues as bias factors and low reliability have led to warnings against research employing it. A systematic study of cross‐service label agreement concretized such issues: using eight datasets, spanning professionally produced and user‐generated images, the work showed that image‐recognition services disagree on the most suitable labels for images. Beyond supporting caveats expressed in prior literature, the report articulates two mitigation strategies, both involving the use of multiple image‐recognition services: Highly explorative research could include all the labels, accepting noisier but less restrictive analysis output. Alternatively, scholars may employ word‐embedding‐based approaches to identify concepts that are similar enough for their purposes, then focus on those labels filtered in.

Publisher

Wiley

Subject

Library and Information Sciences,Information Systems and Management,Computer Networks and Communications,Information Systems

Reference53 articles.

1. Automated Visual Content Analysis (AVCA) in Communication Research: A Protocol for Large Scale Image Classification with Pre-Trained Computer Vision Models

2. Visual Representations of Disaster

3. Answering Mobile Surveys With Images: An Exploration Using a Computer Vision API

4. Why Classifications Matter

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. To share or not to share? Image data sharing in the social sciences and humanities;Information Research an international electronic journal;2024-06-18