Abstract
Image search engines differ significantly from general web search engines in how they present search results. This difference leads to different interaction and examination behavior, and therefore calls for changes in evaluation methodology. However, image search evaluation still relies on methods designed for general web search. In particular, offline metrics are computed from coarse-grained topical relevance judgments under the assumption that users examine results sequentially.
In this article, we investigate crowdsourcing-based annotation methods for image search evaluation through a laboratory user study. Using user satisfaction as the gold standard, we make several interesting findings. First, annotating relevance row by row, rather than item by item, is more efficient without hurting evaluation performance. Second, beyond topical relevance, image quality plays a crucial role in evaluating image search results, and its importance varies with search intent. Third, fine-grained annotation significantly outperforms traditional four-level relevance scales. To the best of our knowledge, this is the first work to systematically study how various factors in data annotation affect image search evaluation. Our results suggest different strategies for exploiting crowdsourcing to collect annotations under different conditions.
Funder
National Natural Science Foundation of China
National Key Research and Development Program of China
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Science Applications, General Business, Management and Accounting, Information Systems
Cited by
10 articles.