TallyQA: Answering Complex Counting Questions

Published: 2019-07-17 Volume: 33 Pages: 8076-8084
ISSN: 2374-3468
Container title: Proceedings of the AAAI Conference on Artificial Intelligence
Short container title: AAAI

Author:

Manoj Acharya, Kushal Kafle, Christopher Kanan

Abstract

Most counting questions in visual question answering (VQA) datasets are simple and require no more than object detection. Here, we study algorithms for complex counting questions that involve relationships between objects, attribute identification, reasoning, and more. To do this, we created TallyQA, the world’s largest dataset for open-ended counting. We propose a new algorithm for counting that uses relation networks with region proposals. Our method lets relation networks be efficiently used with high-resolution imagery. It yields state-of-the-art results compared to baseline and recent systems on both TallyQA and the HowMany-QA benchmark.
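
To make the phrase "relation networks with region proposals" concrete, the following is a minimal sketch of a generic relation network applied to a handful of region features. It is not the authors' architecture: the function name `relation_network_scores`, the single-layer forms of `g` and `f`, the feature sizes, and the random untrained weights are all assumptions chosen for illustration. It only shows the general form, in which a pairwise function is applied to every pair of regions together with the question and the results are summed before a readout.

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def relation_network_scores(regions, question, w_g, w_f):
    """Score candidate counts from pairwise relations between region features.

    regions:  (n, d) array, one feature vector per region proposal
    question: (q,) array, question embedding
    w_g:      (2*d + q, h) weights of the pairwise function g (hypothetical)
    w_f:      (h, c) weights of the readout f over c candidate counts (hypothetical)
    """
    n, _ = regions.shape
    # Build every ordered pair (o_i, o_j) and append the question to each pair.
    left = np.repeat(regions, n, axis=0)               # (n*n, d)
    right = np.tile(regions, (n, 1))                   # (n*n, d)
    q = np.broadcast_to(question, (n * n, question.shape[0]))
    pairs = np.concatenate([left, right, q], axis=1)   # (n*n, 2*d + q)
    # g scores each pair; summing over pairs makes the result order-invariant.
    pooled = relu(pairs @ w_g).sum(axis=0)             # (h,)
    return pooled @ w_f                                # (c,)

# Toy inputs: 5 region proposals with 8-dim features, a 4-dim question vector.
rng = np.random.default_rng(0)
regions = rng.normal(size=(5, 8))
question = rng.normal(size=4)
w_g = rng.normal(size=(2 * 8 + 4, 16))
w_f = rng.normal(size=(16, 10))

scores = relation_network_scores(regions, question, w_g, w_f)
predicted_count = int(np.argmax(scores))  # index of the highest-scoring count class
```

The number of pairs grows as the square of the number of inputs, so operating on a few region proposals rather than on every cell of a dense feature grid keeps the pairwise step small; this is one plausible reading of the efficiency claim in the abstract, not a description of the paper's exact design.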

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Cited by 15 articles.

1. Robust Visual Question Answering: Datasets, Methods, and Future Challenges;IEEE Transactions on Pattern Analysis and Machine Intelligence;2024-08

2. A Balanced Counting Visual Question Answering Dataset;Lecture Notes in Networks and Systems;2024

3. Evaluation of Systematic Errors in Visual Question Answering;Lecture Notes in Networks and Systems;2024

4. Counting-based visual question answering with serial cascaded attention deep learning;Pattern Recognition;2023-12

5. A Symbolic Characters Aware Model for Solving Geometry Problems;Proceedings of the 31st ACM International Conference on Multimedia;2023-10-26