Multimodal Distributional Semantics

Published: 2014-01-23 Volume: 49 Page: 1-47
ISSN: 1076-9757
Container-title: Journal of Artificial Intelligence Research
Short-container-title: JAIR

Author:

Bruni E., Tran N. K., Baroni M.

Abstract

Distributional semantic models derive computational representations of word meaning from the patterns of co-occurrence of words in text. Such models have been a success story of computational linguistics, being able to provide reliable estimates of semantic relatedness for the many semantic tasks requiring them. However, distributional models extract meaning information exclusively from text, which is an extremely impoverished basis compared to the rich perceptual sources that ground human semantic knowledge. We address the lack of perceptual grounding of distributional models by exploiting computer vision techniques that automatically identify discrete visual words in images, so that the distributional representation of a word can be extended to also encompass its co-occurrence with the visual words of images it is associated with. We propose a flexible architecture to integrate text- and image-based distributional information, and we show in a set of empirical tests that our integrated model is superior to the purely text-based approach, and it provides somewhat complementary semantic information with respect to the latter.
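
The idea of extending a word's text-based co-occurrence vector with its co-occurrence with visual words can be illustrated with a minimal sketch. It is not the authors' implementation: the weighted-concatenation scheme, the `alpha` parameter, the function names, and all counts below are assumptions made up for illustration only.

```python
import math

def l2_normalize(vec):
    """Scale a vector to unit length (returned unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)

def fuse(text_vec, image_vec, alpha=0.5):
    """Hypothetical fusion: weighted concatenation of the two normalized vectors.

    alpha weights the text channel and (1 - alpha) the image channel.
    """
    text_part = [alpha * x for x in l2_normalize(text_vec)]
    image_part = [(1 - alpha) * x for x in l2_normalize(image_vec)]
    return text_part + image_part

def cosine(a, b):
    """Cosine similarity, used here as the semantic relatedness estimate."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

# Invented toy counts: co-occurrence with 3 context words (text channel)
# and with 2 discrete visual words (image channel).
text_counts = {"moon": [4, 0, 1], "sun": [3, 1, 0], "car": [0, 5, 2]}
visual_counts = {"moon": [6, 1], "sun": [5, 2], "car": [0, 7]}

multimodal = {w: fuse(text_counts[w], visual_counts[w]) for w in text_counts}

sim_moon_sun = cosine(multimodal["moon"], multimodal["sun"])
sim_moon_car = cosine(multimodal["moon"], multimodal["car"])
print(f"moon~sun: {sim_moon_sun:.3f}  moon~car: {sim_moon_car:.3f}")
```

With these toy counts, "moon" comes out closer to "sun" than to "car" in the fused space. Normalizing each channel before concatenation keeps the larger raw counts of one modality from dominating the other.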

Publisher

AI Access Foundation

Subject

Artificial Intelligence

Cited by 356 articles.

1. MuSAM: Mutual-Scenario-Aware Multimodal-Enhanced Representation Learning for Semantic Similarity;IEEE Transactions on Industrial Informatics;2024-09

2. Bridging the Gap: The Integration of Sustainable Technologies in Artificial General Intelligence;Advanced Technologies and Societal Change;2024-08-31

3. Visual experience modulates the sensitivity to the distributional history of words in natural language;Psychonomic Bulletin & Review;2024-08-22

4. Symbol ungrounding: what the successes (and failures) of large language models reveal about human cognition;Philosophical Transactions of the Royal Society B: Biological Sciences;2024-08-19

5. The pluralization palette: unveiling semantic clusters in English nominal pluralization through distributional semantics;Morphology;2024-07-12