A multimodal grammar of artificial intelligence: Measuring the gains and losses in generative AI

Published: 2023-12-28
ISSN: 2634-9795
Container-title: Multimodality & Society
Language: en
Short-container-title: Multimodality & Society

Author:

Cope Bill¹, Kalantzis Mary¹

Affiliation:

1. University of Illinois at Urbana-Champaign, USA

Abstract

This paper analyzes the scope of Artificial Intelligence (AI) from the perspective of a multimodal grammar. Its focal point is Generative AI, a technology that puts so-called Large Language Models to work. The first part of the paper analyzes Generative AI, based as it is on the statistical probability of one token (a word or part of a word) following another. If the relation of tokens is meaningful, this is circumstantial and no more, because its mechanisms of statistical analysis eschew any theory of meaning. This is the case not only for the written text that Generative AI leverages, but by extension image and multimodal forms of meaning that it can generate. The AI can only work with non-textual forms of meaning after applying language labels, and to that extent is captive not only to the limits of probabilistic statistics but the limits of written language as well. While acknowledging gains arising from the brute statistical power of Generative AI, in its second part the paper goes on to map what is lost in its statistical and text-bound approaches to multimodal meaning-making. Our measure of these gains and losses is guided by the concept of grammar, defined here as a theory of the elemental patterns of meaning in the world—not just written text and speech, but also image, space, object, body, and sound. Ironically, a good deal of what is lost by Generative AI is computable. The third and final part of the paper briefly discusses educational applications of Generative AI. Given both its power and intrinsic limitations, we have been experimenting with the application of Generative AI in educational settings and the ways it might be put to pedagogical use. How does a grammatical analysis help us to identify the scope of worthwhile application? 
Finally, if more of human experience is computable than can be captured in text-bound AI, how might it be possible at the level of code to create a synthesis in which grammatical and multimodal approaches complement Generative AI?

Publisher

SAGE Publications

Subject

General Medicine

Link

http://journals.sagepub.com/doi/pdf/10.1177/26349795231221699

References: 72 articles.

1. An introduction to cybernetics.

2. Hallucinating faces

3. Bender EM, Gebru T, McMillan-Major A, et al. (2021) On the dangers of stochastic parrots: Can language models be too big? In: FAccT ’21: Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency, Canada, March 3–10, 2021, pp. 610–623.

Cited by 3 articles.

1. The generic uniqueness of AI imagery: A critical approach to Dall-E as semiotic technology; Discourse & Society; 2024-09-12

2. Epilogue; Text & Talk; 2024-05-01

3. Generative AI as a Writing Technology: Challenges and Opportunities for School Writing; Encyclopedia of Educational Innovation; 2024