Word-As-Image for Semantic Typography

Published:2023-07-26 Issue:4 Volume:42 Page:1-11
ISSN:0730-0301
Container-title:ACM Transactions on Graphics
language:en
Short-container-title:ACM Trans. Graph.

Author:

Iluz Shir¹, Vinker Yael¹, Hertz Amir¹, Berio Daniel², Cohen-Or Daniel¹, Shamir Ariel³

Affiliation:

1. Tel Aviv University, Tel Aviv, Israel

2. Goldsmiths University of London, London, United Kingdom

3. Reichman University, Herzliya, Israel

Abstract

A word-as-image is a semantic typography technique where a word illustration presents a visualization of the meaning of the word, while also preserving its readability. We present a method to create word-as-image illustrations automatically. This task is highly challenging as it requires semantic understanding of the word and a creative idea of where and how to depict these semantics in a visually pleasing and legible manner. We rely on the remarkable ability of recent large pretrained language-vision models to distill textual concepts visually. We target simple, concise, black-and-white designs that convey the semantics clearly. We deliberately do not change the color or texture of the letters and do not use embellishments. Our method optimizes the outline of each letter to convey the desired concept, guided by a pretrained Stable Diffusion model. We incorporate additional loss terms to ensure the legibility of the text and the preservation of the style of the font. We show high quality and engaging results on numerous examples and compare to alternative techniques. Code and demo will be available at our project page.

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Graphics and Computer-Aided Design

Link

https://dl.acm.org/doi/pdf/10.1145/3592123

Cited by 16 articles.

1. Deep permutation type design for Chinese characters;The Design Journal;2024-07-16

2. Fabricable 3D Wire Art;Special Interest Group on Computer Graphics and Interactive Techniques Conference Conference Papers '24;2024-07-13

3. The Chosen One: Consistent Characters in Text-to-Image Diffusion Models;Special Interest Group on Computer Graphics and Interactive Techniques Conference Conference Papers '24;2024-07-13

4. GPT, large language models (LLMs) and generative artificial intelligence (GAI) models in geospatial science: a systematic review;International Journal of Digital Earth;2024-05-20

5. TypeDance: Creating Semantic Typographic Logos from Image through Personalized Generation;Proceedings of the CHI Conference on Human Factors in Computing Systems;2024-05-11