Abstract
In the realm of artificial intelligence (AI), generative systems, most notably Midjourney, have tremendous power to generate creative images of buildings and sites of Islamic architectural heritage through text-to-image generation based on the internet. The AI-generated representations have significant potential for architects, specialists, and common users. However, the system has considerable limitations when generating images for some buildings and sites where the representations appear too far from their original represented structures. This research article attempts to answer the question: What are the limitations of using the AI system of Midjourney in producing images similar to the original buildings and sites of the Islamic architectural heritage? The research employs prompt engineering techniques based on historical sources as inputs to examine the accuracy of the output of the AI-generated images of selected examples of structures of the Islamic tradition. It compares the output with the original look by employing direct observation and critical analysis of human intelligence (HI). It categorizes these limitations into four groups: (1) limits of the prompt, (2) limits of fame, (3) limits of regionality and historical styles, and (4) limits of architectural elements and details. It concludes that while Midjourney has great capability to represent high-end AI-generated images inspired by the Islamic tradition, it currently falls short of presenting the actual appearance of some of their original structures.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献