Evaluating a Large Language Model on Searching for GUI Layouts

Author:

Brie Paul1ORCID,Burny Nicolas2ORCID,Sluÿters Arthur2ORCID,Vanderdonckt Jean2ORCID

Affiliation:

1. TeleportHQ, Cluj-Napoca, Romania

2. Université catholique de Louvain, Louvain-la-Neuve, Belgium

Abstract

The field of generative artificial intelligence has seen significant advancements in recent years with the advent of large language models, which have shown impressive results in software engineering tasks but not yet in engineering user interfaces. Thus, we raise a specific research question: would an LLM-based system be able to search for relevant GUI layouts? To address this question, we conducted a controlled study evaluating how Instigator, an LLM-based system for searching GUI layouts of web pages by generative pre-trained training, would return GUI layouts that are relevant to a given instruction and what would be the user experience of (N =34) practitioners interacting with Instigator. Our results identify a very high similarity and a moderate correlation between the rankings of the GUI layouts generated by Instigator and the rankings of the practitioners with respect to their relevance to a given design instruction. We highlight the results obtained through thirteen UEQ+ scales that characterize the user experience of the practitioner with Instigator, which we use to discuss perspectives for improving such future tools.

Funder

Fonds de la Recherche Scientifique-FRS

EU EIC Pathfinder-Awareness Inside challenge

Fonds De La Recherche Scientifique - FNRS

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications,Human-Computer Interaction,Social Sciences (miscellaneous)

Reference107 articles.

1. Toufique Ahmed and Premkumar Devanbu . 2023 . Few-Shot Training LLMs for Project-Specific Code-Summarization . In Proceedings of the 37th IEEE/ACM International Conference on Automated Software Engineering ( Rochester, MI, USA) (ASE '22). Association for Computing Machinery, New York, NY, USA, Article 177, 5 pages. https://doi.org/10.1145/ 3551349.3559555 Toufique Ahmed and Premkumar Devanbu. 2023. Few-Shot Training LLMs for Project-Specific Code-Summarization. In Proceedings of the 37th IEEE/ACM International Conference on Automated Software Engineering (Rochester, MI, USA) (ASE '22). Association for Computing Machinery, New York, NY, USA, Article 177, 5 pages. https://doi.org/10.1145/ 3551349.3559555

2. Kei Aoki , Hajime Kimura , and Shigenobu Kobayashi . 2004 . Distributed Reinforcement Learning using Bi-directional Decision Making for Multicriteria Control of Multi-Stage Flow Systems . In Proceedings of IAS '04 . http://sysplan. nams.kyushu-u.ac.jp/gen/papers/klab2004/IAS8AokiBDM.pdf Kei Aoki, Hajime Kimura, and Shigenobu Kobayashi. 2004. Distributed Reinforcement Learning using Bi-directional Decision Making for Multicriteria Control of Multi-Stage Flow Systems. In Proceedings of IAS '04. http://sysplan. nams.kyushu-u.ac.jp/gen/papers/klab2004/IAS8AokiBDM.pdf

3. Gideon Avrahami , Kenneth P. Brooks , and Marc H. Brown . 1989. A Two-View Approach to Constructing User Interfaces . In Proceedings of the 16th Annual Conference on Computer Graphics and Interactive Techniques (SIGGRAPH '89) . Association for Computing Machinery, New York, NY, USA, 137--146. https://doi.org/10.1145/74333.74347 10.1145/74333.74347 Gideon Avrahami, Kenneth P. Brooks, and Marc H. Brown. 1989. A Two-View Approach to Constructing User Interfaces. In Proceedings of the 16th Annual Conference on Computer Graphics and Interactive Techniques (SIGGRAPH '89). Association for Computing Machinery, New York, NY, USA, 137--146. https://doi.org/10.1145/74333.74347

4. Dzmitry Bahdanau , Kyunghyun Cho , and Yoshua Bengio . 2015 . Neural Machine Translation by Jointly Learning to Align and Translate . In Proc. of 3rd International Conference on Learning Representations ( San Diego, CA, USA) (ICLR '15), Yoshua Bengio and Yann LeCun (Eds.). http://arxiv.org/abs/1409.0473 Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. 2015. Neural Machine Translation by Jointly Learning to Align and Translate. In Proc. of 3rd International Conference on Learning Representations (San Diego, CA, USA) (ICLR '15), Yoshua Bengio and Yann LeCun (Eds.). http://arxiv.org/abs/1409.0473

5. F. Behrang , S. P. Reiss , and A. Orso . 2018. GUIFetch: Supporting App Design and Development through GUI Search . In 2018 IEEE/ACM 5th International Conference on Mobile Software Engineering and Systems (MOBILESoft). IEEE Computer Society , Los Alamitos, CA, USA, 236--246. https://doi.ieeecomputersociety.org/ F. Behrang, S. P. Reiss, and A. Orso. 2018. GUIFetch: Supporting App Design and Development through GUI Search. In 2018 IEEE/ACM 5th International Conference on Mobile Software Engineering and Systems (MOBILESoft). IEEE Computer Society, Los Alamitos, CA, USA, 236--246. https://doi.ieeecomputersociety.org/

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3