COSTELLO: Contrastive Testing for Embedding-Based Large Language Model as a Service Embeddings-Reference-Cited by-同舟云学术

COSTELLO: Contrastive Testing for Embedding-Based Large Language Model as a Service Embeddings

Published:2024-07-12 Issue:FSE Volume:1 Page:906-928
ISSN:2994-970X
Container-title:Proceedings of the ACM on Software Engineering
language:en
Short-container-title:Proc. ACM Softw. Eng.

Author:

Jiang Weipeng¹^ORCID,Zhai Juan²^ORCID,Ma Shiqing²^ORCID,Zhang Xiaoyu¹^ORCID,Shen Chao¹^ORCID

Affiliation:

1. Xi'an Jiaotong University, Xi'an, China

2. University of Massachusetts Amherst, Amherst, USA

Abstract

Large language models have gained significant popularity and are often provided as a service (i.e., LLMaaS). Companies like OpenAI and Google provide online APIs of LLMs to allow downstream users to create innovative applications. Despite its popularity, LLM safety and quality assurance is a well-recognized concern in the real world, requiring extra efforts for testing these LLMs. Unfortunately, while end-to-end services like ChatGPT have garnered rising attention in terms of testing, the LLMaaS embeddings have comparatively received less scrutiny. We state the importance of testing and uncovering problematic individual embeddings without considering downstream applications. The abstraction and non-interpretability of embedded vectors, combined with the black-box inaccessibility of LLMaaS, make testing a challenging puzzle. This paper proposes COSTELLO, a black-box approach to reveal potential defects in abstract embedding vectors from LLMaaS by contrastive testing . Our intuition is that high-quality LLMs can adequately capture the semantic relationships of the input texts and properly represent their relationships in the high-dimensional space. For the given interface of LLMaaS and seed inputs, COSTELLO can automatically generate test suites and output words with potential problematic embeddings. The idea is to synthesize contrastive samples with guidance, including positive and negative samples, by mutating seed inputs. Our synthesis guide will leverage task-specific properties to control the mutation procedure and generate samples with known partial relationships in the high-dimensional space. Thus, we can compare the expected relationship (oracle) and embedding distance (output of LLMs) to locate potential buggy cases. We evaluate COSTELLO on 42 open-source (encoder-based) language models and two real-world commercial LLMaaS. Experimental results show that COSTELLO can effectively detect semantic violations, where more than 62% of violations on average result in erroneous behaviors (e.g., unfairness) of downstream applications.

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3643767

Reference74 articles.

1. 2021. Embedding API-Build faster great AI algorithms for HR.. https://hrflow.ai/embedding//

2. 2022. Introducing Text and Code Embeddings in the OpenAI API. https://openai.com/blog/introducing-text-and-code-embeddings/

3. 2023. ChatGPT could be used for good but like many other AI models it’s rife with racist and discriminatory bias. https://www.insider.com/chatgpt-is-like-many-other-ai-models-rife-with-bias-2023-1

4. 2023. DeepL. https://www.deepl.com/

5. 2023. Hugging Face. https://huggingface.co/