Counterfactual Explanation of Shapley Value in Data Coalitions-Reference-Cited by-同舟云学术

Counterfactual Explanation of Shapley Value in Data Coalitions

Published:2024-07 Issue:11 Volume:17 Page:3332-3345
ISSN:2150-8097
Container-title:Proceedings of the VLDB Endowment
language:en
Short-container-title:Proc. VLDB Endow.

Author:

Si Michelle¹,Pei Jian¹

Affiliation:

1. Duke University, Durham, NC, USA

Abstract

The Shapley value is widely used for data valuation in data markets. However, explaining the Shapley value of an owner in a data coalition is an unexplored and challenging task. To tackle this, we formulate the problem of finding the counterfactual explanation of Shapley value in data coalitions. Essentially, given two data owners A and B such that A has a higher Shapley value than B , a counter-factual explanation is a smallest subset of data entries in A such that transferring the subset from A to B makes the Shapley value of A less than that of B. We show that counterfactual explanations always exist, but finding an exact counterfactual explanation is NP-hard. Using Monte Carlo estimation to approximate counterfactual explanations directly according to the definition is still very costly, since we have to estimate the Shapley values of owners A and B after each possible subset shift. We develop a series of heuristic techniques to speed up computation by estimating differential Shapley values, computing the power of singular data entries, and shifting subsets greedily, culminating in the SV-Exp algorithm. Our experimental results on real datasets clearly demonstrate the efficiency of our method and the effectiveness of counterfactuals in interpreting the Shapley value of an owner.

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.14778/3681954.3682004

Reference62 articles.

1. The Economics of Privacy

2. A Marketplace for Data

3. Recommender Systems

4. Privacy-Preserving Data Mining: A Survey

5. Priced Oblivious Transfer: How to Sell Digital Goods