Efficient rank aggregation using partial data

Published:2012-06-07 Issue:1 Volume:40 Page:355-366
ISSN:0163-5999
Container-title:ACM SIGMETRICS Performance Evaluation Review
language:en
Short-container-title:SIGMETRICS Perform. Eval. Rev.

Author:

Ammar Ammar¹, Shah Devavrat¹

Affiliation:

1. Massachusetts Institute of Technology, Cambridge, MA, USA

Abstract

The need to rank items based on user input arises in many practical applications such as elections, group decision making and recommendation systems. The primary challenge in such scenarios is to decide on a global ranking based on partial preferences provided by users. The standard approach to address this challenge is to ask users to provide explicit numerical ratings (cardinal information) of a subset of the items. The main appeal of such an approach is the ease of aggregation. However, the rating scale as well as the individual ratings are often arbitrary and may not be consistent from one user to another. A more natural alternative to numerical ratings requires users to compare pairs of items (ordinal information). On the one hand, such comparisons provide an "absolute" indicator of the user's preference. On the other hand, it is often hard to combine or aggregate these comparisons to obtain a consistent global ranking. In this work, we provide a tractable framework for utilizing comparison data as well as first-order marginal information (see Section 2) for the purpose of ranking. We treat the available information as partial samples from an unknown distribution over permutations. We then reduce ranking problems of interest to performing inference on this distribution. Specifically, we consider the problems of (a) finding an aggregate ranking of n items, (b) learning the mode of the distribution, and (c) identifying the top k items. For many of these problems, we provide efficient algorithms to infer the ranking directly from the data without the need to estimate the underlying distribution. In other cases, we use the Principle of Maximum Entropy to devise a concise parameterization of a distribution consistent with observations using only O(n²) parameters, where n is the number of items in question. We propose a distributed, iterative algorithm for estimating the parameters of the distribution. We establish the correctness of the algorithm and identify its rate of convergence explicitly.
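
To make the idea of ranking directly from comparison data concrete, here is a minimal sketch. It is not taken from the paper: it assumes each observation is a (winner, loser) pair and scores every item by its average empirical win rate against the other items (a Borda-style heuristic). The function name `aggregate_ranking` and the sample data are hypothetical.

```python
from collections import defaultdict

def aggregate_ranking(comparisons, items):
    """Rank items by average empirical pairwise win rate (illustrative heuristic)."""
    wins = defaultdict(int)  # wins[(i, j)]: number of times i was preferred to j
    for winner, loser in comparisons:
        wins[(winner, loser)] += 1

    scores = {}
    for i in items:
        rates = []
        for j in items:
            if i == j:
                continue
            total = wins[(i, j)] + wins[(j, i)]
            if total > 0:  # skip pairs that were never compared
                rates.append(wins[(i, j)] / total)
        scores[i] = sum(rates) / len(rates) if rates else 0.0

    # Highest score first; ties are broken by item name for determinism.
    return sorted(items, key=lambda i: (-scores[i], i))

# Hypothetical comparison data: each tuple is (preferred item, other item).
comparisons = [("a", "b"), ("a", "c"), ("b", "c"), ("a", "b"), ("c", "b")]
ranking = aggregate_ranking(comparisons, ["a", "b", "c"])
top_k = ranking[:2]  # a top-k list is just a prefix of the aggregate ranking
```

Under this heuristic no distribution over permutations is estimated; the score only needs the pairwise counts, which is the sense in which the ranking is inferred directly from the data.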

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications,Hardware and Architecture,Software

Link

https://dl.acm.org/doi/pdf/10.1145/2318857.2254799

References: 30 articles.

1. MIT150 celebrations. http://mit150.mit.edu.

2. Who had the "worst year in Washington"? http://voices.washingtonpost.com/thefix/worst-week-in-washington/worst-%year-in-washington.html.

3. Generalization bounds for the area under the roc curve;Agarwal S.;Journal of Machine Learning Research,2006

4. Parimutuel Betting on Permutations

Cited by 10 articles.

1. Lagrangian Inference for Ranking Problems;Operations Research;2023-01

2. Knowledge of the temporal structure of events in relation to autistic traits and social ability;Acta Psychologica;2022-11

3. A System-Level Analysis of Conference Peer Review;Proceedings of the 23rd ACM Conference on Economics and Computation;2022-07-12

4. Ordinal UNLOC: Target Localization With Noisy and Incomplete Distance Measures;IEEE Internet of Things Journal;2021-12-01

5. Peer Grading the Peer Reviews: A Dual-Role Approach for Lightening the Scholarly Paper Review Process;Proceedings of the Web Conference 2021;2021-04-19