Localized Centering: Reducing Hubness in Large-Sample Data

Published: 2015-02-21 Issue: 1 Volume: 29
ISSN: 2374-3468
Container-title: Proceedings of the AAAI Conference on Artificial Intelligence
Short-container-title: AAAI

Author:

Hara Kazuo, Suzuki Ikumi, Shimbo Masashi, Kobayashi Kei, Fukumizu Kenji, Radovanović Miloš

Abstract

Hubness has been recently identified as a problematic phenomenon occurring in high-dimensional space. In this paper, we address a different type of hubness that occurs when the number of samples is large. We investigate the difference between the hubness in high-dimensional data and the one in large-sample data. One finding is that centering, which is known to reduce the former, does not work for the latter. We then propose a new hub-reduction method, called localized centering. It is an extension of centering, yet works effectively for both types of hubness. Using real-world datasets consisting of a large number of documents, we demonstrate that the proposed method improves the accuracy of k-nearest neighbor classification.
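
The abstract does not state the formulas, so the following is only a minimal sketch of the ideas it names, under stated assumptions. It assumes hubness is measured as the skewness of the k-occurrence counts N_k, that global centering means subtracting the data centroid before taking inner products, and that localized centering scores a candidate x for a query q as ⟨x, q⟩ − ⟨x, c_κ(x)⟩, where c_κ(x) is the mean of the κ nearest neighbors of x. All function names and the parameter `kappa` are hypothetical and chosen for illustration.

```python
import numpy as np

def normalize_rows(X):
    """Scale each row to unit Euclidean length."""
    norms = np.linalg.norm(X, axis=1, keepdims=True)
    return X / np.maximum(norms, 1e-12)

def k_occurrence(S, k):
    """N_k: how often each point appears in other points' k-NN lists.
    S[i, j] is the similarity of candidate j to query i."""
    S = S.copy()
    np.fill_diagonal(S, -np.inf)            # a point is not its own neighbor
    nn = np.argsort(-S, axis=1)[:, :k]      # top-k candidates per query
    return np.bincount(nn.ravel(), minlength=S.shape[0])

def skewness(counts):
    """Skewness of the N_k distribution (assumed hubness measure)."""
    c = counts - counts.mean()
    return float((c ** 3).mean() / (c ** 2).mean() ** 1.5)

def centered_similarity(X):
    """Global centering: subtract the data centroid, then take inner products."""
    Xc = X - X.mean(axis=0)
    return Xc @ Xc.T

def localized_centering_similarity(X, kappa):
    """Assumed form: Sim(x; q) = <x, q> - <x, c_kappa(x)>."""
    S = X @ X.T
    S_no_self = S.copy()
    np.fill_diagonal(S_no_self, -np.inf)
    nn = np.argsort(-S_no_self, axis=1)[:, :kappa]         # (n, kappa) neighbor ids
    local_centroids = X[nn].mean(axis=1)                   # (n, d)
    affinity = np.einsum("ij,ij->i", X, local_centroids)   # <x, c_kappa(x)>
    # Penalize each candidate (column j) by its own local affinity.
    return S - affinity[np.newaxis, :]

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = normalize_rows(rng.random((500, 20)))   # synthetic data, illustration only
    for name, S in [
        ("inner product", X @ X.T),
        ("centering", centered_similarity(X)),
        ("localized centering", localized_centering_similarity(X, kappa=10)),
    ]:
        print(name, skewness(k_occurrence(S, k=10)))
```

The demo prints the N_k skewness for each similarity on synthetic data. It is not a reproduction of the paper's experiments, which use document datasets, and the values of `k` and `kappa` here are arbitrary.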

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 3 articles.

1. Hubs and Hyperspheres: Reducing Hubness and Improving Transductive Few-Shot Learning with Hyperspherical Embeddings;2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR);2023-06

2. Cross Modal Retrieval with Querybank Normalisation;2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR);2022-06

3. Pre-Trained Word Embedding and Language Model Improve Multimodal Machine Translation: A Case Study in Multi30K;IEEE Access;2022