Positive-Unlabeled Learning from Imbalanced Data-Reference-Cited by-同舟云学术

Positive-Unlabeled Learning from Imbalanced Data

Published:2021-08 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the Thirtieth International Joint Conference on Artificial Intelligence
language:
Short-container-title:

Author:

Su Guangxin¹,Chen Weitong¹,Xu Miao¹²

Affiliation:

1. The University of Queensland

2. RIKEN AIP

Abstract

Positive-unlabeled (PU) learning deals with the binary classification problem when only positive (P) and unlabeled (U) data are available, without negative (N) data. Existing PU methods perform well on the balanced dataset. However, in real applications such as financial fraud detection or medical diagnosis, data are always imbalanced. It remains unclear whether existing PU methods can perform well on imbalanced data. In this paper, we explore this problem and propose a general learning objective for PU learning targeting specially at imbalanced data. By this general learning objective, state-of-the-art PU methods based on optimizing a consistent risk can be adapted to conquer the imbalance. We theoretically show that in expectation, optimizing our learning objective is equivalent to learning a classifier on the oversampled balanced data with both P and N data available, and further provide an estimation error bound. Finally, experimental results validate the effectiveness of our proposal compared to state-of-the-art PU methods.

Publisher

International Joint Conferences on Artificial Intelligence Organization

Cited by 20 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Escaping the neutralization effect of modality features fusion in multimodal Fake News Detection;Information Fusion;2024-11

2. A Recommendation Method Based on Fusion of Graph Neural Network Single-Layer Mixing Negative Samples;2024-06-27

3. A Novel Classification Method: Neighborhood-Based Positive Unlabeled Learning Using Decision Tree (NPULUD);Entropy;2024-05-04

4. Dense-PU: Learning a Density-Based Boundary for Positive and Unlabeled Learning;IEEE Access;2024

5. A Quantum-Inspired Direct Learning Strategy for Positive and Unlabeled Data;International Journal of Computational Intelligence Systems;2023-12-06