Multiple weak supervision for short text classification-Reference-Cited by-同舟云学术

Multiple weak supervision for short text classification

Published:2022-01-01 Issue:8 Volume:52 Page:9101-9116
ISSN:0924-669X
Container-title:Applied Intelligence
language:en
Short-container-title:Appl Intell

Author:

Chen Li-Ming^ORCID,Xiu Bao-Xin,Ding Zhao-Yun

Abstract

AbstractFor short text classification, insufficient labeled data, data sparsity, and imbalanced classification have become three major challenges. For this, we proposed multiple weak supervision, which can label unlabeled data automatically. Different from prior work, the proposed method can generate probabilistic labels through conditional independent model. What’s more, experiments were conducted to verify the effectiveness of multiple weak supervision. According to experimental results on public dadasets, real datasets and synthetic datasets, unlabeled imbalanced short text classification problem can be solved effectively by multiple weak supervision. Notably, without reducingprecision,recall, andF1-scorecan be improved by adding distant supervision clustering, which can be used to meet different application needs.

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence

Link

https://link.springer.com/content/pdf/10.1007/s10489-021-02958-3.pdf

Reference91 articles.

1. Ratner A, et al. (2017) Snorkel: Rapid Training Data Creation with Weak Supervision. Proc VLDB Endowment 11(3):269–282

2. Sun C, et al. (2017) Revisiting Unreasonable Effectiveness of Data in Deep Learning Era. In: 2017 IEEE International Conference on Computer Vision (ICCV)

3. Bach SH, et al. (2019) Snorkel DryBell: A Case Study in Deploying Weak Supervision at Industrial Scale. Proc ACM SIGMOD Int Conf Manag Data 2019:362–375

4. Zhou Z (2018) A brief introduction to weakly supervised learning. Ntl Sci Rev 5(1):44–53

5. Ratner A, et al. (2016) Data Programming: Creating Large Training Sets, Quickly. Adv Neural Inf Process Syst 29:3567–3575

Cited by 25 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A two-stage clustering ensemble algorithm applicable to risk assessment of railway signaling faults;Expert Systems with Applications;2024-09

2. Modeling of Micro Aluminum Particle Flames Using Particle Burning Time;Combustion Science and Technology;2024-07-29

3. Knowledge and separating soft verbalizer based prompt-tuning for multi-label short text classification;Applied Intelligence;2024-06-19

4. Short text classification using semantically enriched topic model;Journal of Information Science;2024-03-20

5. A Deep Learning Short Text Classification Model Integrating Part of Speech Features;2024 4th International Conference on Neural Networks, Information and Communication (NNICE);2024-01-19