Weakly Supervised Sequence Tagging from Noisy Rules

Published: 2020-04-03 Issue: 04 Volume: 34 Page: 5570-5578
ISSN: 2374-3468
Container-title: Proceedings of the AAAI Conference on Artificial Intelligence
Short-container-title: AAAI

Author:

Esteban Safranchik, Shiying Luo, Stephen Bach

Abstract

We propose a framework for training sequence tagging models with weak supervision consisting of multiple heuristic rules of unknown accuracy. In addition to supporting rules that vote on tags in the output sequence, we introduce a new type of weak supervision, called linking rules, that vote on how sequence elements should be grouped into spans with the same tag. These rules are an alternative to candidate span generators that require significantly more human effort. To estimate the accuracies of the rules and combine their conflicting outputs into training data, we introduce a new type of generative model, linked hidden Markov models (linked HMMs), and prove they are generically identifiable (up to a tag permutation) without any observed training labels. We find that linked HMMs provide an average 7 F1 point boost on benchmark named entity recognition tasks versus generative models that assume the tags are i.i.d. Further, neural sequence taggers trained with these structure-aware generative models outperform comparable state-of-the-art approaches to weak supervision by an average of 2.6 F1 points.
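
To make the two kinds of weak supervision in the abstract concrete, the sketch below shows tagging rules that vote on per-token tags and a linking rule that votes on whether a token shares a tag with its predecessor. It is an illustration only and not the paper's method: the linked HMM estimates rule accuracies with a generative model, whereas this sketch swaps in a plain majority vote followed by left-to-right propagation over links. All rule names, the tag set, and the example sentence are hypothetical.

```python
from collections import Counter

ABSTAIN = None  # a rule with no opinion on a token returns this


def tag_first_names(tokens):
    """Tagging rule (hypothetical): vote 'PER' for tokens in a tiny name list."""
    names = {"Alice"}
    return ["PER" if t in names else ABSTAIN for t in tokens]


def tag_lowercase(tokens):
    """Tagging rule (hypothetical): vote 'O' (outside) for all-lowercase tokens."""
    return ["O" if t.islower() else ABSTAIN for t in tokens]


def link_capitalized_pairs(tokens):
    """Linking rule (hypothetical): vote that token i has the same tag as
    token i-1 when both are capitalized. The first token has no predecessor."""
    votes = [ABSTAIN]
    for prev, cur in zip(tokens, tokens[1:]):
        both_cap = prev[0].isupper() and cur[0].isupper()
        votes.append(True if both_cap else ABSTAIN)
    return votes


def combine(tokens, tagging_rules, linking_rules):
    """Majority vote over tagging rules, then copy a tag across any link
    to a token that no tagging rule covered. Not a linked HMM."""
    tag_votes = [rule(tokens) for rule in tagging_rules]
    link_votes = [rule(tokens) for rule in linking_rules]

    tags = []
    for i in range(len(tokens)):
        counts = Counter(v[i] for v in tag_votes if v[i] is not ABSTAIN)
        tags.append(counts.most_common(1)[0][0] if counts else ABSTAIN)

    for i in range(1, len(tokens)):
        linked = any(v[i] is True for v in link_votes)
        if tags[i] is ABSTAIN and linked:
            tags[i] = tags[i - 1]
    return tags


tokens = ["Alice", "Smith", "visited", "Paris"]
result = combine(tokens, [tag_first_names, tag_lowercase], [link_capitalized_pairs])
print(result)
```

Here no tagging rule covers "Smith", but the linking rule groups it with "Alice", so it inherits that token's tag; "Paris" stays unlabeled because nothing votes on it. A majority vote treats every rule as equally reliable, which is exactly the assumption the paper's generative model is meant to avoid.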

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 25 articles.

1. KGRED: Knowledge-graph-based rule discovery for weakly supervised data labeling;Information Processing & Management;2024-09

2. WELL: Applying bug detectors to bug localization via weakly supervised learning;Journal of Software: Evolution and Process;2024-04-23

3. Language Models in the Loop: Incorporating Prompting into Weak Supervision;ACM / IMS Journal of Data Science;2024-04-08

4. Leveraging Large Language Models for Structure Learning in Prompted Weak Supervision;2023 IEEE International Conference on Big Data (BigData);2023-12-15

5. Neural-Hidden-CRF: A Robust Weakly-Supervised Sequence Labeler;Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2023-08-04