Merging Statistical Feature via Adaptive Gate for Improved Text Classification-Reference-Cited by-同舟云学术

Merging Statistical Feature via Adaptive Gate for Improved Text Classification

Published:2021-05-18 Issue:15 Volume:35 Page:13288-13296
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Li Xianming,Li Zongxi,Xie Haoran,Li Qing

Abstract

Currently, text classification studies mainly focus on training classifiers by using textual input only, or enhancing semantic features by introducing external knowledge (e.g., hand-craft lexicons and domain knowledge). In contrast, some intrinsic statistical features of the corpus, like word frequency and distribution over labels, are not well exploited. Compared with external knowledge, the statistical features are deterministic and naturally compatible with corresponding tasks. In this paper, we propose an Adaptive Gate Network (AGN) to consolidate semantic representation with statistical features selectively. In particular, AGN encodes statistical features through a variational component and merges information via a well-designed valve mechanism. The valve adapts the information flow into the classifier according to the confidence of semantic features in decision making, which can facilitate training a robust classifier and can address the overfitting caused by using statistical features. Extensive experiments on datasets of various scales show that, by incorporating statistical information, AGN can improve the classification performance of CNN, RNN, Transformer, and Bert based models effectively. The experiments also indicate the robustness of AGN against adversarial attacks of manipulating statistical information.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 15 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Knowledge Graph-Based Hierarchical Text Semantic Representation;International Journal of Intelligent Systems;2024-01-12

2. Event assigning based on hierarchical features and enhanced association for Chinese mayor's hotline;Computational Intelligence;2024-01-04

3. Sentence-level sentiment analysis based on supervised gradual machine learning;Scientific Reports;2023-09-04

4. Contrastive Learning Models for Sentence Representations;ACM Transactions on Intelligent Systems and Technology;2023-06-15

5. URM4DMU: An User Representation Model for Darknet Markets Users;ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2023-06-04