CrowdTC: Crowd-powered Learning for Text Classification-Reference-Cited by-同舟云学术

CrowdTC: Crowd-powered Learning for Text Classification

Published:2021-07-03 Issue:1 Volume:16 Page:1-23
ISSN:1556-4681
Container-title:ACM Transactions on Knowledge Discovery from Data
language:en
Short-container-title:ACM Trans. Knowl. Discov. Data

Author:

Yang Keyu¹,Gao Yunjun¹,Liang Lei¹,Bian Song¹,Chen Lu¹,Zheng Baihua²

Affiliation:

1. Zhejiang University, China

2. Singapore Management University, Singapore

Abstract

Text classification is a fundamental task in content analysis. Nowadays, deep learning has demonstrated promising performance in text classification compared with shallow models. However, almost all the existing models do not take advantage of the wisdom of human beings to help text classification. Human beings are more intelligent and capable than machine learning models in terms of understanding and capturing the implicit semantic information from text. In this article, we try to take guidance from human beings to classify text. We propose Crowd-powered learning for Text Classification (CrowdTC for short). We design and post the questions on a crowdsourcing platform to extract keywords in text. Sampling and clustering techniques are utilized to reduce the cost of crowdsourcing. Also, we present an attention-based neural network and a hybrid neural network to incorporate the extracted keywords as human guidance into deep neural networks. Extensive experiments on public datasets confirm that CrowdTC improves the text classification accuracy of neural networks by using the crowd-powered keyword guidance.

Funder

National Key R&D Program of China

NSFC

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3457216

Reference67 articles.

1. Selectivity-Based Keyword Extraction Method

2. YAKE! Keyword extraction from single documents using multiple local features

3. Citation-Enhanced Keyphrase Extraction from Research Papers: A Supervised Approach

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Challenges and Opportunities of Text-Based Emotion Detection: A Survey;IEEE Access;2024

2. Text classification using deep learning techniques: a bibliometric analysis and future research directions;Benchmarking: An International Journal;2023-08-18

3. A Method of Sustainable Development for Three Chinese Short-Text Datasets Based on BERT-CAM;Electronics;2023-03-24

4. Recruitment Fraud Detection Method Based on Crowdsourcing and Multi-feature Fusion;2022 5th International Conference on Artificial Intelligence and Big Data (ICAIBD);2022-05-27

5. Robust Multimodal Sentiment Analysis Via Tag Encoding of Uncertain Missing Modalities;IEEE Transactions on Multimedia;2022