Automated Topic Categorisation of Citizens’ Contributions: Reducing Manual Labelling Efforts Through Active Learning

Authors:

Julia Romberg, Tobias Escher

Abstract

Political authorities in democratic countries regularly consult the public on specific issues, but subsequently evaluating the contributions requires substantial human resources, often leading to inefficiencies and delays in the decision-making process. Among the solutions proposed is to support human analysts by thematically grouping the contributions through automated means. While supervised machine learning would naturally lend itself to the task of classifying citizens’ proposals according to certain predefined topics, the amount of training data required is often prohibitive given the idiosyncratic nature of most public participation processes. One potential solution to minimise the amount of training data is the use of active learning. While this semi-supervised procedure has proliferated in recent years, these promising approaches have never been applied to the evaluation of participation contributions. Therefore, we utilise data from online participation processes in three German cities, provide classification baselines, and subsequently assess how different active learning strategies can reduce manual labelling efforts while maintaining good model performance. Our results show not only that supervised machine learning models can reliably classify topic categories for public participation contributions, but also that active learning significantly reduces the amount of training data required. This has important implications for the practice of public participation because it dramatically cuts the time required for evaluation, which benefits in particular processes with a larger number of contributions.
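
The pool-based active learning loop that the abstract refers to can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not name the classifiers or query strategies evaluated, so the naive Bayes model, the least-confidence query strategy, the function names, and the toy contributions below are all assumptions made purely for illustration.

```python
import math
from collections import Counter, defaultdict


def train_nb(labelled):
    """Fit a multinomial naive Bayes model on (text, topic) pairs."""
    word_counts = defaultdict(Counter)
    class_counts = Counter()
    vocab = set()
    for text, topic in labelled:
        tokens = text.lower().split()
        word_counts[topic].update(tokens)
        class_counts[topic] += 1
        vocab.update(tokens)
    return word_counts, class_counts, vocab


def predict_proba(model, text):
    """Return a dict mapping each known topic to its posterior probability."""
    word_counts, class_counts, vocab = model
    total_docs = sum(class_counts.values())
    log_scores = {}
    for topic in class_counts:
        total_words = sum(word_counts[topic].values())
        score = math.log(class_counts[topic] / total_docs)
        for token in text.lower().split():
            # Laplace smoothing; the extra +1 reserves mass for unseen tokens.
            score += math.log(
                (word_counts[topic][token] + 1) / (total_words + len(vocab) + 1)
            )
        log_scores[topic] = score
    top = max(log_scores.values())
    exp_scores = {t: math.exp(s - top) for t, s in log_scores.items()}
    norm = sum(exp_scores.values())
    return {t: v / norm for t, v in exp_scores.items()}


def least_confident(model, pool):
    """Index of the pool text whose most likely topic has the lowest probability."""
    return min(
        range(len(pool)),
        key=lambda i: max(predict_proba(model, pool[i]).values()),
    )


def active_learning(seed, pool, oracle, budget):
    """Retrain, query the least certain text, ask the oracle, and repeat."""
    labelled = list(seed)
    pool = list(pool)
    for _ in range(min(budget, len(pool))):
        model = train_nb(labelled)
        text = pool.pop(least_confident(model, pool))
        labelled.append((text, oracle(text)))  # the human analyst labels here
    return train_nb(labelled), labelled


# Hypothetical toy contributions; the real data come from three German cities.
TRUE_TOPICS = {
    "more bike lanes on main street": "transport",
    "plant trees in the park": "environment",
    "the bus runs too rarely at night": "transport",
    "protect the park from construction": "environment",
    "bike parking at the station": "transport",
    "less noise and cleaner air": "environment",
}
texts = list(TRUE_TOPICS)
seed = [(t, TRUE_TOPICS[t]) for t in texts[:2]]
model, labelled = active_learning(seed, texts[2:], TRUE_TOPICS.get, budget=3)
print(len(labelled), "labelled contributions")
```

In this sketch the oracle is a dictionary lookup standing in for the human analyst. The labelling saving the abstract describes would come from stopping once the budget is spent, instead of annotating the entire pool.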

Publisher

Springer International Publishing

Cited by 5 articles.

1. Enhancing the design of voting advice applications with BERT language model. Frontiers in Artificial Intelligence, 2024-08-06

2. A Multi-Label Classifier for Online Petition Systems. Proceedings of the 25th Annual International Conference on Digital Government Research, 2024-06-11

3. Making Sense of Citizens’ Input through Artificial Intelligence. Digital Government: Research and Practice, 2023-06-03

4. Evaluating Prototypes and Criticisms for Explaining Clustered Contributions in Digital Public Participation Processes. Communications in Computer and Information Science, 2023

5. Residents’ Voices on Proposals. Electronic Participation, 2023
