Natural language processing for humanitarian action: Opportunities, challenges, and the path toward humanitarian NLP

Author:

Rocca Roberta,Tamagnone Nicolò,Fekih Selim,Contla Ximena,Rekabsaz Navid

Abstract

Natural language processing (NLP) is a rapidly evolving field at the intersection of linguistics, computer science, and artificial intelligence, which is concerned with developing methods to process and generate language at scale. Modern NLP tools have the potential to support humanitarian action at multiple stages of the humanitarian response cycle. Both internal reports, secondary text data (e.g., social media data, news media articles, or interviews with affected individuals), and external-facing documents like Humanitarian Needs Overviews (HNOs) encode information relevant to monitoring, anticipating, or responding to humanitarian crises. Yet, lack of awareness of the concrete opportunities offered by state-of-the-art techniques, as well as constraints posed by resource scarcity, limit adoption of NLP tools in the humanitarian sector. This paper provides a pragmatically-minded primer to the emerging field of humanitarian NLP, reviewing existing initiatives in the space of humanitarian NLP, highlighting potentially impactful applications of NLP in the humanitarian sector, and describing criteria, challenges, and potential solutions for large-scale adoption. In addition, as one of the main bottlenecks is the lack of data and standards for this domain, we present recent initiatives (the DEEP and HumSet) which are directly aimed at addressing these gaps. With this work, we hope to motivate humanitarians and NLP experts to create long-term impact-driven synergies and to co-develop an ambitious roadmap for the field.

Publisher

Frontiers Media SA

Subject

Artificial Intelligence,Information Systems,Computer Science (miscellaneous)

Reference58 articles.

1. “Crisisbench: Benchmarking crisis-related social media datasets for humanitarian information processing,”;Alam,2021

2. Scibert: a pretrained language model for scientific text;Beltagy;arXiv preprint,2019

3. “On the dangers of stochastic parrots: can language models be too big?,”;Bender,2021

4. Language (technology) is power: a critical survey of “bias” in NLP;Blodgett;arXiv Preprint,2020

5. Enriching word vectors with subword information;Bojanowski;arXiv Preprint,2016

Cited by 4 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Exploring the role of large language models in radiation emergency response;Journal of Radiological Protection;2024-02-15

2. Using machine learning techniques to investigate learner engagement with TikTok media literacy campaigns;Journal of Research on Technology in Education;2024-01-02

3. Information Technology of Transcribing Ukrainian-Language Content Based on Deep Learning;2023 IEEE 18th International Conference on Computer Science and Information Technologies (CSIT);2023-10-19

4. Universal skepticism of ChatGPT: a review of early literature on chat generative pre-trained transformer;Frontiers in Big Data;2023-08-23

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3