DePNR: A DeBERTa‐based deep learning model with complete position embedding for place name recognition from geographical literature

Author:

Li Weirong12,Sun Kai3,Wang Shu4ORCID,Zhu Yunqiang45,Dai Xiaoliang46,Hu Lei46

Affiliation:

1. College of Environment and Resources Guangxi Normal University Guilin China

2. Guangxi Key Laboratory of Environmental Processes and Remediation in Ecologically Fragile Regions Guangxi Normal University Guilin China

3. GeoAI Lab, Department of Geography University at Buffalo Buffalo New York USA

4. State Key Laboratory of Resources and Environmental Information System Institute of Geographic Sciences and Natural Resources Research, Chinese Academy of Sciences Beijing China

5. Jiangsu Center for Collaborative Innovation in Geographical Information Resource Development and Application Nanjing China

6. College of Resources and Environment University of Chinese Academy of Sciences Beijing China

Abstract

AbstractPlace names play an important role in linking physical places to human perception and are highly frequently used in the daily lives of people to refer to places in natural language. However, many place names may not be recorded in typical gazetteers due to their new establishment, colloquial nature, and different concerns. These unrecorded toponyms are often discussed in geographical literature; thus, it is necessary to automatically identify them from geographical literature and update existing gazetteers using computational approaches. Currently, the most advanced approaches are deep learning‐based models. However, existing models used only partial position information rather than complete position information of words in a sentence, which limits their performance in recognizing toponyms. To this end, we develop DePNR, a DeBERTa‐based deep learning model with complete position embedding for place name recognition from geographical literature. We train DePNR on two datasets and test it on a real dataset from geographical literature to evaluate its performance. The results show that DePNR achieves an F‐score of 0.8282, outperforming previous approaches, and can recognize new toponyms from literature text, potentially enriching existing gazetteers.

Funder

National Key Research and Development Program of China

Publisher

Wiley

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3