Affiliation:
1. School of Cyberspace Security, Hainan University, Haikou 570228, China
Abstract
Toponymic entity recognition is an active research topic in knowledge graph research. Motivated by the national policy on protecting ancient books and by the growth of digital humanities research, this paper proposes a toponymic entity recognition model, ALBERT-Conv1D-BiLSTM-CRF, that fuses a pre-trained language model with local features to address toponymic ambiguity and the differences between ancient and modern grammatical structures in Genglubu texts. The ALBERT module extracts global features; the Conv1D module fuses global and local features; the BiLSTM module performs sequence modeling to capture deep semantics and long-distance dependencies; and the CRF module completes the sequence labeling. Experiments show that, while keeping computational resources and cost in check, the model outperforms the baseline model (ALBERT-BiLSTM-CRF): precision, recall, and F1 increase by 0.74, 1.28, and 1.01 percentage points to 98.08%, 96.67%, and 97.37%, respectively. The model achieves good results on Genglubu texts.
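The reported scores can be sanity-checked with a few lines of arithmetic. The sketch below is not from the paper: it recomputes F1 as the harmonic mean of the reported precision and recall, and derives the baseline scores implied by the reported gains, under the assumption that those gains are absolute percentage points. The names `f1_score`, `reported`, `gains`, and `implied_baseline` are illustrative only.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (same units as the inputs)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Scores reported in the abstract for ALBERT-Conv1D-BiLSTM-CRF, in percent.
reported = {"precision": 98.08, "recall": 96.67, "f1": 97.37}

# Reported gains over ALBERT-BiLSTM-CRF.
# ASSUMPTION: these are absolute percentage points, not relative changes.
gains = {"precision": 0.74, "recall": 1.28, "f1": 1.01}

# F1 recomputed from the reported precision and recall.
recomputed_f1 = f1_score(reported["precision"], reported["recall"])

# Baseline scores implied by subtracting each gain from the reported score.
implied_baseline = {k: round(reported[k] - gains[k], 2) for k in reported}

print(f"F1 recomputed from P and R: {recomputed_f1:.2f}")
print("Implied baseline scores:", implied_baseline)
```

Under that assumption, the recomputed F1 rounds to 97.37, which agrees with the reported value, and the implied baseline scores are 97.34 (precision), 95.39 (recall), and 96.36 (F1).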
Funder
National Natural Science Foundation of China
Hainan Province Key R&D plan project
Subject
Electrical and Electronic Engineering, Computer Networks and Communications, Hardware and Architecture, Signal Processing, Control and Systems Engineering