Named Entity Recognition for Chinese Texts on Marine Coral Reef Ecosystems Based on the BERT-BiGRU-Att-CRF Model

Author:

Zhao Danfeng1ORCID,Chen Xiaolian1,Chen Yan2

Affiliation:

1. College of Information Technology, Shanghai Ocean University, Shanghai 201306, China

2. College of Information Technology, Shanghai Jian Qiao University, Shanghai 201306, China

Abstract

In addressing the challenges of non-standardization and limited annotation resources in Chinese marine domain texts, particularly with complex entities like long and nested entities in coral reef ecosystem-related texts, existing Named Entity Recognition (NER) methods often fail to capture deep semantic features, leading to inefficiencies and inaccuracies. This study introduces a deep learning model that integrates Bidirectional Encoder Representations from Transformers (BERT), Bidirectional Gated Recurrent Units (BiGRU), and Conditional Random Fields (CRF), enhanced by an attention mechanism, to improve the recognition of complex entity structures. The model utilizes BERT to capture context-relevant character vectors, employs BiGRU to extract global semantic features, incorporates an attention mechanism to focus on key information, and uses CRF to produce optimized label sequences. We constructed a specialized coral reef ecosystem corpus to evaluate the model’s performance through a series of experiments. The results demonstrated that our model achieved an F1 score of 86.54%, significantly outperforming existing methods. The contributions of this research are threefold: (1) We designed an efficient named entity recognition framework for marine domain texts, improving the recognition of long and nested entities. (2) By introducing the attention mechanism, we enhanced the model’s ability to recognize complex entity structures in coral reef ecosystem texts. (3) This work offers new tools and perspectives for marine domain knowledge graph construction and study, laying a foundation for future research. These advancements propel the development of marine domain text analysis technology and provide valuable references for related research fields.

Funder

National Natural Science Foundation of China, the Youth Science Foundation Project

Shanghai Science and Technology Commission part of the local university capacity building projects

Publisher

MDPI AG

Reference40 articles.

1. Coral reefs in the Anthropocene;Hughes;Nature,2017

2. Zhao, D., Lou, Y., Song, W., Huang, D., and Wang, X. (Aquac. Fish., 2023). Stability analysis of reef fish communities based on symbiotic graph model, Aquac. Fish., in press.

3. Chinese named entity recognition: The state of the art;Liu;Neurocomputing,2022

4. Liu, C., Zhang, W., Zhao, Y., Luu, A.T., and Bing, L. (2024). Is translation all you need? A study on solving multilingual tasks with large language models. arXiv.

5. Named entity recognition using hidden Markov model (HMM);Morwal;Int. J. Nat. Lang. Comput.,2012

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3