Recognition of Unknown Entities in Specific Financial Field Based on ERNIE-Doc-BiLSTM-CRF

Author:

Xin Li1ORCID,Xiaoyan Hao1

Affiliation:

1. College of Information and Computer, Taiyuan University of Technology, Taiyuan, Shanxi 030000, China

Abstract

The Internet is rich in information related to the financial field. The financial entity information text containing new internet vocabulary has a certain impact on the results of existing recognition algorithms. How to solve the problems of new vocabulary and polysemy is a problem to be solved in the current field. This paper proposes an ERNIE-Doc-BiLSTM-CRF named entity recognition model based on the pretrained language model. Compared with the traditional model, the ERNIE-Doc pretrained language model constructs a unique word vector from the word vector and combines the location coding, which solves polysemy problem well. The intensive skimming mechanism realizes the long text processing well and captures the context information effectively. The experimental results show that the accuracy of this model is 86.72%, the recall rate is 83.39%, and the F1 value is 85.02%, which is 13.36% higher than other models; the recall rate is increased by 13.05%, and the F1 value is increased by 13.21%.

Funder

Key R&D Projects in Shanxi Province

Publisher

Hindawi Limited

Subject

General Mathematics,General Medicine,General Neuroscience,General Computer Science

Reference18 articles.

1. Extracting company names from text

2. A Decision Tree Method for Finding and Classifying Names in Japanese texts;S. Sekine

3. Clinical named entity recognition:ECUST in the CCKS-2017 shared task 2;Y. H. Xia;CEUR Workshop Proceedings,2017

4. Flytxt_NTNU at SemEval-2018 Task 8: Identifying and Classifying Malware Text Using Conditional Random Fields and Naïve Bayes Classifiers

5. Named entity recognition in Chinese electronic medical records based on BERT;L. Li;Journal of Inner Mongolia University of Science and Technology,2020

Cited by 2 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3