Affiliation:
1. School of Sports Engineering, Beijing Sport University, Beijing 100084, China
Abstract
Football is one of the most popular sports in the world, arousing a wide range of research topics related to its off- and on-the-pitch performance. The extraction of football entities from football news helps to construct sports frameworks, integrate sports resources, and timely capture the dynamics of the sports through visual text mining results, including the connections among football players, football clubs, and football competitions, and it is of great convenience to observe and analyze the developmental tendencies of football. Therefore, in this paper, we constructed a 1000,000-word Chinese corpus in the field of football and proposed a BiLSTM-based model for named entity recognition. The ALBERT-BiLSTM combination model of deep learning is used for entity extraction of football textual data. Based on the BiLSTM model, we introduced ALBERT as a pre-training model to extract character and enhance the generalization ability of word embedding vectors. We then compared the results of two different annotation schemes, BIO and BIOE, and two deep learning models, ALBERT-BiLSTM-CRF and ALBERT BiLSTM. It was verified that the BIOE tagging was superior than BIO, and the ALBERT-BiLSTM model was more suitable for football datasets. The precision, recall, and F-Score of the model were 85.4%, 83.47%, and 84.37%, correspondingly.
Funder
National Key Research and Development Program of China
National Natural Science Foundation of China
Fundamental Research Funds for the Central Universities of China
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference53 articles.
1. A review of Chinese named entity recognition;Cheng;KSII Trans. Internet Inf. Syst.,2021
2. Biomedical Relation Extraction: From Binary to Complex;Zhou;Comput. Math. Method Med.,2014
3. A Review on the Application of Knowledge Graph Technology in the Medical Field;Qu;Sci. Program.,2022
4. Transformer-Based Models for Automatic Identification of Argument Relations: A Cross-Domain Evaluation;Alemany;IEEE Intell. Syst.,2021
5. A Hybrid Approach for Persian Named Entity Recognition;Moradi;Iran. J. Sci. Technol. Trans. A-Sci.,2017
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献