RENAR

Author:

Zaghouani Wajdi¹

Affiliation:

1. University of Pennsylvania

Abstract

Named entity recognition supports many natural language processing tasks, such as information retrieval, machine translation, and question answering. Many researchers have addressed name identification in a variety of languages, and some recent research efforts have begun to focus on named entity recognition for Arabic. We present a working Arabic information extraction (IE) system that analyzes large volumes of news text every day, extracting the named entity (NE) types person, organization, location, date, and number, as well as quotations (direct reported speech) by and about people. The named entity recognition (NER) system was not developed specifically for Arabic; instead, a multilingual NER system was adapted to cover Arabic as well. Arabic, a Semitic language, differs substantially from the Indo-European and Finno-Ugric languages the system currently covers. This article therefore describes which Arabic-specific resources had to be developed and which changes to the rule set were needed to make it applicable to Arabic. The evaluation results are generally satisfactory but could be improved for certain entity types.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Cited by 27 articles.

1. TTK: A toolkit for Tunisian linguistic analysis;Computer Speech & Language;2024-06

2. Comparing Open Arabic Named Entity Recognition Tools;2023 IEEE 24th International Conference on Information Reuse and Integration for Data Science (IRI);2023-08

3. A Novel Named Entity Recognition Mehod for Bus Route Identification in Social Media;2023 2nd International Conference on Machine Learning, Cloud Computing and Intelligent Mining (MLCCIM);2023-07-25

4. Research on Entity recognition of Chinese place Names based on BERT;2023 IEEE 6th Information Technology,Networking,Electronic and Automation Control Conference (ITNEC);2023-02-24

5. ATPM-REAP: A Simple and Efficient Address Tracking and Parsing for Vietnamese Real Estate Advertisement Posts;2022 14th International Conference on Knowledge and Systems Engineering (KSE);2022-10-19
