An Improved Word Representation for Deep Learning Based NER in Indian Languages-Reference-Cited by-同舟云学术

An Improved Word Representation for Deep Learning Based NER in Indian Languages

Published:2019-05-30 Issue:6 Volume:10 Page:186
ISSN:2078-2489
Container-title:Information
language:en
Short-container-title:Information

Author:

A P Ajees^ORCID,K Manju,Mary Idicula Sumam

Abstract

Named Entity Recognition (NER) is the process of identifying the elementary units in a text document and classifying them into predefined categories such as person, location, organization and so forth. NER plays an important role in many Natural Language Processing applications like information retrieval, question answering, machine translation and so forth. Resolving the ambiguities of lexical items involved in a text document is a challenging task. NER in Indian languages is always a complex task due to their morphological richness and agglutinative nature. Even though different solutions were proposed for NER, it is still an unsolved problem. Traditional approaches to Named Entity Recognition were based on the application of hand-crafted features to classical machine learning techniques such as Hidden Markov Model (HMM), Support Vector Machine (SVM), Conditional Random Field (CRF) and so forth. But the introduction of deep learning techniques to the NER problem changed the scenario, where the state of art results have been achieved using deep learning architectures. In this paper, we address the problem of effective word representation for NER in Indian languages by capturing the syntactic, semantic and morphological information. We propose a deep learning based entity extraction system for Indian languages using a novel combined word representation, including character-level, word-level and affix-level embeddings. We have used ‘ARNEKT-IECSIL 2018’ shared data for training and testing. Our results highlight the improvement that we obtained over the existing pre-trained word representations.

Publisher

MDPI AG

Subject

Information Systems

Link

https://www.mdpi.com/2078-2489/10/6/186/pdf

Reference64 articles.

1. Survey of Named Entity Recognition Systems with respect to Indian and Foreign Languages

2. Named Entity Identifier for Malayalam Using Linguistic Principles Employing Statistical Methods;Bindu;Int. J. Comput. Sci. Issues,2011

3. Statistical Arabic Name Entity Recognition Approaches: A Survey

4. Semantic processing of multimedia data for e-government applications

Cited by 9 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Named Entity Recognition in Bengali and Hindi Using MuRIL and Conditional Random Fields;SN Computer Science;2024-09-03

2. Named Entity Recognition for Indic Languages: A Comprehensive Survey;2024 1st International Conference on Trends in Engineering Systems and Technologies (ICTEST);2024-04-11

3. Hybrid Model for Named Entity Recognition;International Journal of Distributed Artificial Intelligence;2022-10-07

4. Named entity recognition using neural language model and CRF for Hindi language;Computer Speech & Language;2022-07

5. Urdu Named Entity Recognition System Using Deep Learning Approaches;The Computer Journal;2022-04-23