Affiliation:
1. Computerized Intelligence Systems Laboratory, Department of Computer Engineering, University of Tabriz, Tabriz, Iran
Abstract
Named entity recognition (NER) is a subfield of natural language processing (NLP). It is able to identify proper nouns, such as person names, locations, and organizations, and has been widely used in various tasks. NER can be practical in extracting information from social media data. However, the unstructured and noisy nature of social media (such as grammatical errors and typos) causes new challenges for NER, especially for low-resource languages such as Persian, and existing NER methods mainly focus on formal texts and English social media. To overcome this challenge, we consider Persian NER as an optimization problem and use the binary Gray Wolf Optimization (GWO) algorithm to segment posts into small possible phrases of named entities. Later, named entities are recognized based on their score. Also, we prove that even human opinion can differ in the NER task and compare our method with other systems with the
dataset and the results show that our proposed system obtains a higher F1 score in comparison with other methods.
Subject
Computer Science Applications,Software
Reference32 articles.
1. Persian Named Entity Recognition based with Local Filters
2. Persian Named Entity Recognition
3. Rule-based named entity recognition in Urdu;K. Riaz
4. A rule-based named entity recognition system for speech input;J. H. Kim
5. Rule-based named entity recognition for Greek financial texts;D. Farmakiotou